JavaScript's scopeCompletionSource yields properties that are inaccessable with the `obj.prop` syntax

DGCK81LNN · June 1, 2023, 5:25pm

using these extensions

import { basicSetup } from "codemirror"
import { javascript, javascriptLanguage, scopeCompletionSource } from "@codemirror/lang-javascript"
const extensions = [
  basicSetup,
  javascript(),
  javascriptLanguage.data.of({
    autocomplete: scopeCompletionSource(window),
  }),
]

When I type “location.href.” in the editor, the autocompletion gives

etc (which are string indexes), and if I type “RegExp.” instead it yields

$_
$'
$*
$&
$`
$+

most of which would be invalid if used as property names in the usual obj.prop syntax. They need the bracket syntax like location.href[0] or RegExp["$`"]. Therefore I don’t think these properties should be suggested after typing the dot, only valid identifiers should be given.

DGCK81LNN · June 2, 2023, 9:13am

Perhaps I should post this in the github issues instead ?

marijn · June 2, 2023, 9:45am

This patch should fix that.

DGCK81LNN · June 2, 2023, 3:37pm

Thanks but there may still be a problem. People may name their properties in other languages (I don’t do that my self, though), and often /^[a-zA-Z_$][\w$]*$/ just isn’t enough in that case. However, to accurately detect if a string is a valid identifier requires either a regular expression with the Unicode(u) flag, or a lot of unicode data, it seems.

DGCK81LNN · June 2, 2023, 3:58pm

After consulting the ECMAScript spec, I wrote a (JavaScript) RegExp that matches only valid identifiers, keywords and reserved words:

/^[\p{ID_Start}$_][\p{ID_Continue}$\u200c\u200d]*$/u

This will be much harder to do without the u flag, as ID_Start and ID_Continue are very complicated. They are defined in the Unicode Character Database’s DerivedCoreProperties.txt (Ctrl+F for “Derived Property: ID_Start”)

DGCK81LNN · June 2, 2023, 4:42pm

If it is necessary to work around the complexity, we may as well just match an inaccurate and much wider range of characters, and exclude invalid characters that are in ASCII, such as:

/^[a-zA-Z_$\xaa-\uffdc][$\w\xaa-\uffdc]*$/

(This pattern allows \u{10000}-\u{10ffff} because the regexp matches by UTF-16 due to lack of u flag and the ranges include UTF-16 surrogates.)

marijn · June 5, 2023, 7:19am

\p isn’t universally supported yet, but using a wide range like that seems reasonable.

khedau · November 27, 2023, 9:20am

Hi @marijn
Can you help how we can enable autocomplete when something is added between two curly braces like below example:

const a = {
    b: {
      c: 'good',
    },
  };

  const extensions = useMemo(
    () => [
      javascript(),
      javascriptLanguage.data.of({
        autocomplete: [
          scopeCompletionSource(a),
        ],
      }),
    ],
    [a]
  );
console.log('he is {{ b.c }}')

marijn · November 27, 2023, 9:39am

Please don’t respond to old topics to ask a question that has nothing to do with the topic.