Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA0702 |
Symbol | aroC |
ID | 3104813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | + |
Start bp | 738345 |
End bp | 739445 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637169912 |
Product | chorismate synthase |
Protein accession | YP_113212 |
Protein GI | 53805108 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGGAA ACACCATCGG CAAACTGTTT ACCGTCACGA CCTTCGGCGA AAGCCACGGG CCTGCGCTCG GCTGCATCGT CGACGGCTGC CCGCCGGGAC TTGCGTTGTC CGAGGCCGAT CTGCAGCACG ATCTGTATCG CCGCCGGCCG GGCCAGTCCC GCCACACCAC CCAGCGGCGT GAGTCGGACA CCGTCAAGAT CCTGTCCGGG GTGTTCGAGG GACTCACCAC CGGGACGCCG ATCGGTCTCC TGATCGAGAA CGAGGACCAG CGGTCCAAGG ATTACGCCAG CATCGCCGAC CGCTTCCGCC CCGGCCATGC CGACTACACC TACCACATGA AATACGGCTT CCGCGACTAC CGTGGCGGCG GTCGCTCGTC GGCGCGTGAA ACCGCGATGC GGGTGGCGGC GGGAGGCATC GCCAAGAAAT ACCTGCGTGA GCGGTTGGGT GTCGAAATCC GCGGCTACCT GGCCCAGCTC GGGCCGATCC GGATCGACCC GGTGGACTGG AACGCCATCG ACGACAACCC CTTCTTCTGT CCCGATCCCG CCAGGGTTCC CGAGCTTGAA GCTTACATGG ATGCCCTGCG CAAGGAAGGT GATTCGAGCG GCGCCCGGGT CAACGTGGTG GCCAGGGGCG TGCCGCCGGG CTTGGGCGAG CCGGTCTTCG ACCGGCTCGA CGCCGAGCTG GCGTATGCGC TGATGAGCAT CAACGCCGTC AAGGGTGTGG AAATCGGCGC CGGTTTCGGC TGTGTCGAAG CCAAGGGTTC GGTGTTCCGC GATGAGATGA GTCCGGAAGG TTTCCTGGGG AATTCGGCGG GCGGTATTCT GGGCGGGATA TCCACCGGCC AGGACATCGT TGCCAGCATC GCGCTGAAGC CTACCTCCAG TCTGCGTCTC CCGGGCCGGT CGGTGAACAT CCGCGGGGAA TCGGTGGAAG TCGTGACCAC CGGACGCCAT GATCCCTGTG TCGGCATCCG GGCCACGCCG ATCGCCGAGG CGATGATGGC CATCGTGCTG ATGGATCATT ATCTGCGCCA CCGGGGTCAG AACCAGGACG TCGTGCGCAC GCTCGATCCC ATCCCGCCCA GCGCGTTCTA G
|
Protein sequence | MSGNTIGKLF TVTTFGESHG PALGCIVDGC PPGLALSEAD LQHDLYRRRP GQSRHTTQRR ESDTVKILSG VFEGLTTGTP IGLLIENEDQ RSKDYASIAD RFRPGHADYT YHMKYGFRDY RGGGRSSARE TAMRVAAGGI AKKYLRERLG VEIRGYLAQL GPIRIDPVDW NAIDDNPFFC PDPARVPELE AYMDALRKEG DSSGARVNVV ARGVPPGLGE PVFDRLDAEL AYALMSINAV KGVEIGAGFG CVEAKGSVFR DEMSPEGFLG NSAGGILGGI STGQDIVASI ALKPTSSLRL PGRSVNIRGE SVEVVTTGRH DPCVGIRATP IAEAMMAIVL MDHYLRHRGQ NQDVVRTLDP IPPSAF
|
| |