Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SYO3AOP1_0801 |
Symbol | |
ID | 6332721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfurihydrogenibium sp. YO3AOP1 |
Kingdom | Bacteria |
Replicon accession | NC_010730 |
Strand | - |
Start bp | 842294 |
End bp | 843373 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642657097 |
Product | chorismate mutase |
Protein accession | YP_001930988 |
Protein GI | 188996737 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0077] Prephenate dehydratase [COG1605] Chorismate mutase |
TIGRFAM ID | [TIGR01801] chorismate mutase domain of gram positive AroA protein [TIGR01807] chorismate mutase domain of proteobacterial P-protein, clade 2 [TIGR01808] monofunctional chorismate mutase, high GC gram positive type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 73 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTATC AAGAAAAGCT AAAAGCTTTG CGGCAAGAAA TTGACAGTAT AGACAATCAG ATTTTAGAGC TTATAAATAA AAGAGCAACC TTAGCTAAAG AAGTTGGAGA GATTAAAAAA GCAAACAACC TTCCAATCTT CGTTCCAAGC AGAGAGAAGG AAATTTTTGA TAGATTAGAA AAACTCAACA AAGGACCACT TCCAACGGAT ATTGTAAAGC ATATATTTAG AGAGATAATT TCAGCTTGTA GAAGCATAGA AGAGAATATT AAAGTCGTTT ATCTTGGACC AAAGGCAACG TTTACCCATC AAGCAAGCCT AAAATACTTT GGACATTCTG TAGAGCATAT ACCCGTATCA ACAATAAAAG ATGTATTTGA AGAGATAGTC AAGAAAAAAG CAGACTTTGG CGTAGTTCCG GTAGAGAATA CAATAGAGGG AGTTGTTAAC TATACTCTCG ATATGTTTTT AGAGTATGAT TTAAAAATCA TAGGTGAAGT TATTTTAGAA ATTTCTTTAC ATCTAATGAG TATAAATCCA AACATTAATG AAATTCAAAG AATTTACAGT CATAAATTTG CAATAGCAGA ATGTAGAGAT TGGATACTAA AAAATATGCC ACATGTTCAG CTTATAGAAG TTGAAAGTAC AGCAAAAGCT GCAGAAATGG CAAAAGATGA TTATGAATCT GCAGCAATAG CCAGCGAATC AGCGGCAGAA GTTTATGGAT TATACATTTT AGAAAGAAAG ATAGATAAGC ATCTTTATAA CTATACAAGA TTTTTAATCA TTGGAAATGA AATACCAAGC AAAACAGGAA ATGACAAAAC AACGTTTATT TTTTCTGTAA AAAACGAAGT TGGTGCGTTA TATAAAGCCT TAGAGCCATT TTATAGAAAT GGTATAAATA TGACTAAAAT AGAATCAAGA CCATCTAAAA AAGAGGCTTG GGATTACATT TTCTTTACGG ATATTGAGGG TCATATTGAT GACGAAGTAG TAAAGAATAC TCTTGAAGAA TTAAAATCCA ACGTTCCATT TTTCAAGATT TTAGGCTCAT ATCCTAAGGC GGTTGATTAG
|
Protein sequence | MDYQEKLKAL RQEIDSIDNQ ILELINKRAT LAKEVGEIKK ANNLPIFVPS REKEIFDRLE KLNKGPLPTD IVKHIFREII SACRSIEENI KVVYLGPKAT FTHQASLKYF GHSVEHIPVS TIKDVFEEIV KKKADFGVVP VENTIEGVVN YTLDMFLEYD LKIIGEVILE ISLHLMSINP NINEIQRIYS HKFAIAECRD WILKNMPHVQ LIEVESTAKA AEMAKDDYES AAIASESAAE VYGLYILERK IDKHLYNYTR FLIIGNEIPS KTGNDKTTFI FSVKNEVGAL YKALEPFYRN GINMTKIESR PSKKEAWDYI FFTDIEGHID DEVVKNTLEE LKSNVPFFKI LGSYPKAVD
|
| |