Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | MCA1415 |
Symbol | aroA |
ID | 3103431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1501120 |
End bp | 1502388 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637170590 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_113872 |
Protein GI | 53804255 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.586582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGCG ACATCCGGGT ACCGGGCGAC AAGTCCATCT CCCACCGGTC GGTGATGCTG GGCTCGCTCG CCGAGGGCGT GACTGAGGTG AGTGGCTTCC TCCAGGCTGA GGACTGTTTG GCGACCATGG CGGCGTTCCG GGCCATGGGC GTCGAAATCG AAGGCCCGAC GGAGGGCCGG CTGCGGATCC ACGGCGTCGG CCTGCACGGC CTGAAGCCAC CTGCCGCCCC CCTGGATCTC GGCAATTCCG GCACCTCCAT GCGGCTATTG AGCGGACTGT TGGCGGGACA GGCATTCGAC ACCACGCTGA CCGGCGATGC CTCCCTGGTG CGCCGGCCGA TGCGGCGGGT GACCGAACCG CTGCGCGCCA TGGGCGCGCG GATCGACACC ACCGAAGCCG GCACCGCGCC ACTGCGCATC GCCGGCGGAA GCCGCCTCAA AGGGATCGAC TATGCGATGC CGGTCGCCAG CGCCCAGGTG AAATCCTGTC TGCTGCTGGC GGGCCTCTAC GCGGAAGGGA AGACCTGTGT CACCGAGCCG GCGCCGACCC GCGACCACAC CGAACGCATG CTGGCGGGTT TCGGCTATCC GGTGGCGCGA GATGGCAACC GTGTATGCAT CCAATCCGGC GGCAAGCTTT CCGCGACCCG TATCGACGTA CCGGCGGACA TTTCCTCGGC GGCGTTCTTC ATGATAGGCG CAGCGATCAG CCCTGGGTCC GACGTGTTCC TCCGCCATGT CGGGATCAAT CCGACCCGGA CCGGCGTCAT CGAAATCCTG CGCGAAATGG GCGCCGACAT CGAGATACTC GCTCCGCGCG AAGTCGGCGG TGAACCGGTG GCGGACCTCC GCATCCGTTA CCGGGAACTG CGCGGCATCC GCATTCCCGA ACATACCGTG CCGCTGGCCA TTGACGAATT CCCGGCCCTG TTCATCGCCG CAGCCTGCGC CACAGGCGAA ACGGTGCTGA CCGGGGCCGA GGAGCTGCGA GTCAAGGAAA GCGACCGTAT CCAGGCCATG GCCGACGGCC TGACCACGCT GGGCATCGAT GCCCGCCCGA CCCCCGATGG CATGGTCATC CGGGGCGGGA GTTTCCGCGG CGGCGCAGTC GATTCGCGCG GCGATCATCG CATCGCCATG TCATTCTCGA TCGCGGCATT GCGCGCTCCC ATCCCCATCG AGATTCACGA CTGCGCCAAC GTGGCGACAT CTTTTCCCAA TTTCGTCGAA CTGGCGCGGA CCCTGGGTTT GGACATCGAG GTCAGCTGA
|
Protein sequence | MQGDIRVPGD KSISHRSVML GSLAEGVTEV SGFLQAEDCL ATMAAFRAMG VEIEGPTEGR LRIHGVGLHG LKPPAAPLDL GNSGTSMRLL SGLLAGQAFD TTLTGDASLV RRPMRRVTEP LRAMGARIDT TEAGTAPLRI AGGSRLKGID YAMPVASAQV KSCLLLAGLY AEGKTCVTEP APTRDHTERM LAGFGYPVAR DGNRVCIQSG GKLSATRIDV PADISSAAFF MIGAAISPGS DVFLRHVGIN PTRTGVIEIL REMGADIEIL APREVGGEPV ADLRIRYREL RGIRIPEHTV PLAIDEFPAL FIAAACATGE TVLTGAEELR VKESDRIQAM ADGLTTLGID ARPTPDGMVI RGGSFRGGAV DSRGDHRIAM SFSIAALRAP IPIEIHDCAN VATSFPNFVE LARTLGLDIE VS
|
| |