Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3438 |
Symbol | |
ID | 8138805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3972766 |
End bp | 3973848 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871054 |
Product | chorismate synthase |
Protein accession | YP_003023219 |
Protein GI | 253702030 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0000000199375 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCTCTT CCTTCGGCAC CCTATTCAGG GTATCCACCT TTGGCGAAAG CCACTGCGCC GCGGTCGGCG CCGTGGTCGA CGGCGTCCCG GCGGGGATGC AACTGCGGGA GAGCGACATC CAGTCGCAGT TGCATCGGCG CCGCCCGGGG CAGAGCGAGC TTTCCACCCC GCGTGAGGAA AAGGACCTGG TGAGCATCCT CTCCGGCGTG GAGCTGGGGC TCACCCTCGG CACCCCCATC TGCCTCATGG TGCATAACCG CGACCAGCGC CCCGGCGACT ACCGGGAGAT GGAGGCGGTT CCCCGCCCCT CTCATGCTGA CTACAGCTAC CAGGTGAAAT ACGGCATCCG TGCCTCAAGC GGCGGCGGGC GCTCCTCAGC CCGGGAAACC ATAGGGCGGG TGGCGGCCGG CGCCATAGCG GAAAAGTACC TGGCGGAGCG TTTCGGATTG GAGATAGTGG CCTGGGTGGA CGGGGTAGGG GAGTTGGAGG GGGGCACGGT CGATCTGGAG GCGATCACCC GGGAACAGGT TGATGCCACA GCCGTCCGCT GCCCCAACCC GGAGGCTGCA GCGGCGATGA TGCAGCTGAT CGGATCGGTG CGGGACCGCA AAGACTCGGT GGGCGGGGTG GTTCGCTGTG TCTGCCGGAA CCTCCCCGCA GGGTTGGGGG AGCCGGTGTT CGACAAGCTG GACGCCCTCC TCGCCCATGC CATGCTCTCG CTTCCGGCAG CCAAGGGTTT CGAGGTCGGC TCCGGGTTCG TTGGGAGCCG GATGCTGGGG AGCGCCCATA ATGACCTGTT CGTCCAGAAA GAGGGACGTC TGGGTACCCA GACCAACTTT TCGGGAGGGG TGCAGGGTGG GATTTCCAAC GGGGAGCCGG TTCATTTCCG GGTTGCCTTC AAGCCGCCGG CGACCGTTTC CCTGCCCCAG AAAACGGCCG CTTTCGACGG CACCGAGACA GTGCTGGAAG CGAAGGGGCG CCATGACCCC TGCATCGTTC CCAGGGCGGT CCCTATCGTG GAATCCATGG TGGCGCTGGT CCTGATGGAT CTGGTGTTGA GGCAGGAATA CCGCAGGAGG TAG
|
Protein sequence | MSSSFGTLFR VSTFGESHCA AVGAVVDGVP AGMQLRESDI QSQLHRRRPG QSELSTPREE KDLVSILSGV ELGLTLGTPI CLMVHNRDQR PGDYREMEAV PRPSHADYSY QVKYGIRASS GGGRSSARET IGRVAAGAIA EKYLAERFGL EIVAWVDGVG ELEGGTVDLE AITREQVDAT AVRCPNPEAA AAMMQLIGSV RDRKDSVGGV VRCVCRNLPA GLGEPVFDKL DALLAHAMLS LPAAKGFEVG SGFVGSRMLG SAHNDLFVQK EGRLGTQTNF SGGVQGGISN GEPVHFRVAF KPPATVSLPQ KTAAFDGTET VLEAKGRHDP CIVPRAVPIV ESMVALVLMD LVLRQEYRRR
|
| |