Gene GM21_3438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3438 
Symbol 
ID8138805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3972766 
End bp3973848 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content66% 
IMG OID644871054 
Productchorismate synthase 
Protein accessionYP_003023219 
Protein GI253702030 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0000000199375 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCTCTT CCTTCGGCAC CCTATTCAGG GTATCCACCT TTGGCGAAAG CCACTGCGCC 
GCGGTCGGCG CCGTGGTCGA CGGCGTCCCG GCGGGGATGC AACTGCGGGA GAGCGACATC
CAGTCGCAGT TGCATCGGCG CCGCCCGGGG CAGAGCGAGC TTTCCACCCC GCGTGAGGAA
AAGGACCTGG TGAGCATCCT CTCCGGCGTG GAGCTGGGGC TCACCCTCGG CACCCCCATC
TGCCTCATGG TGCATAACCG CGACCAGCGC CCCGGCGACT ACCGGGAGAT GGAGGCGGTT
CCCCGCCCCT CTCATGCTGA CTACAGCTAC CAGGTGAAAT ACGGCATCCG TGCCTCAAGC
GGCGGCGGGC GCTCCTCAGC CCGGGAAACC ATAGGGCGGG TGGCGGCCGG CGCCATAGCG
GAAAAGTACC TGGCGGAGCG TTTCGGATTG GAGATAGTGG CCTGGGTGGA CGGGGTAGGG
GAGTTGGAGG GGGGCACGGT CGATCTGGAG GCGATCACCC GGGAACAGGT TGATGCCACA
GCCGTCCGCT GCCCCAACCC GGAGGCTGCA GCGGCGATGA TGCAGCTGAT CGGATCGGTG
CGGGACCGCA AAGACTCGGT GGGCGGGGTG GTTCGCTGTG TCTGCCGGAA CCTCCCCGCA
GGGTTGGGGG AGCCGGTGTT CGACAAGCTG GACGCCCTCC TCGCCCATGC CATGCTCTCG
CTTCCGGCAG CCAAGGGTTT CGAGGTCGGC TCCGGGTTCG TTGGGAGCCG GATGCTGGGG
AGCGCCCATA ATGACCTGTT CGTCCAGAAA GAGGGACGTC TGGGTACCCA GACCAACTTT
TCGGGAGGGG TGCAGGGTGG GATTTCCAAC GGGGAGCCGG TTCATTTCCG GGTTGCCTTC
AAGCCGCCGG CGACCGTTTC CCTGCCCCAG AAAACGGCCG CTTTCGACGG CACCGAGACA
GTGCTGGAAG CGAAGGGGCG CCATGACCCC TGCATCGTTC CCAGGGCGGT CCCTATCGTG
GAATCCATGG TGGCGCTGGT CCTGATGGAT CTGGTGTTGA GGCAGGAATA CCGCAGGAGG
TAG
 
Protein sequence
MSSSFGTLFR VSTFGESHCA AVGAVVDGVP AGMQLRESDI QSQLHRRRPG QSELSTPREE 
KDLVSILSGV ELGLTLGTPI CLMVHNRDQR PGDYREMEAV PRPSHADYSY QVKYGIRASS
GGGRSSARET IGRVAAGAIA EKYLAERFGL EIVAWVDGVG ELEGGTVDLE AITREQVDAT
AVRCPNPEAA AAMMQLIGSV RDRKDSVGGV VRCVCRNLPA GLGEPVFDKL DALLAHAMLS
LPAAKGFEVG SGFVGSRMLG SAHNDLFVQK EGRLGTQTNF SGGVQGGISN GEPVHFRVAF
KPPATVSLPQ KTAAFDGTET VLEAKGRHDP CIVPRAVPIV ESMVALVLMD LVLRQEYRRR