Gene GM21_3873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3873 
Symbol 
ID8139247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4459301 
End bp4460743 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content64% 
IMG OID644871490 
Productglyceraldehyde-3-phosphate dehydrogenase 
Protein accessionYP_003023648 
Protein GI253702459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase 
TIGRFAM ID[TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATGA AGAAACAGGA AGCGTATTTG AAGGAGTGGC AGGGGCACGA GGAGCTCGCG 
GAACAGATGC TCCCCATCAT CGGTCGCTTG TACCGTGACC ACAACATCGT GACCACCGTG
TACGGCAGGT CGCTGGTCAA CAGCCCGACC ATCGAGATCC TCAAGGCGCA CCGGTTCGCC
CGTCTGATCC TCGACGGCGA ACTGACCGTC CAGGACACTT TCCCCATTCT CGAAGCCATC
GGCAAGATGG ACCTCGCTCC GGCCCGCATC GACCTGGGAA GGTTGGCGGT CCGCTACCAG
TCGCAGCAGG GAAGCTCGGT CGCCGACTTT GTGAGCCGCG AACTCGCCTC CGTCAACACC
GGCCGCACGC CGCTTCTGGA CGAGCCGCAG GACATCGTGC TGTACGGCTT CGGCCGCATC
GGTCGCCTGG TGGCCCGCAT CCTGGTCGAG AAGTCCGGCT CCGGCGAGAA GCTCAGGCTG
CGCGCGGCGG TGGTCCGCAA GGGCGGCCCG GACGACCTGG TGAAAAGGGC GAGCCTTTTG
CGCCGCGACT CGGTGCACGG ACCCTTCAAC GGGATCATCA CCATCGACGA GGAAGAAAAC
GCGATCATCG CCAACGGCAA CATGATCCGC ATCATCTACG CCGACGCGCC GGAGAACGTG
GACTACGCGC AGTACGGCAT CCGTAACGCG ATCGTGATCG ACAACACCGG CAAGTGGCGC
GACCGCGAAG GGCTTGGGCG TCACCTGAAG GCATCGGGAG TGAGCCAGGT CGTGCTCACC
GCCCCGGGCA AAGGGGACAT CCCCAACGTC GTCTTCGGCG TCAACAACGA ACTCATCGCC
TCCACCGAGA GCATCTTCTC CGCGGCGAGC TGCACCACCA ACGCCATCGT GCCGGTTTTG
AAGGCGGTGA GCGACAACTT CGGTATCGTG AGCGGCCACG TGGAAACCTG CCACTCCTAC
ACGAACGACC AGAACCTGAT CGACAACTAC CACAAGGCGG ACCGCCGTGG GCGGAGCGCC
CCGTTGAACA TGGTCATCAC CGAGACCGGC GCCGCCAAGG CCGTCGCCAA GGTGCTTCCG
GAGCTGACCG GAAAGCTGAC CGGCAACGCC ATCCGTGTAC CGACACCGAA CGTCTCGCTG
GCGATCCTGA ACCTGCAGCT CAAGTCGGAG ACCGACGTCG CGACGCTGAA CGGCTACCTG
CGCGCCATGT CGCTCGACTC GCCGCTGCAG AACCAGATCG ACTACACCAA CTCCCCGGAC
GTGGTCTCCA GCGACATGGT CGGTTCGCGC CACGCCGGCG TGGTCGACTC TCTCGCCACC
ATCGTTCAGG GCAACCGTTG CGTCCTTTAC GTCTGGTACG ACAACGAGTT CGGCTACAGC
TGCCAGGTAG TGCGCATGGT GCAGAAGATG GCAGGCCTGG AACTCCCGAT GCTGCCGGCG
TAA
 
Protein sequence
MIMKKQEAYL KEWQGHEELA EQMLPIIGRL YRDHNIVTTV YGRSLVNSPT IEILKAHRFA 
RLILDGELTV QDTFPILEAI GKMDLAPARI DLGRLAVRYQ SQQGSSVADF VSRELASVNT
GRTPLLDEPQ DIVLYGFGRI GRLVARILVE KSGSGEKLRL RAAVVRKGGP DDLVKRASLL
RRDSVHGPFN GIITIDEEEN AIIANGNMIR IIYADAPENV DYAQYGIRNA IVIDNTGKWR
DREGLGRHLK ASGVSQVVLT APGKGDIPNV VFGVNNELIA STESIFSAAS CTTNAIVPVL
KAVSDNFGIV SGHVETCHSY TNDQNLIDNY HKADRRGRSA PLNMVITETG AAKAVAKVLP
ELTGKLTGNA IRVPTPNVSL AILNLQLKSE TDVATLNGYL RAMSLDSPLQ NQIDYTNSPD
VVSSDMVGSR HAGVVDSLAT IVQGNRCVLY VWYDNEFGYS CQVVRMVQKM AGLELPMLPA