Gene Gmet_3072 details


Gene Information

Locus tag: Gmet_3072
Symbol:
ID: 3740770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 3470702
End bp: 3471910
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 65%
IMG OID: 637780360
Product: threonine dehydratase
Protein accession: YP_386011
Protein GI: 78224264
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
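
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3470702
END_BP = 3471910
GENE_LENGTH_BP = 1209
PROTEIN_LENGTH_AA = 402


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the assumptions stated above.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 1209 bp span corresponds to 403 codons, of which 402 encode amino acids and one is the stop codon.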


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0067161
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCCCT ACACCCTGAT CCAGGAAGCG TCAGACCGCC TCAAAAAGCG GGTCCGGCGC 
ACCGAGCTGA TCCACTCCCA CCACTTCAGC GAGCGGTTCG GCTTCCCCCT TCTCTTCAAG
TGCGAGAACC TCCAGCGGAC CGGCGCCTTC AAGATCCGGG GAGCCCTCAA CTTCATGACC
TCCCAGCCCC GGGAGGCCCT GACAAAGGGG GTCATCACCG CCTCGGCCGG CAACCACGCC
CAGGGGGTCG CCTTTTCCGC CGACCTCCTG GGGGCGCAGG CCACCGTCTT CATGCCCGAG
AGCACCCCGC CCCAGAAGGT GCAGGCCACC AAGGAGTATG GCGCCGACGT AGTCCTCACC
GGCAGGAACT TCGACGAGGC CTATGCCGCC GCAGTCCAGG CCCAGAAGGA GACCGGCGCC
CTCTTCGTGC ACCCCTTCGA CGATCCCCTG GTCATGGCCG GCCAGGGTAC CATCGGCCTC
GAAATCCTCG ATGAACTCCC CGACGTCTCA GCCATCCTGG TCCCCATCGG GGGAGGCGGG
CTCATCGCCG GCATTGCCAC GGCGGTAAAG GAGACTCACC CCCACGTGCG GATCATCGGC
ATCGAATCCA AGGCCGCACC CTCCATGCAC TTCTCCCTCA AGAAGGGAAA GATCGTGGAG
GCCCCCCTCA CGGTTACCCT TGCCGACGGG ATCGCCGTGA AGAAGGTGGG ACGGAACACC
TTCCCCATCA TCCGGGAACT GGTGGACGAC GTGGTGCTCG TGGAGGAAGA GGAGATCGCC
CTGGCCATCG TGGGGCTCCT GGAGCGGACG AAACTCCTCG TGGAAGGGGC TGGAGCCGTG
ACCCTGGCGG CCCTTCTGAA CGGCAAGGCG GGGAAACTGC CGGGGAAGAC GGTCTGCGTC
CTCTCCGGCG GAAACATCGA CGTGAAGACC ATTTCCACCG TGGTGGAGCG GGGCCTTGTG
GCGGCGGGGC GCTATCTGAA GCTGCAGGTG GTGCTGGACG ACATTCCCGG TTCCCTGGCG
GGGCTGGCCA CGGAGGTGGC TGCTGTCCGG GCCAACATCT TCCTCATCAA CCACGAGCGC
CGCTCCCTGG ACCTTCCCCT CGGCAAGACC GAGGTCCTTC TGGAACTGGA GACCCGGGGC
TACGAACACA TCCAGGAGAT CATCGGCCAC CTGGGGCGAC GGGGCTATGA GGTGGATGTC
GTAAAATGA
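
The GC content field in this record can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted in as shown here, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, and the rounding used by the source database is not known.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (block separators and line breaks) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Example on the first block of the gene sequence only.
first_block = "ATGCTCCCCT"
print(f"{gc_content(first_block):.1f}%")
```

Passing the full gene sequence to the function gives a value that can be compared with the listed GC content.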
 
Protein sequence
MLPYTLIQEA SDRLKKRVRR TELIHSHHFS ERFGFPLLFK CENLQRTGAF KIRGALNFMT 
SQPREALTKG VITASAGNHA QGVAFSADLL GAQATVFMPE STPPQKVQAT KEYGADVVLT
GRNFDEAYAA AVQAQKETGA LFVHPFDDPL VMAGQGTIGL EILDELPDVS AILVPIGGGG
LIAGIATAVK ETHPHVRIIG IESKAAPSMH FSLKKGKIVE APLTVTLADG IAVKKVGRNT
FPIIRELVDD VVLVEEEEIA LAIVGLLERT KLLVEGAGAV TLAALLNGKA GKLPGKTVCV
LSGGNIDVKT ISTVVERGLV AAGRYLKLQV VLDDIPGSLA GLATEVAAVR ANIFLINHER
RSLDLPLGKT EVLLELETRG YEHIQEIIGH LGRRGYEVDV VK
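
The protein sequence can be derived from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares with table 1 for internal codons; it does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace in the input is ignored.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence give the first ten residues.
print(translate("ATGCTCCCCT ACACCCTGAT CCAGGAAGCG"))  # MLPYTLIQEA
```

Translating the final nine bases, GTAAAATGA, yields VK followed by the stop codon, which matches the end of the protein sequence listed above.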