Gene Gmet_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_0471 
Symbol 
ID3739838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp518299 
End bp519606 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content66% 
IMG OID637777745 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_383439 
Protein GI78221692 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC AGATCGAAAT CGCCCGCGAA GGGACCATTT CTTCCCAGAT GATGGCCGTG 
GCAGCCGAAG AGCACGTTTC CCCCGACTAT GTCCGGCAGA TGGTCGCCGA AGGGAAGATC
GTCATCCCCG GCAACCACAG CCGCAAGCCC CGGGCCGTCG GCATCGGCAA GGGACTGCGC
ACCAAGGTGA ACGCCTCCAT CGGTACCTCC TCCGACATCA TCGACTATGG GGCCGAGGTG
CGGAAGGCCC TGGCCGCCCA GGAGGCGGGG GCCGACACCC TCATGGAGCT CTCCGTGGGG
GGCGATCTGG ACCGGGTGCG GCGGGAGGTG ATCGCCGCCG TGGATCTCCC CGTGGGGAAC
GTCCCCCTCT ACCAGGCCTT CTGCGAGGCG GCCCGCAAGT ACGGTGATCC CAATAAACTG
GATCCGGAGA TGCTCTTCGA CCTGATCGAA AAGCAGTGCG AGGACGGCAT GGCCTTCATG
GCGGTCCACT GCGGCATCAA CCTCTATACC ATCGAGCGCC TGAAGCGGCA GGGGTACCGC
TACGGCGGCC TCGTCTCCAA GGGGGGGGTC AGCATGGTGG CCTGGATGAT GGCCAACAGG
CGGGAAAATC CCCTCTACGA GCAATTCGAC CGGGTAACTT CGATCCTCAG GAAATACGAC
ACGGTCCTCT CGCTGGGGAA CGGGCTGCGG GCTGGCGCCA TCCACGACTC CTCGGACCGG
GCCCAGATCC AGGAGCTCCT CATTAACTGC GAGCTGGCTG AACTGGGGCG CGAGATGGGG
TGCCAGATGC TCGTGGAGGG GCCGGGGCAC GTGCCGCTCG ACGAGGTGGA GGGGAACATC
CAGCTCCAGA AGCGGATGAG CGGCGGCGCC CCCTACTACA TGCTGGGTCC CATCTCCACC
GACGTGGCCC CCGGCTTCGA CCACATCACC GCCGCCATCG GCGCGGCCCA GTCCTCCCGC
TACGGCGCCG ACCTCATCTG CTACATCACT CCGGCCGAGC ACCTGGCCCT CCCCAACGAG
GAGGATGTCC GCCAGGGGGT GAAGGCAGCG AAGATCGCGG CCTACATCGG CGACATGAAC
AAGTACCCGG AACGGGGCCG GGAGCGGGAC AAGGAGATGT CCAAGGCCCG CCGCGACCTG
GATTGGAAGA AGCAGTTCGA GCTGGCCCTC TTCCCGGAGG ATGCCAAAGC CATCCGTGCC
AGCCGCACCC CCGAGGACGA GGCCACCTGC ACCATGTGCG GCGACTTCTG CGCCTCGCGC
GGGGCGGGGA AGCTTTTTGC GGCGGATCTG CGGGGGGATA AGATTTAA
 
Protein sequence
MKTQIEIARE GTISSQMMAV AAEEHVSPDY VRQMVAEGKI VIPGNHSRKP RAVGIGKGLR 
TKVNASIGTS SDIIDYGAEV RKALAAQEAG ADTLMELSVG GDLDRVRREV IAAVDLPVGN
VPLYQAFCEA ARKYGDPNKL DPEMLFDLIE KQCEDGMAFM AVHCGINLYT IERLKRQGYR
YGGLVSKGGV SMVAWMMANR RENPLYEQFD RVTSILRKYD TVLSLGNGLR AGAIHDSSDR
AQIQELLINC ELAELGREMG CQMLVEGPGH VPLDEVEGNI QLQKRMSGGA PYYMLGPIST
DVAPGFDHIT AAIGAAQSSR YGADLICYIT PAEHLALPNE EDVRQGVKAA KIAAYIGDMN
KYPERGRERD KEMSKARRDL DWKKQFELAL FPEDAKAIRA SRTPEDEATC TMCGDFCASR
GAGKLFAADL RGDKI