Gene GM21_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3893 
Symbol 
ID8139267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4479605 
End bp4480912 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content65% 
IMG OID644871510 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_003023668 
Protein GI253702479 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.377066 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAC AGATAGAATC GGCGCGCGCC GGTGTTACCA CCCCGCAAAT GACGCAGGTA 
GCTTTCGACG AAGATCTGAC GCCGGAGTAC GTCCGGGAGA AGGTGGCGCA GGGGCAGATA
GTGATCCCCT GGAACCATGT CAGGAAACCG AAGGTGGCCG GGATCGGCGC GGGGCTCAGG
ACCAAGGTGA ACGCCTCCAT CGGGACCTCC TCCGACATCG TCGACTACGA GGCCGAGGTG
AAGAAGGGGC TCGCGGCGCA GGAGTCCGGG GCAGACACCC TTATGGAACT ATCCGTAGGA
GGGGACCTGG ACCGGGTGCG GCGCGAGGTG ATCGCGGCGG TGGATCTTCC CGTCGGCAAC
GTCCCCCTCT ACCAGGCCTT TTGCGAGGCA GCCAGGAAGT ACGGCGACCC CAATAAACTC
GACGACGAGA TGCTCTTCGA CATCATCGAA AGGCAGTGCG CCGACGGCAT GGCCTTCATG
GCGGTTCACT GCGGCATCAA CCTCTACACC CTGGAGCGCC TGAGGAAGCA GGGTTACCGC
TACGGCGGCC TGGTCTCCAA GGGTGGGGTG AGCATGGCGG CCTGGATGAT CGCCAACAAC
CGCGAGAACC CGCTCTACGA GAAGTTCGAC CGGGTGGTGG AGATCCTCAA GCGCTACGAC
ACCGTGCTGT CGCTGGGCAA CGGCTTGAGG GCCGGCGCCA TCCACGACTC CAGCGACCGG
GCGCAGATCC AGGAGCTTTT GATCAACTGC GAGCTGGCGG AACTCGGGCG CGACATGGGG
TGCCAGATGC TGGTGGAGGG GCCGGGACAC GTGCCTTTGG ACGAGATCGA GGGAAACATC
AAGCTGCAGA AGCGGATGAG CGGCGGTGCC CCCTACTACA TGCTCGGCCC CATCGCCACC
GACGTCGCCC CGGGCTTCGA CCATATCACC GCGGCCATAG GCGCGGCCCA GTCTTCGCGC
TACGGCGCCG ACCTCATCTG CTACATCACC CCCGCCGAGC ACCTGGCGCT TCCCAACGAG
GAGGACGTCC GCATGGGGGT GAAGGCCGCG AAGATCGCCG CCTACATAGG CGACATGAAC
AAGTACCCCG AGAAGGGTAG GGAGCGCGAC AAGGAGATGA GCAAGGCCAG AAGGGACCTC
GACTGGCAGC GCCAGTTCGA GCTCGCCCTG TTCCCCGAGG ACGCCGCCGC CATCAGGAAG
AGCCGCGTAC CCGAGGACGA GCAGACCTGC ACCATGTGCG GCGACTTCTG CGCCTCCCGC
GGCGCCGGCA AGATCTTCGC CGGCGACCTG AGGGGTGACA AGATCTAA
 
Protein sequence
MMTQIESARA GVTTPQMTQV AFDEDLTPEY VREKVAQGQI VIPWNHVRKP KVAGIGAGLR 
TKVNASIGTS SDIVDYEAEV KKGLAAQESG ADTLMELSVG GDLDRVRREV IAAVDLPVGN
VPLYQAFCEA ARKYGDPNKL DDEMLFDIIE RQCADGMAFM AVHCGINLYT LERLRKQGYR
YGGLVSKGGV SMAAWMIANN RENPLYEKFD RVVEILKRYD TVLSLGNGLR AGAIHDSSDR
AQIQELLINC ELAELGRDMG CQMLVEGPGH VPLDEIEGNI KLQKRMSGGA PYYMLGPIAT
DVAPGFDHIT AAIGAAQSSR YGADLICYIT PAEHLALPNE EDVRMGVKAA KIAAYIGDMN
KYPEKGRERD KEMSKARRDL DWQRQFELAL FPEDAAAIRK SRVPEDEQTC TMCGDFCASR
GAGKIFAGDL RGDKI