Gene GSU3005 details

Gene Information

Locus tag: GSU3005
Symbol: thiC-2
ID: 2688215
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3297117
End bp: 3298424
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 67%
IMG OID: 637127698
Product: thiamine biosynthesis protein ThiC
Protein accession: NP_954047
Protein GI: 39998096
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
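
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3297117
END_BP = 3298424
GENE_LENGTH_BP = 1308
PROTEIN_LENGTH_AA = 435


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in the record.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```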


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.933742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAACCC AGATCGAACA GGCCCGCGAG GGCATCATTA CCCCTCAGAT GGCGGCCGTG 
GCCGCGGAGG AGCACGTCTC CCCCGAGTAT GTCTGCCGGA TGGTGGCCGA GGGGAAGGTC
GTCATCCCCT GGAACCACGT GCGCGCGCCA AAGGCAGTCG GCATCGGCAA GGGGCTGCGG
ACCAAGGTGA ATGCCTCCAT CGGCACCTCA TCGGACATCG TTGACTACGA GGCCGAGGTG
CGCAAGGCCC GGGCCGCCCA GGAGTCAGGC GCCGACACCC TCATGGAGCT GTCCGTGGGC
GGCGACCTGG ACCGGGTCCG GCGCGAGGTC ATCGCGGCCG TGGACCTGCC GGTGGGAAAC
GTTCCGCTCT ACCAGGCCTT CTGCGAGGCG GCGAGGAAAT ACGGCGACCC CAACCGGCTC
GACCCTGAGA TGCTCTTTGA CCTGATTGAG CGCCAGTGCG CCGACGGCAT GGCCTTCATG
GCGGTCCACT GCGGCATCAA TCTCTACACC ATCGAACGGC TCCGTCGCCA GGGGTACCGC
TACGGCGGCC TCGTCTCAAA GGGAGGGGTG AGCATGGTGG GCTGGATGAT GGCCAACGGC
CGCGAGAATC CCCTCTACGA ACAGTTCGAC CGGGTGGTCG GTATCCTGAA GAAATACGAC
ACGGTCCTTT CCCTGGGCAA CGGCCTGCGG GCCGGCGCCA TCCACGATTC ATCGGACCGG
GCCCAGATCC AGGAGCTGCT GATCAACTGC GAACTGGCGG AGATGGGGCG CGAGATGGGC
TGCCAGATGC TGGTGGAGGG CCCCGGTCAC GTCCCCCTGG ACGAGGTGGA GGGGAACATC
CAGCTCCAGA AGCGGATGAG CGGCGGCGCA CCCTACTATA TGCTCGGGCC CATCTCCACC
GACGTGGCCC CCGGCTTCGA CCACATCACC GCCGCCATTG GCGCGGCCCA GTCGAGCCGC
TTCGGCGCCG ACCTGATCTG CTACATCACC CCGGCCGAGC ACCTGGCCCT CCCCAACGAA
GAGGACGTCC GCCAGGGGGT AAAGGCGGCC CGGGTCGCCG CCTACATCGG CGACATGAAC
AAGTACCCGG AGAAGGGGCG CGAGCGGGAC CGGGAGATGA GCAAGGCCCG TCGCGACCTG
GACTGGCAGA GGCAGTTCGA GCTGGCCCTC TATCCAGAGG ACGCCCGGGC GATCAGGGCC
AGCCGTACTC CCGAGGATGA GGCCACCTGC ACCATGTGCG GCGACTTCTG CGCCTCCCGG
GGGGCCGGCA GGCTGTTTGC CGGCGATCTC AGGGGGGATA AGGTGTAG
 
Protein sequence
MKTQIEQARE GIITPQMAAV AAEEHVSPEY VCRMVAEGKV VIPWNHVRAP KAVGIGKGLR 
TKVNASIGTS SDIVDYEAEV RKARAAQESG ADTLMELSVG GDLDRVRREV IAAVDLPVGN
VPLYQAFCEA ARKYGDPNRL DPEMLFDLIE RQCADGMAFM AVHCGINLYT IERLRRQGYR
YGGLVSKGGV SMVGWMMANG RENPLYEQFD RVVGILKKYD TVLSLGNGLR AGAIHDSSDR
AQIQELLINC ELAEMGREMG CQMLVEGPGH VPLDEVEGNI QLQKRMSGGA PYYMLGPIST
DVAPGFDHIT AAIGAAQSSR FGADLICYIT PAEHLALPNE EDVRQGVKAA RVAAYIGDMN
KYPEKGRERD REMSKARRDL DWQRQFELAL YPEDARAIRA SRTPEDEATC TMCGDFCASR
GAGRLFAGDL RGDKV
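
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG is an accepted start codon and is read as methionine in the first position. The following is a minimal sketch of how the protein sequence could be reproduced from the gene sequence and how the GC content field could be recomputed. All helper names are hypothetical, and the codon table is written out by hand in the usual TCAG ordering rather than taken from a library.

```python
# Codon table in TCAG order; table 11 assigns the same amino acids as the
# standard code and differs only in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_block = "GTGAAAACCC AGATCGAACA GGCCCGCGAG"
print(translate_cds(first_block))  # MKTQIEQARE
```

Passing the full gene sequence to `translate_cds` and `gc_content` in the same way allows the Protein sequence and GC content fields to be checked against the record.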