Gene GSU0604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU0604 
SymbolthiC-1 
ID2687308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp637928 
End bp639238 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content63% 
IMG OID637125271 
Productthiamine biosynthesis protein ThiC 
Protein accessionNP_951662 
Protein GI39995711 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATGA CGCAACTGGA ATACGCCCGC AACGGAATCA TCACCGACAA GATGAAAGAG 
GCCGCCCTGG CCGAAGGGGT ATCGCCCGAA TTCATCAAGG CGGGCATTGC CGACGGGACC
ATCATCATCT GCCACAACAA CAAGCATCAC AACGGCCGCC CCCTGGCCGT GGGCACAGGG
CTACGGACCA AGGTCAACGC AAACATCGGC ACCTCCGCCG ACGACACGGA TATCACCAAG
GAGCTTGAGA AGGCCCGGGT GGCGGTTCGC CACGGTGCCG ACGCCATCAT GGATCTCTCC
ACCGGCGGAC CGGTGGACGA AATCCGCCGC GCGATCATCG CCGAGACGAA TGCCTGCATC
GGCAGCGTCC CCCTCTACCA GGCGGCCCTT GATGCGGTCA GGACCAAGAA AAAGGCCATC
GTCGACATGA CCGTGGACGA TATCTTCGAG GGGATCATCA AGCATGCCGA GGACGGAGTC
GACTTCATAA CCGTCCACTG CGGCGTGACC CGCGCCACGG TCGAGCGCAT GAAGAACGAG
GGGCGCATCA TGGACGTGGT TTCCCGCGGC GGCGCCTTCA CCGTGGAGTG GATGACCTAC
AACAACGCCG AAAACCCTCT CTTCGAGCAC TTTGACCGGC TGCTCGACAT CGTCAAGGCA
TATGACATGA CCCTGTCGCT GGGGGACGGC TTCCGCCCCG GCTGCCTCGC GGATGCCACT
GACCGGGCCC AGATCCACGA GCTGATCCTT CTGGGCGAGC TGACCCAGCG GGCTCAGGAC
GTCGGGGTTC AGGTCATGAT CGAAGGCCCC GGGCACGTGC CGCTCAACCA GATCGAAGCG
AACATTCTCC TCCAGAAGCG ACTCTGCCAC GGCGCTCCCT TTTATGTCCT CGGCCCGCTG
GTAACCGATA TCGCTCCCGG CTACGACCAC ATCACCTGCG CGATCGGCGG CGCCATCGCC
GCGGCGGCCG GGGCCGACTT CCTCTGCTAC GTGACACCGA GCGAGCACCT GCGACTGCCG
AGCGTGGAAG ACGTCCGCGA AGGGGTCATC GCCTCCCGCA TCGCCGCCCA TGCCGCCGAT
ATAGCCAAAG GGGTAAAGGG GGCCATGGCT AAGGACATCG CCATGGCTAA ATGCCGTAAG
AAGCTGGACT GGGAAGGTCA ATTCGAACAG GCCCTCGACC CGGAAAAAGC CCGGCGCATG
CGCGACGAAT CGGGCGTGGC CGAGCACGGC GCCTGCACCA TGTGTGGCGA GTTCTGTGCT
TACAAGGTGA TGGACGACGC CATGGAAAAG CAGCGGACGG CAACCCTCTA G
 
Protein sequence
MAMTQLEYAR NGIITDKMKE AALAEGVSPE FIKAGIADGT IIICHNNKHH NGRPLAVGTG 
LRTKVNANIG TSADDTDITK ELEKARVAVR HGADAIMDLS TGGPVDEIRR AIIAETNACI
GSVPLYQAAL DAVRTKKKAI VDMTVDDIFE GIIKHAEDGV DFITVHCGVT RATVERMKNE
GRIMDVVSRG GAFTVEWMTY NNAENPLFEH FDRLLDIVKA YDMTLSLGDG FRPGCLADAT
DRAQIHELIL LGELTQRAQD VGVQVMIEGP GHVPLNQIEA NILLQKRLCH GAPFYVLGPL
VTDIAPGYDH ITCAIGGAIA AAAGADFLCY VTPSEHLRLP SVEDVREGVI ASRIAAHAAD
IAKGVKGAMA KDIAMAKCRK KLDWEGQFEQ ALDPEKARRM RDESGVAEHG ACTMCGEFCA
YKVMDDAMEK QRTATL