Gene GSU3194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3194 
SymbolthiL 
ID2687582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3502085 
End bp3503071 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content66% 
IMG OID637127887 
Productthiamine monophosphate kinase 
Protein accessionNP_954235 
Protein GI39998284 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGACTCG GCGAGATCGG TGAATTCGGC CTTATCGACA GGATTGCCGG CAAAGTAGCC 
GCCGGTGCCG GGGTTCGCCT CGGCATCGGC GACGACGCTG CCGTTACCGA AACGGAAGCG
GGGCGCCTCC TGCTGTCCAC CGCCGACATG CTCGTTGAAG GCATCCACTT TGACCTCTCC
TTCACCGACC CCTTCAGGCT CGGCCGCAAA TCCCTGGCGG TCAACGTCTC CGACATTGCC
GCCATGGGGG GACACCCTCG CCATGCGCTG CTCTGTCTTG CCATTCCAAC CGATCTGCCG
GTTGAATTTC TCGACCGGTT CGCCGACGGC GTCATCTCCC TGGCAGAGGA ATTCGGTGTC
ACCCTCATCG GCGGCGATAC CTGCCGCTCC TCCTCCGGGC TTGTCATCTC CATCACCCTC
CACGGCGAAC AGGTCCCCAC ACGGATCATC CCGCGCAACG GGGCTCGGCC GGGCGACGAC
GTGTTCGTCA CCGGTACCGT CGGAGATTCA GCCCTGGGGC TCGAACTGCT GCGCAGAGGC
GAACGCTCCG GACATGCCGT CGAACGCCAC CTCAACCCCT CGCCCCGCGT CTCCGCCGGC
CTGAGCCTGG CCGAATCGGG CATGGCCTCG GCCATGATCG ACGTGAGCGA CGGTGTCCTT
GCCGACCTGG GGCACATCCT GACCGGCTCG GGGGTCGGCG CCCGCATCGA CGCATCCCTC
ATCCCCCTTT CTCCCTACTT CAGCCAACGG GCGCCCGACG TGGCACCTGA CCCCCTCTCT
CTGGCACTGG CGGGAGGCGA GGACTACGAA CTACTCTTTA CTGCGGCGCC GGGACGGACG
GCCGAGGTGG AAACGCTGCT GGCGGCGTGC GGCGTTACAG CGACCCGGAT CGGTTCCATC
GTTGCCGGGT CGGACGTCAC GGTCACCGCA GCAGACGGGA CCCTTATCCC TCCGAGACGC
CGCGGCTTCA ACCATTTCGC GCCGTAA
 
Protein sequence
MRLGEIGEFG LIDRIAGKVA AGAGVRLGIG DDAAVTETEA GRLLLSTADM LVEGIHFDLS 
FTDPFRLGRK SLAVNVSDIA AMGGHPRHAL LCLAIPTDLP VEFLDRFADG VISLAEEFGV
TLIGGDTCRS SSGLVISITL HGEQVPTRII PRNGARPGDD VFVTGTVGDS ALGLELLRRG
ERSGHAVERH LNPSPRVSAG LSLAESGMAS AMIDVSDGVL ADLGHILTGS GVGARIDASL
IPLSPYFSQR APDVAPDPLS LALAGGEDYE LLFTAAPGRT AEVETLLAAC GVTATRIGSI
VAGSDVTVTA ADGTLIPPRR RGFNHFAP