Gene Clim_0664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0664 
SymbolthiH 
ID6354278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp737153 
End bp738220 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content58% 
IMG OID642668291 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_001942726 
Protein GI189346197 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.428424 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTGC TACCTGGATG GCTGAGCGCC GGACATGATG AAACCCGCTT CAGGGCCATG 
CTTTCGCCGG ATTCGCCATA CACCCTCGAA CAGCTTGCGG GAGAGTCGAA AGCCATAACC
CTCCGCAGAT TCGGACGCAC CATGTCGCTC TACGCGCCGC TCTACCTTTC GAATCACTGT
TCGAGCGGCT GTGCCTACTG CGGCTTCGCC TCCGACAGAA AAACCCCTCG ACGCCGCCTC
GAACAGGAAG AAATCCGAAA AGAACTCGGA GCCATGAAAC GCCTCGGCAT CAGCGATGTT
CTGCTCCTGA CCGGAGAACG TACCAAAGCT GCGGATTTCG ACTACCTCAG AACCTCCGTC
GCCACTGCCG CAGCCGACAT GCAGCGTGTT ACGGTAGAAG CCTTTCCCAT GAGCGTCGCC
GAATACCGTG ACCTCGCTGA AGCGGGCTGT ACCGGCATAA CGATCTACCA GGAGACTTAC
GATCCCCTGC GCTACGCCGA ACTCCACCGC TGGGGTCCGA AAAAAGATTT CAGGGAACGG
CTTGAAACGC CATCGAGAGC CCTTGAAGGA GGCATCAAAA CCGTAGGCCT CGGCGTCCTG
CTCGGACTTG CCGACCCGCA GGAGGACGCG CTCATGCTCT ACCGTCATCT TCGTTATCTC
GGAAAAACCT ACTGGCGAGC CGGACTTTCG GTATCATTTC CGCGTATACG GCCGCAGACC
GGCAGTTACG AGCCGCCCTT CCCGGTAAGC GATCACCTGC TGGCACGCAT GATCTTTGCC
TTCCGCATAG CCCTTCCCGA TGTGGAACTT GTGCTCTCCA CCCGTGAAAG CCCGGCTTTC
CGCGACGGCA TGGCCGGCAT CGGCGTCACC CGAATGAGTA TCGCAAGCCG CACGACGGTT
GGCGGATACC TCGATGCAGA ATCCAGCGAC CGGGGACAGT TCGATGTCTT CGACGACCGC
ACGGCAGAAG CGTTCTGCAG CGCTCTGCGC GAAAAAAACA TCGAACCGGT TTTCAAGAAC
TGGGAACACG CCTATAACGG TCCGTCACAT CCGGAAGCGG AAAAATAA
 
Protein sequence
MNLLPGWLSA GHDETRFRAM LSPDSPYTLE QLAGESKAIT LRRFGRTMSL YAPLYLSNHC 
SSGCAYCGFA SDRKTPRRRL EQEEIRKELG AMKRLGISDV LLLTGERTKA ADFDYLRTSV
ATAAADMQRV TVEAFPMSVA EYRDLAEAGC TGITIYQETY DPLRYAELHR WGPKKDFRER
LETPSRALEG GIKTVGLGVL LGLADPQEDA LMLYRHLRYL GKTYWRAGLS VSFPRIRPQT
GSYEPPFPVS DHLLARMIFA FRIALPDVEL VLSTRESPAF RDGMAGIGVT RMSIASRTTV
GGYLDAESSD RGQFDVFDDR TAEAFCSALR EKNIEPVFKN WEHAYNGPSH PEAEK