Gene Plut_1566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_1566 
SymbolthiH 
ID3745413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp1755815 
End bp1756906 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content61% 
IMG OID637769599 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_375463 
Protein GI78187420 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0773107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAAG TCCCCGCATG GCTGGTGGAT GAAGGGAGTA CTGCGGAAAT GCGCCGGATG 
CTCTCTTCCG ATTCCCCGGT CGACATTGAA ACCCTCGCCG CCCGTGCCCG CGCCATCACC
CTGCGCCGAT TCGGACGAAC CATATCGCTC TATGCCCCGC TCTACCTGTC GAACCACTGC
CCGAGCGGTT GCGCATACTG CGGGTTCGCA TCCGACAGGA CCACCCTGAG ACGGCGGCTT
GAAGAGGATG AGATCAGAAG GGAGATTGCC GCCATGAAAA AGCTTGGCAT CCGGGACATC
CTCCTCCTCA CCGGCGAACG GACTGCGGTG GCCGGGTTCG ACTACCTGCG CCGTGCCGTG
GAGATCGCCG CAGAGGAAAT GCCGCGCGTG TCGGTGGAAA CCTTCCCGAT GAGCGTAGAA
GAATACAGGG AACTTGCCAG ATGCGGCTGC ACCGGCGTCA CCATCTACCA GGAGACCTAT
GACCGGGGGC GTTACGAAGA GCTCCACCGA TGGGGTCCCA AGAAAGATTT TCTCCACCGC
CTCGAAACCC CTGAACGGGC GCTGGAAGGC GGCATCAAAA CCGTCGGTAT CGGAGCCCTG
CTCGGGCTCT CCGAACCCGT CGAGGAAGCG CTCCGGCTCT ACCGCCATGC GCGCCATCTT
GCCAAAACCT GGTGGCGTGC AGGCATTTCG GCCTCATTCC CGCGCATGCG CCCTGAACAG
GGCGGCTGGC AGCCCCCATT CAATGTAAGC GACCATCAGC TCGCCCGTAT GATTCTGGCT
TTCCGCATCG GTCTTCCAGA CATGGATCTT GCGCTCTCGA CCCGCGAACG GGCATCATTC
CGCGACGGCA TGGCCGGACT CGGCGTAACG CGCATGAGCA TCGCCAGCAA AACAACTGTC
GGCGGATACG ATGAGGGGGA AACCGGCGAG CGGGGACAGT TTGACATTTC CGACGAGCGG
AGCGCCGGGG AGTTCTGTCA GGCACTGCGA AATCGGGGAA TTGAACCGGT CTTCAAGAAC
TGGGACGGGG CATACAACGG ACCGGCAACA CAAATCATCC CTACCGGAGG GCTTAAGGAA
ACCATCCCAT GA
 
Protein sequence
MKEVPAWLVD EGSTAEMRRM LSSDSPVDIE TLAARARAIT LRRFGRTISL YAPLYLSNHC 
PSGCAYCGFA SDRTTLRRRL EEDEIRREIA AMKKLGIRDI LLLTGERTAV AGFDYLRRAV
EIAAEEMPRV SVETFPMSVE EYRELARCGC TGVTIYQETY DRGRYEELHR WGPKKDFLHR
LETPERALEG GIKTVGIGAL LGLSEPVEEA LRLYRHARHL AKTWWRAGIS ASFPRMRPEQ
GGWQPPFNVS DHQLARMILA FRIGLPDMDL ALSTRERASF RDGMAGLGVT RMSIASKTTV
GGYDEGETGE RGQFDISDER SAGEFCQALR NRGIEPVFKN WDGAYNGPAT QIIPTGGLKE
TIP