Gene Athe_0370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0370 
SymbolthiH 
ID7409300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp423117 
End bp424226 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content38% 
IMG OID643714756 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002572279 
Protein GI222528397 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000304803 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGAGT TTATAAGAAA AGCTGAAAAA GTTTGGGAAG AGTTTAAAGA TTATGTTCCC 
ACTTATGATG AGGTATGTGA AATCTTAGAA AAAGAAGTTG TAAATATTGA AGATGTAGCA
AAACTTTTGA ATGTAGAAGA CAAAAACTCA ATCCTTCTCA TGGCAAGCAA AGCTAAAAAG
CTCACAAGAG AAAACTTTGG CAAGGTCATC CTCTTGTATG CGCCGCTGTA TATCTCAAAC
TACTGTCAAA ACGGATGTGT TTATTGCGGA TTTTCTTGCA GGAAAAATTA TAAAAGAGAA
AAACTTGAGC TTGATGAAAT TGAAAATGAG CTAAGAAGTA TGAAAGAAGA GGGTATCGAC
TCTGTTATAA TCCTCACAGG AGAGGATAGA ATATACTCTC CAGTTGACTA TATTAAACAG
GCCTGCAAAA TAGCAACAGA ATATATGTCA GAAGTTTCGA TTGAGGTTTA TCCTCTTTTT
GAAGAAGAAT ATAGAAGTCT TGCAAACGCT GGTGTTGTGG GAATAACCAT ATATCAGGAG
ACATATCAAA AAGAAGATTA TGAGAAGCTA CATCCTTTTG GACCAAAAAG GGATTTTGAG
TTTAGGCTCA CTGCTGTTGA AAGAGCCCTA TCTGCCGGGT TTCATGAAGC GTGCGTGGGA
CCGCTTTTAG GGCTGTCTCA TCCCAAAAAA GATGTGCTTT GTACTTTGCT TTATGCAGAG
TATCTTCTTG ACAGATTTCC CAAAGCAGAA ATTTCAGTTT CATTCCCGCG CGTAAGATCC
GCAGGCTTAG ATTTTGTTCC AATATTTTCT GTTTCTGACA AGGAATTTAT AAAATTTTTG
ATTGTTGCAA GGATTTATCT TCCAAGAGTT GGAATTGTGA TATCCACAAG AGAAGATGCG
CGCCTTCGTG ATGCACTCAT TGATGTGTGC ATAACAAAGA TGTCGGCAGG TTCTAAAACA
ACTGTCGGCG GATATGCAAC ACAGGAAGAA AAAGATGCCC AGTTTGAGGT TGAAGATAGA
AGAACTGTTG CTGAGGTTGT AGAGAGTATA ATAAAAAAGG GACTGAGACC CGAGTTTACT
AACTGGGTAA GGGGTGTTGG AAGTTTATGA
 
Protein sequence
MTEFIRKAEK VWEEFKDYVP TYDEVCEILE KEVVNIEDVA KLLNVEDKNS ILLMASKAKK 
LTRENFGKVI LLYAPLYISN YCQNGCVYCG FSCRKNYKRE KLELDEIENE LRSMKEEGID
SVIILTGEDR IYSPVDYIKQ ACKIATEYMS EVSIEVYPLF EEEYRSLANA GVVGITIYQE
TYQKEDYEKL HPFGPKRDFE FRLTAVERAL SAGFHEACVG PLLGLSHPKK DVLCTLLYAE
YLLDRFPKAE ISVSFPRVRS AGLDFVPIFS VSDKEFIKFL IVARIYLPRV GIVISTREDA
RLRDALIDVC ITKMSAGSKT TVGGYATQEE KDAQFEVEDR RTVAEVVESI IKKGLRPEFT
NWVRGVGSL