Gene Emin_1333 details


Gene Information

Locus tag: Emin_1333
Symbol: thiH
ID: 6262928
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Elusimicrobium minutum Pei191
Kingdom: Bacteria
Replicon accession: NC_010644
Strand:
Start bp: 1435481
End bp: 1436857
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 41%
IMG OID: 642611813
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001876220
Protein GI: 187251738
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
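
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1435481
END_BP = 1436857
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the 1377 bp span corresponds to 459 codons, that is, 458 residues plus one stop codon.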


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000230637
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.1198e-18
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGATAATAA ACGACGCCGA GCTTTCCGAG CTTATAAAAA ACTCCAAAGC CCCTACAGAA 
AAAGAATTAA ACAAAATCCT TTTAAAAGCA AAAAAATTAA ACGGCCTTAA TAAAGACGAA
GTTTTAAGCC TTCTTAATGT TGAAGACGAA AAACAGCTTG AACAAATATA TAGTACCGCA
AAATTTATCA AAGAGGAAAT TTACGGCAAC CGCATGGTTT TGTTCGCGCC TCTTTATATT
TCAAATTTAT GTTCTAATGA ATGCCTTTAC TGCGCTTTCC GCGTTTCAAA CAAAAGCCTT
GTAAGAAGGG CCCTTCCGCA GGAAGAAATT GAAAAAGAAG TAATTGAACT TTTAAAACAA
GGCCACAAAA GAATACTGCT TGTAGCGGGC GAGTCTTACC CCGGCGGCGG GCTTAAATAT
ATTTTTGATT CCATAGACAC CGTTTACAAA ACAAAATGGA ACGGACAAAA CATAAGAAGG
GTAAACGTTA ACATAGCCCC CCTTACGGAA GAAGAATTTA AAGAATTGTC CAAACACAAT
ATAGGCACGT TCCAATTATT TCAGGAAACG TATCATAAAC CCACTTATTC GGGCCTTCAT
ATAGCGGGGC AGAAAAAGAA TTTTGAATTC CGCCTAAACG CTATGGACCG CGCCCTTAAA
AACGGGATTC ACGACGTGGG CATAGGCATA TTGTTTGGTC TTTATGACTA TAAATTTGAA
GTTATGGCCA TGCTTGAGCA TATATCCCAT TTGGAAAAGA CTTACGGCAT AGGCCCGCAC
ACAATTTCCG TTCCCCGCAT TGAACCGGCG GACGGCTCTG ACCTAAGCCT TAAACCGCCG
TACCAGCTTT CGGATTTGGA GTTTAAAAAA GTGCTGGCTA TATTAAGAAT AGCCGTTCCT
TATACGGGAA TTATTTTAAG CACGCGCGAA AACTCCCAAA TGAGAACGGC GGCCATTGAA
ATGGGCGTGT CACAGATGTC GGCAGGGTCA AAGACCAATC CCGGCGGATA TGAAGAAGGA
TCCGCGGGCG CGCAATTTTC TTTAGGCGAC CACAGAACCT TAGAACAAGT GATTTTAGAC
TTAGTTAAGC ATAACCATGT GCCGTCTTTT TGCACGGGAT GCTATCGTTT AGGCCGCGTG
GGCAAAGATT TTATGGATCT GGCTAAACCC GGGCTTATAA AACACCATTG CCTGCCAAAC
GCCATTTTTA CTTTTGCCGA ATACCTGCAT GACTTCGCGG GTGAGGAACT TAAACAAAAA
GGTTTTGCCT TAATAGAAAA AACCGTTAAT GAGGAAATCA AGGACGAAAA CCTAAAAAAA
CTGGCCCTTA AAAACCTTCA TGACATAAAA AACGGCAAAA GAGATATTTA CTTATAA
 
Protein sequence
MIINDAELSE LIKNSKAPTE KELNKILLKA KKLNGLNKDE VLSLLNVEDE KQLEQIYSTA 
KFIKEEIYGN RMVLFAPLYI SNLCSNECLY CAFRVSNKSL VRRALPQEEI EKEVIELLKQ
GHKRILLVAG ESYPGGGLKY IFDSIDTVYK TKWNGQNIRR VNVNIAPLTE EEFKELSKHN
IGTFQLFQET YHKPTYSGLH IAGQKKNFEF RLNAMDRALK NGIHDVGIGI LFGLYDYKFE
VMAMLEHISH LEKTYGIGPH TISVPRIEPA DGSDLSLKPP YQLSDLEFKK VLAILRIAVP
YTGIILSTRE NSQMRTAAIE MGVSQMSAGS KTNPGGYEEG SAGAQFSLGD HRTLEQVILD
LVKHNHVPSF CTGCYRLGRV GKDFMDLAKP GLIKHHCLPN AIFTFAEYLH DFAGEELKQK
GFALIEKTVN EEIKDENLKK LALKNLHDIK NGKRDIYL
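
The relationship between the gene and protein sequences above can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any tool used to produce this record.

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in a sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGATAATAA ACGACGCCGA GCTTTCCGAG"
print(translate(first_blocks))  # MIINDAELSE
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the protein sequence and the GC content field to be compared in the same way.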