Gene BCI_0046 details

Gene Information

Locus tag: BCI_0046
Symbol: thiH
ID: 4056749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Baumannia cicadellinicola str. Hc (Homalodisca coagulata)
Kingdom: Bacteria
Replicon accession: NC_007984
Strand:
Start bp: 55603
End bp: 56751
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 36%
IMG OID: 637981405
Product: thiH protein
Protein accession: YP_588524
Protein GI: 94676692
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
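
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch spells out. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 55603
END_BP = 56751
GENE_LENGTH_BP = 1149
PROTEIN_LENGTH_AA = 382


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1149
    print(expected_protein_length(GENE_LENGTH_BP))  # 382
```

Both results agree with the Gene Length and Protein Length fields listed above.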


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.081488
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAACTT TTAGTCAGTG TTGGCAAAAA ATAGATTGGA ATACAGCTAG TTTACGTATT 
CATAGTAAGA GTGAGCAAGA CGTAGAACGA GCCCTTAATA CTAATAGTCC TAGTACAGAA
GATATGATGG CGTTGCTATC GCCATCAGCT AAAAATTATC TAGAACCACT AGCACAGCGG
GCGCGTTATT TAACGCGTCA GCGTTTCGGT AATACTGTTA ACTTTTTTGT ACCACTTTAT
CTATCGAATT TTTGTACCAA TGAATGTAGT TATTGTGGTT TCTCGATTAG TAATCGTATA
CAACGTAAAA TTCTTGATGA GCAAGAAATT ATACAAGAAT GCGAAGTAAT TAGTGCCCAA
AATGTTGATA ATATACTTCT CGTCACTGGA GAACATAAAC ATAAAGTAGG TATAGAATAT
TTCCGTCGTT ACGTACCAAA AGTACGTAAA TATTTTACGT ACATTATGAT GGAAGTACAG
CCTCTTTTAT CACAAGAGTA TGCTGAATTA AAGACACTAG GGTTAAATAG TATTTTAGTT
TACCAAGAAA CTTATCATTT ACCAACCTAT CAGATTCATC ATCTACGTGG TAAAAAACGT
GATTTTTTTT GGCGGTTAGA AACTCCTGAT CGCATAGCTA GTGTTGGTAT AGATAAAATT
GGTTTAGGAG TATTAATAGG ACTCTCGCAG GACTGGCGCA CTGATTGTTA TATGGTAGCC
CGGCATCTAC TTTATTTACG CAATCACTAT TGGCGTATTA ACTACTCTCT TTCTTTTCCC
CGACTTCGTC CTTATCCTGG ACAAGGAGTA ATACCTTCAT CTTTAATTGA TGAAGCTCAG
TTATTACAAG TTATGTGTGC CTTTCGTTTA TTTGCTCCTG AAGTAGAAAT TTCTCTTTCT
ACTAGAGAGT CTCCATATTT CCGTGATCAT ATTGTACCAA TCGTAGTAAA TAGTGTCAGT
GCGGGTTCAA AAACACAACC AGGTGGTTAT GCTAGTGAAA AACCGGAATT AGAACAGTTT
TTACCATCAG ATAATCGTTC TATGCAAGAA GTAGCGCAAG CATTTATCCA TGCTGGGCTA
CAACCTATAT GGAAAGATGG ACTAGAAAAG CCATTTGTTT GTTCACCTAC AACGACAAAA
AAATATTAA
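
The gene sequence is printed in groups of 10 bases, 60 per line, so the spaces and line breaks have to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only the first line of the sequence is used as a small example; pasting in the full block should allow a comparison with the Gene Length and GC content fields of this record.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used as a small example.
first_line = "ATGATAACTT TTAGTCAGTG TTGGCAAAAA ATAGATTGGA ATACAGCTAG TTTACGTATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))  # 60 0.3
```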
 
Protein sequence
MITFSQCWQK IDWNTASLRI HSKSEQDVER ALNTNSPSTE DMMALLSPSA KNYLEPLAQR 
ARYLTRQRFG NTVNFFVPLY LSNFCTNECS YCGFSISNRI QRKILDEQEI IQECEVISAQ
NVDNILLVTG EHKHKVGIEY FRRYVPKVRK YFTYIMMEVQ PLLSQEYAEL KTLGLNSILV
YQETYHLPTY QIHHLRGKKR DFFWRLETPD RIASVGIDKI GLGVLIGLSQ DWRTDCYMVA
RHLLYLRNHY WRINYSLSFP RLRPYPGQGV IPSSLIDEAQ LLQVMCAFRL FAPEVEISLS
TRESPYFRDH IVPIVVNSVS AGSKTQPGGY ASEKPELEQF LPSDNRSMQE VAQAFIHAGL
QPIWKDGLEK PFVCSPTTTK KY
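
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below is a minimal illustration, not the database's own method. It builds the standard codon table, which has the same amino acid assignments as translation table 11 (table 11 differs in the start codons it allows, and this gene starts with ATG). Unknown codons are mapped to "X" as an assumption of this sketch, and translation stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS, ignoring whitespace, up to the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGATAACTT TTAGTCAGTG TTGGCAAAAA ATAGATTGGA ATACAGCTAG TTTACGTATT"
print(translate(first_line))  # MITFSQCWQKIDWNTASLRI
```

The printed result matches the first 20 residues of the protein sequence above, and translating the final nine bases (AAATATTAA) gives KY followed by a stop codon, which matches its last two residues.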