Gene SbBS512_E4480 details


Gene Information

Locus tag: SbBS512_E4480
Symbol: thiH
ID: 6270122
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 4191591
End bp: 4192724
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 55%
IMG OID: 641728274
Product: thiamine biosynthesis protein ThiH
Protein accession: YP_001882676
Protein GI: 187733267
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR02351] thiazole biosynthesis protein ThiH
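
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions match the numbers in this record but are not stated by it. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(4191591, 4192724))  # (1134, 377)
```

Under these assumptions the result agrees with the listed Gene Length (1134 bp) and Protein Length (377 aa).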


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACCT TCAGCGATCG CTGGCGACAA CTGGACTGGG ATGACATCCG CCTGCGTATC 
AACGGCAAAA CGGCTGTTGA CGTAGAGCGG GCGCTAAATG CCTCGCAATT CACCCGCGAC
GATATGATGG CGCTGTTATC GCCAGCCGCC AGTGGCTATC TGGAACAACT GGCCCAACGG
GCGCAGCGTC TGACCCGTCA GCGATTTGGC AACGCAGTTA GTTTCTACGT CCCGCTTTAT
CTTTCCAATC TTTGCGCTAA CGACTGCACG TACTGCGGAT TTTCCATGAG TAATCGCATC
AAGCGCAAAA CGCTGGATGA AGCGGATATT GCCAGGGAAA GCGCCGCTAT ACGGGAGATG
GGCTTTGAAC ATCTGCTATT AGTCACTGGT GAACATCAGG CGAAAGTGGG GATGGATTAC
TTTCGTCGTC ATCTCCCCGC CCTGCGTGAA CAGTTCTCTT CACTACAAAT GGAAGTGCAA
CCGCTGGCGA AGACGGAATA CGCCGAGTTA AAGCAGCTTG GTCTGGATGG CGTGATGGTT
TATCAGGAGA CATATCACGA GGCGACTTAT GCCCGCCATC ATCTGAAAGG CAAAAAACAG
GACTTCTTCT GGCGGCTGGA AACGCCGGAT CGGCTGGGGC GTGCGGGGAT TGATAAGATA
GGCCTCGGCG CGCTAATTGG CCTTTCCGAC AACTGGCGCG TTGACTGCTA TATGGTTGCC
GAACATTTGC TATGGCTGCA ACAGCATTAC TGGCAAAGCC GTTACTCTGT CTCTTTTCCG
CGCCTGCGCC CGTGTACTGG CGGCATTGAG CCTGCGTCGA TTATGGATGA ACGCCAGTTA
GTGCAAACCA TCTGCGCCTT CCGACTGCTT GCACCGGAGA TTGAACTGTC ACTCTCCACG
CGGGAATCAC CGTGGTTTCG CGATCGCGTT ATTCCGCTGG CGATCAATAA CGTCAGCGCC
TTCTCGAAAA CGCAGCCAGG TGGCTATGCC GATAATCACC CCGAGTTGGA ACAGTTCTCA
CCGCACGACG ATCGCAGACC GGAAGCGGTT GCTGCCGCGT TAACCGCTCA GGGTTTGCAG
CCGGTATGGA AAGACTGGGA CAGCTATCTG GGACGGGCCT CGCAAAGACT ATGA
 
Protein sequence
MKTFSDRWRQ LDWDDIRLRI NGKTAVDVER ALNASQFTRD DMMALLSPAA SGYLEQLAQR 
AQRLTRQRFG NAVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEADI ARESAAIREM
GFEHLLLVTG EHQAKVGMDY FRRHLPALRE QFSSLQMEVQ PLAKTEYAEL KQLGLDGVMV
YQETYHEATY ARHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA
EHLLWLQQHY WQSRYSVSFP RLRPCTGGIE PASIMDERQL VQTICAFRLL APEIELSLST
RESPWFRDRV IPLAINNVSA FSKTQPGGYA DNHPELEQFS PHDDRRPEAV AAALTAQGLQ
PVWKDWDSYL GRASQRL
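
As a rough illustration of how the gene and protein sequences relate, the sketch below translates a nucleotide string codon by codon and computes its GC fraction. It uses the standard codon assignments, which NCBI translation table 11 shares with table 1 (the tables differ in permitted start codons, which this sketch ignores). The helper names `translate` and `gc_fraction` are hypothetical, and whitespace such as the 10-base grouping used above is stripped before processing.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (first, second, third base each cycle T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First seven codons of the gene sequence above.
print(translate("ATGAAAACCT TCAGCGATCG C"))  # MKTFSDR
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the reported GC content of 55%.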