Gene SNSL254_A4491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A4491 
SymbolthiH 
ID6486480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp4367159 
End bp4368292 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID642739721 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002043407 
Protein GI194443535 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.721357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00000387053 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACCT TCACCGACCG TTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC 
AATGGTAAAA CCGCCGCTGA CGTGGAACGT GCGCTGAACG CCGCGCATCT TAGCCGGGAT
GATTTGATGG CACTGCTCTC CCCCGCCGCC GCCGATTATC TGGAACCGAT AGCGCAGCGG
GCGCAAAGGC TGACCCGACA ACGCTTTGGC AACACCGTCA GTTTCTACGT GCCGCTTTAT
CTCTCAAACC TCTGTGCCAA CGACTGTACC TACTGCGGTT TTTCGATGAG CAACCGCATT
AAGCGTAAAA CGCTGGATGA GGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTAAACTG
GGCTTTGAGC ATCTGCTGTT AGTCACCGGC GAACATCAGG CCAAAGTCGG CATGGACTAT
TTTCGCCGTC ATTTACCCAC CATCCGCCGT CAATTTTCCT CTTTACAGAT GGAAGTCCAG
CCCTTGTCGC AAGAAAACTA TGCGGAGCTC AAAACACTGG GGATCGATGG CGTGATGGTT
TATCAGGAGA CTTATCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG
GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC
GGTCTTGGCG CGCTAATTGG TCTGTCGGAC AACTGGCGGG TGGATTGCTA TATGGTGGCG
GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG
CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG
GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC
CGCGAATCGC CGTGGTTTCG AGACCATGTG ATTCCTCTGG CAATCAACAA CGTCAGCGCC
TTCTCAAAAA CGCAGCCCGG CGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT
CCCCACGATG CCCGTCGGCC TGAAACAGTA GCAAGCGCGT TAAGCGCGCA AGGATTGCAG
CCCGTCTGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAACGCG GTGA
 
Protein sequence
MKTFTDRWRQ LEWDDIRLRI NGKTAADVER ALNAAHLSRD DLMALLSPAA ADYLEPIAQR 
AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEVDI QRECDAIRKL
GFEHLLLVTG EHQAKVGMDY FRRHLPTIRR QFSSLQMEVQ PLSQENYAEL KTLGIDGVMV
YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA
EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST
RESPWFRDHV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPETV ASALSAQGLQ
PVWKDWDSWL GRASQTR