Gene SeAg_B4402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B4402 
SymbolthiH 
ID6794211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp4295211 
End bp4296344 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content56% 
IMG OID642778497 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002149067 
Protein GI197251851 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACCT TCACCGATCG CTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC 
AACGGTAAAA CCACCGCTGA CGTGGAACGC GCGCTTAACG CTCCACAGCT AAGTCGTGAC
GACCTGATGG CACTGCTCTC CCCCGCCGCC GCCGATTATC TGGAACCGCT GGCGCAGCGG
GCACAAAGGC TGACCCGCCA GCGCTTTGGC AACACCGTCA GTTTCTATGT GCCGCTTTAT
CTCTCAAACC TCTGTGCCAA CGACTGCACC TACTGCGGTT TTTCGATGAG CAACCGCATC
AAGCGTAAGA CGCTGGATGA TGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTGAGTTG
GGTTTTGAGC ATCTGCTGTT AGTCACCGGC GAACATCAGG CCAAAGTCGG CATGGACTAT
TTTCGCCGAC ATTTACCGGC CATTCGTCGC CAGTTTTCCT CGCTGCATAT GGAAGTGCAG
CCGCTGGCGA CCGAGGAGTA TGCCGAGCTG AAGACGCTGG GGCTGGATGG CGTGATGGTG
TATCAGGAGA CCTATCACGA ACCGGTATAC GCTCAGCACC ATTTACGGGG CAAAAAGCAA
GACTTCTTCT GGCGGCTGGA GACGCCAGAC AGACTGGGAC GCGCGGGGAT CGATAAAATT
GGTTTAGGCG CGCTAATCGG GCTTTCCGAC AGCTGGCGGG TTGATTGCTA TATGGTGGCG
GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG
CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG
GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC
CGCGAATCGC CGTGGTTTCG AGACCATGTG ATTCCTCTGG CAATCAACAA CGTCAGCGCC
TTCTCAAAAA CCCAGCCCGG CGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT
CCCCACGATG CCCGTCGGCC TGAAACCGTA GCAAGCGCGT TAAGCGCGCA AGGGTTACAG
CCCGTGTGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAATTCG GTGA
 
Protein sequence
MKTFTDRWRQ LEWDDIRLRI NGKTTADVER ALNAPQLSRD DLMALLSPAA ADYLEPLAQR 
AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDDVDI QRECDAIREL
GFEHLLLVTG EHQAKVGMDY FRRHLPAIRR QFSSLHMEVQ PLATEEYAEL KTLGLDGVMV
YQETYHEPVY AQHHLRGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD SWRVDCYMVA
EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST
RESPWFRDHV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPETV ASALSAQGLQ
PVWKDWDSWL GRASQIR