Gene SeD_A4565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A4565 
SymbolthiH 
ID6873901 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp4407458 
End bp4408591 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content55% 
IMG OID642787473 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002218075 
Protein GI198243870 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.000453457 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAACCT TCAGCGACCG TTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC 
AACGGTAAAA CTGCCGCCGA CGTGGAGCGC GCGCTGAACA CCGCACATCT TAGCCGGGAC
GATTTGATGG CGTTGCTCTC CCCCGCCGCC GCCAATTATC TGGAACCGAT GGCGCAGCGG
GCACAAAGGC TGACCCGACA ACGCTTTGGC AACACCGTCA GTTTCTATGT GCCGCTTTAT
CTCTCAAACC TCTGTGCCAA CGACTGCACC TACTGCGGTT TTTCGATGAG CAACCGCATC
AAGCGTAAGA CGCTGGATGA GGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTAAACTG
GGCTTTGAGC ATCTGTTGCT GGTGACTGGC GAACATCAGG CCAAAGTAGG AATGAACTAT
TTTCGCCGCC ATTTACCGGC TATCCGCCGT CAATTTTCAT CATTACAGAT GGAAGTCCAG
CCCTTGTCGC AAGAAAACTA CGCGGAGCTC AAAACGCTGG GGATCGATGG CGTGATGGTT
TATCAGGAAA CTTACCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG
GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC
GGTCTTGGCG CGCTAATTGG TCTGTCGGAC AACTGGCGGG TGGATTGCTA TATGGTGGCG
GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG
CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG
GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC
CGCGAATCGC CGTGGTTTCG AGACCATGTG ATTCCTCTGG CAATCAACAA CGTCAGCGCC
TTCTCAAAAA CGCAGCCCGG CGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT
CCCCACGATG CCCGTCGGCC TGAAACAGTA GCAAGCGCGT TAAGCGCGCA AGGATTGCAG
CCCGTCTGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAACGCG GTGA
 
Protein sequence
MKTFSDRWRQ LEWDDIRLRI NGKTAADVER ALNTAHLSRD DLMALLSPAA ANYLEPMAQR 
AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEVDI QRECDAIRKL
GFEHLLLVTG EHQAKVGMNY FRRHLPAIRR QFSSLQMEVQ PLSQENYAEL KTLGIDGVMV
YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA
EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST
RESPWFRDHV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPETV ASALSAQGLQ
PVWKDWDSWL GRASQTR