Gene SeSA_A4369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A4369 
SymbolthiH 
ID6519876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp4243024 
End bp4244157 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content54% 
IMG OID642749321 
Productthiamine biosynthesis protein ThiH 
Protein accessionYP_002117060 
Protein GI194737225 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR02351] thiazole biosynthesis protein ThiH 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCT TCACCGATCG CTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC 
AACGGTAAAA CCGCCACTGA CGTGGAATGT GCGCTTAACA CTCCACAGCT TAGCCGGGAC
GATCTGATGG CATTACTCTC CCCCGCCGCC GCCGATTATC TGGAACCGAT GGCGCAGCGG
GCACAAAGGC TGACCCGACA ACGCTTTGGC AACACCGTCA GTTTCTACGT GCCGCTTTAT
CTCTCAAACC TGTGCGCCAA CGACTGTACC TACTGCGGTT TTTCGATGAG CAACCGCATT
AAGCGTAAAA CGCTGGATGA GGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTAAACTG
GGCTTTGAGC ATCTGCTGTT GGTCACCGGC GAACACCAGA GCAAAGTGGG CATGGACTAT
TTCCGTCGTC ATTTACCGGC TATTCGCCGC CAATTTTCAT CATTACAGAT GGAAGTCCAG
CCCTTGTCGC AAGAAAACTA TGCGGAGCTC AAAACGCTGG GGATCGATGG CGTGATGGTT
TATCAGGAAA CTTACCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG
GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC
GGTCTTGGCG CGCTGATTGG GCTTTCTGAC AGTTGGCGGG TGGACTGCTA TATGGTGGCG
GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG
CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG
GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC
CGCGAATCGC CGTGGTTTCG CGATAACGTG ATCCCGTTGG CGATCAACAA CGTTAGCGCC
TTCTCGAAAA CCCAGCCCGG TGGCTACGCT GACGATCATC CGGAACTTGA GCAGTTTTCT
CCCCACGATG CCCGTCGGCC TGAAGCCGTA GCAAGCGCGT TAAGCGCGCA AGGGTTACAG
CCCGTCTGGA AAGACTGGGA CAGTTGGCTG GGGCGCACTT CGCAAATGCG GTGA
 
Protein sequence
MKTFTDRWRQ LEWDDIRLRI NGKTATDVEC ALNTPQLSRD DLMALLSPAA ADYLEPMAQR 
AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEVDI QRECDAIRKL
GFEHLLLVTG EHQSKVGMDY FRRHLPAIRR QFSSLQMEVQ PLSQENYAEL KTLGIDGVMV
YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD SWRVDCYMVA
EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST
RESPWFRDNV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPEAV ASALSAQGLQ
PVWKDWDSWL GRTSQMR