Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B4402 |
Symbol | thiH |
ID | 6794211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 4295211 |
End bp | 4296344 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642778497 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002149067 |
Protein GI | 197251851 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACCT TCACCGATCG CTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC AACGGTAAAA CCACCGCTGA CGTGGAACGC GCGCTTAACG CTCCACAGCT AAGTCGTGAC GACCTGATGG CACTGCTCTC CCCCGCCGCC GCCGATTATC TGGAACCGCT GGCGCAGCGG GCACAAAGGC TGACCCGCCA GCGCTTTGGC AACACCGTCA GTTTCTATGT GCCGCTTTAT CTCTCAAACC TCTGTGCCAA CGACTGCACC TACTGCGGTT TTTCGATGAG CAACCGCATC AAGCGTAAGA CGCTGGATGA TGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTGAGTTG GGTTTTGAGC ATCTGCTGTT AGTCACCGGC GAACATCAGG CCAAAGTCGG CATGGACTAT TTTCGCCGAC ATTTACCGGC CATTCGTCGC CAGTTTTCCT CGCTGCATAT GGAAGTGCAG CCGCTGGCGA CCGAGGAGTA TGCCGAGCTG AAGACGCTGG GGCTGGATGG CGTGATGGTG TATCAGGAGA CCTATCACGA ACCGGTATAC GCTCAGCACC ATTTACGGGG CAAAAAGCAA GACTTCTTCT GGCGGCTGGA GACGCCAGAC AGACTGGGAC GCGCGGGGAT CGATAAAATT GGTTTAGGCG CGCTAATCGG GCTTTCCGAC AGCTGGCGGG TTGATTGCTA TATGGTGGCG GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC CGCGAATCGC CGTGGTTTCG AGACCATGTG ATTCCTCTGG CAATCAACAA CGTCAGCGCC TTCTCAAAAA CCCAGCCCGG CGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT CCCCACGATG CCCGTCGGCC TGAAACCGTA GCAAGCGCGT TAAGCGCGCA AGGGTTACAG CCCGTGTGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAATTCG GTGA
|
Protein sequence | MKTFTDRWRQ LEWDDIRLRI NGKTTADVER ALNAPQLSRD DLMALLSPAA ADYLEPLAQR AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDDVDI QRECDAIREL GFEHLLLVTG EHQAKVGMDY FRRHLPAIRR QFSSLHMEVQ PLATEEYAEL KTLGLDGVMV YQETYHEPVY AQHHLRGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD SWRVDCYMVA EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST RESPWFRDHV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPETV ASALSAQGLQ PVWKDWDSWL GRASQIR
|
| |