Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A4369 |
Symbol | thiH |
ID | 6519876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | - |
Start bp | 4243024 |
End bp | 4244157 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642749321 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002117060 |
Protein GI | 194737225 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACCT TCACCGATCG CTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC AACGGTAAAA CCGCCACTGA CGTGGAATGT GCGCTTAACA CTCCACAGCT TAGCCGGGAC GATCTGATGG CATTACTCTC CCCCGCCGCC GCCGATTATC TGGAACCGAT GGCGCAGCGG GCACAAAGGC TGACCCGACA ACGCTTTGGC AACACCGTCA GTTTCTACGT GCCGCTTTAT CTCTCAAACC TGTGCGCCAA CGACTGTACC TACTGCGGTT TTTCGATGAG CAACCGCATT AAGCGTAAAA CGCTGGATGA GGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTAAACTG GGCTTTGAGC ATCTGCTGTT GGTCACCGGC GAACACCAGA GCAAAGTGGG CATGGACTAT TTCCGTCGTC ATTTACCGGC TATTCGCCGC CAATTTTCAT CATTACAGAT GGAAGTCCAG CCCTTGTCGC AAGAAAACTA TGCGGAGCTC AAAACGCTGG GGATCGATGG CGTGATGGTT TATCAGGAAA CTTACCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC GGTCTTGGCG CGCTGATTGG GCTTTCTGAC AGTTGGCGGG TGGACTGCTA TATGGTGGCG GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC CGCGAATCGC CGTGGTTTCG CGATAACGTG ATCCCGTTGG CGATCAACAA CGTTAGCGCC TTCTCGAAAA CCCAGCCCGG TGGCTACGCT GACGATCATC CGGAACTTGA GCAGTTTTCT CCCCACGATG CCCGTCGGCC TGAAGCCGTA GCAAGCGCGT TAAGCGCGCA AGGGTTACAG CCCGTCTGGA AAGACTGGGA CAGTTGGCTG GGGCGCACTT CGCAAATGCG GTGA
|
Protein sequence | MKTFTDRWRQ LEWDDIRLRI NGKTATDVEC ALNTPQLSRD DLMALLSPAA ADYLEPMAQR AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEVDI QRECDAIRKL GFEHLLLVTG EHQSKVGMDY FRRHLPAIRR QFSSLQMEVQ PLSQENYAEL KTLGIDGVMV YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD SWRVDCYMVA EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST RESPWFRDNV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPEAV ASALSAQGLQ PVWKDWDSWL GRTSQMR
|
| |