Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C4489 |
Symbol | thiH |
ID | 6489006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 4372422 |
End bp | 4373555 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642744564 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002048144 |
Protein GI | 194449350 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.00456857 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACCT TCACCGACCG CTGGCGGCAA CTGGACTGGG ACGATATTCG CCTGCGCATC AACGGTAAAA CCGCCGCCGA TGTGGAGCGG GCGCTGAATG CTTCACGCCT CAACCGCGAG GATATGATGG CGTTACTTTC CCCCGCCGCC GCCGATTATC TTGAGCCGCT GGCGCAGCGG GCACAAAGGC TGACCCGCCA GCGCTTTGGC AACACCGTCA GTTTCTATGT GCCGCTTTAT CTCTCAAACC TCTGTGCCAA CGACTGCACC TACTGCGGTT TTTCGATGAG CAACCGCATC AAGCGTAAAA CGCTGAATGA GGTGGATATT GAAAGGGAGT GCGACGCTAT CCGTGAGTTA GGTTTTGAGC ATCTGCTATT AGTCACCGGC GAACATCAGG CCAAAGTCGG CATGGACTAT TTTCGCCGTC ATTTACCCAC CATCCGCCGT CAATTTTCCT CTTTACAGAT GGAAGTCCAG CCCTTGCCGC AAGAAAACTA TGCGGAGCTC AAAACGCTGG GGATCGATGG CGTGATGGTT TATCAGGAGA CTTATCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC GGTCTTGGCG CGCTAATTGG TCTGTCGGAC AACTGGCGGG TGGATTGCTA TATGGTGGCG GAGCATCTGT TGTGGATGCA AAAACAGTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC CGCGAATCGC CGTGGTTCCG CGATAACGTG ATCCCGCTGG CGATCAATAA CGTCAGCGCC TTCTCGAAAA CCCAGCCCGG TGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT CCCCACGATG CCCGTCGGCC AGAAGTCGTT GCAAGCGCGT TAAGCGCGCA AGGGTTACAG CCCGTATGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAATGCG GTGA
|
Protein sequence | MKTFTDRWRQ LDWDDIRLRI NGKTAADVER ALNASRLNRE DMMALLSPAA ADYLEPLAQR AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLNEVDI ERECDAIREL GFEHLLLVTG EHQAKVGMDY FRRHLPTIRR QFSSLQMEVQ PLPQENYAEL KTLGIDGVMV YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA EHLLWMQKQY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST RESPWFRDNV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPEVV ASALSAQGLQ PVWKDWDSWL GRASQMR
|
| |