Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A4491 |
Symbol | thiH |
ID | 6486480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 4367159 |
End bp | 4368292 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642739721 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002043407 |
Protein GI | 194443535 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.721357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00000387053 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACCT TCACCGACCG TTGGCGGCAA CTGGAGTGGG ACGATATTCG CCTGCGCATC AATGGTAAAA CCGCCGCTGA CGTGGAACGT GCGCTGAACG CCGCGCATCT TAGCCGGGAT GATTTGATGG CACTGCTCTC CCCCGCCGCC GCCGATTATC TGGAACCGAT AGCGCAGCGG GCGCAAAGGC TGACCCGACA ACGCTTTGGC AACACCGTCA GTTTCTACGT GCCGCTTTAT CTCTCAAACC TCTGTGCCAA CGACTGTACC TACTGCGGTT TTTCGATGAG CAACCGCATT AAGCGTAAAA CGCTGGATGA GGTGGATATT CAAAGGGAGT GCGATGCTAT CCGTAAACTG GGCTTTGAGC ATCTGCTGTT AGTCACCGGC GAACATCAGG CCAAAGTCGG CATGGACTAT TTTCGCCGTC ATTTACCCAC CATCCGCCGT CAATTTTCCT CTTTACAGAT GGAAGTCCAG CCCTTGTCGC AAGAAAACTA TGCGGAGCTC AAAACACTGG GGATCGATGG CGTGATGGTT TATCAGGAGA CTTATCATGA GGCAATCTAT GCACAGCATC ACCTGAAGGG AAAGAAACAG GACTTTTTCT GGCGGCTGGA AACGCCGGAT CGGTTAGGCC GGGCAGGTAT CGACAAAATC GGTCTTGGCG CGCTAATTGG TCTGTCGGAC AACTGGCGGG TGGATTGCTA TATGGTGGCG GAGCATCTGT TGTGGATGCA GAAACACTAC TGGCAGAGTC GCTATTCTGT TTCCTTCCCG CGTCTGCGTC CGTGTACTGG CGGTGTGGAA CCCGCATCTG TGATGGATGA AAAGCAACTG GTGCAAACGA TTTGCGCTTT CCGGTTATTG GCGCCGGAAA TTGAATTATC ACTCTCCACC CGCGAATCGC CGTGGTTTCG AGACCATGTG ATTCCTCTGG CAATCAACAA CGTCAGCGCC TTCTCAAAAA CGCAGCCCGG CGGCTACGCT GACGATCATC CGGAACTGGA GCAGTTTTCT CCCCACGATG CCCGTCGGCC TGAAACAGTA GCAAGCGCGT TAAGCGCGCA AGGATTGCAG CCCGTCTGGA AAGACTGGGA CAGTTGGCTG GGGCGCGCTT CGCAAACGCG GTGA
|
Protein sequence | MKTFTDRWRQ LEWDDIRLRI NGKTAADVER ALNAAHLSRD DLMALLSPAA ADYLEPIAQR AQRLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDEVDI QRECDAIRKL GFEHLLLVTG EHQAKVGMDY FRRHLPTIRR QFSSLQMEVQ PLSQENYAEL KTLGIDGVMV YQETYHEAIY AQHHLKGKKQ DFFWRLETPD RLGRAGIDKI GLGALIGLSD NWRVDCYMVA EHLLWMQKHY WQSRYSVSFP RLRPCTGGVE PASVMDEKQL VQTICAFRLL APEIELSLST RESPWFRDHV IPLAINNVSA FSKTQPGGYA DDHPELEQFS PHDARRPETV ASALSAQGLQ PVWKDWDSWL GRASQTR
|
| |