Gene Information |
Locus tag | Ent638_0202 |
Symbol | thiH |
ID | 5111656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 236701 |
End bp | 237834 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640490364 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001174943 |
Protein GI | 146309869 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
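The coordinates, gene length, and protein length listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Ent638_0202 (thiH)
length_bp = gene_length(236701, 237834)
print(length_bp)                  # 1134
print(protein_length(length_bp))  # 377
```

The strand field does not enter this calculation: a gene on the minus strand spans the same number of bases between its start and end coordinates.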
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0117208 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCT TTGTGGAACG TTGGCGGCAA CTCAACTGGG ATGATATTCG CCTGCGGATC AACAGCAAAA CCCCTGCAGA TGTTGAGCGT GCACTGCATG CGCGTCATCC CACCCGCGAA GACATGATGG CTCTGCTCTC CCCCGCCGCC GGTGCATATC TGGAACCGAT GGCGCAACGT GCCCAGCAGC TGACTCGGCA ACGCTTTGGC AATACGGTCA GTTTTTACGT GCCGCTCTAT CTTTCCAATC TTTGCGCCAA CGACTGTACC TATTGCGGTT TTTCGATGAG CAACCGTATC AAGCGTAAAA CTCTTGATGA CGTCGAGATC GCACGTGAGT GCGCGGCGAT CCGCGAAATG GGTTTTGAGC ATCTTTTATT GGTGACAGGG GAACATCAGG CAAAAGTGGG GATGGACTAT TTTCGGCGTC ATTTCCCGGC TATTCGCCGC CAGTTCGCTT CATTGCAGAT GGAAGTCCAG CCGCTGTCTG AGGATGAATA TGCAGAGCTT AAAACGTTAG GCCTCGACGG CGTGATGGTC TATCAGGAGA CGTATCACGA AAAAATATAC GCTCAGCATC ACCTGAAGGG TAAAAAGCAG GATTTCTTTT TCCGCCTGGA GACGCCTGAC CGGCTCGGGC GCGCAGGAAT CGATAAAATC GGTCTCGGCG CATTGACGGG GTTGTCAGAC AGCTGGCGGG TGGACAGCTA TATGGTGGCG GAGCATTTGC TATGGCTTCA GCAACATTAC TGGCAGAGCC GCTATTCGAT CTCATTTCCT CGCCTACGCC CTTGCACTGG CGGTATCGAG CCGGCATCGA TCATGGATGA ACGACAGCTG GTACAGGCCA TTTGCGCATT TCGTTTGCTG GCTCCGGATG TCGAATTATC GTTATCCACG CGGGAATCGC CCGAATTTCG GGATCGGGTC ATTCCGTTGG CGATTAATAA CGTCAGCGCG TTTTCGAAAA CACAACCGGG GGGCTACGCC GATGAGCATC CTGAGCTGGA GCAATTTTCC CCTCATGATG GCCGTCGTCC TGAAGATGTT GCACAGGCGC TTATGGCACA AGGGTTACAG CCAGTGTGGA AAGACTGGGA TAGCTGGCTT GGGCGCGCCT CGCAGCTTCG TTGA
|
Protein sequence | MSTFVERWRQ LNWDDIRLRI NSKTPADVER ALHARHPTRE DMMALLSPAA GAYLEPMAQR AQQLTRQRFG NTVSFYVPLY LSNLCANDCT YCGFSMSNRI KRKTLDDVEI ARECAAIREM GFEHLLLVTG EHQAKVGMDY FRRHFPAIRR QFASLQMEVQ PLSEDEYAEL KTLGLDGVMV YQETYHEKIY AQHHLKGKKQ DFFFRLETPD RLGRAGIDKI GLGALTGLSD SWRVDSYMVA EHLLWLQQHY WQSRYSISFP RLRPCTGGIE PASIMDERQL VQAICAFRLL APDVELSLST RESPEFRDRV IPLAINNVSA FSKTQPGGYA DEHPELEQFS PHDGRRPEDV AQALMAQGLQ PVWKDWDSWL GRASQLR
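The gene and protein sequences above can be cross-checked against each other and against the reported GC content. The sketch below is a hedged illustration, not the method the database uses. It assumes the gene sequence is given 5' to 3' on the coding strand and that whitespace between 10-base blocks should be ignored. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard codon table is used here. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (also valid for translation table 11)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record
fragment = "ATGAGCACCT TTGTGGAACG TTGGCGGCAA"
print(translate(fragment))             # MSTFVERWRQ
print(round(gc_content(fragment), 3))  # 0.533
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and the GC content reported in the record.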