Gene Information |
Locus tag | Emin_1333 |
Symbol | thiH |
ID | 6262928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 1435481 |
End bp | 1436857 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 642611813 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001876220 |
Protein GI | 187251738 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
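The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1435481, 1436857)
print(length)                     # 1377
print(protein_length_aa(length))  # 458
```

Both results match the Gene Length (1377 bp) and Protein Length (458 aa) fields in the record.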
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000230637 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 1.1198e-18 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
Sequence |
Gene sequence | ATGATAATAA ACGACGCCGA GCTTTCCGAG CTTATAAAAA ACTCCAAAGC CCCTACAGAA AAAGAATTAA ACAAAATCCT TTTAAAAGCA AAAAAATTAA ACGGCCTTAA TAAAGACGAA GTTTTAAGCC TTCTTAATGT TGAAGACGAA AAACAGCTTG AACAAATATA TAGTACCGCA AAATTTATCA AAGAGGAAAT TTACGGCAAC CGCATGGTTT TGTTCGCGCC TCTTTATATT TCAAATTTAT GTTCTAATGA ATGCCTTTAC TGCGCTTTCC GCGTTTCAAA CAAAAGCCTT GTAAGAAGGG CCCTTCCGCA GGAAGAAATT GAAAAAGAAG TAATTGAACT TTTAAAACAA GGCCACAAAA GAATACTGCT TGTAGCGGGC GAGTCTTACC CCGGCGGCGG GCTTAAATAT ATTTTTGATT CCATAGACAC CGTTTACAAA ACAAAATGGA ACGGACAAAA CATAAGAAGG GTAAACGTTA ACATAGCCCC CCTTACGGAA GAAGAATTTA AAGAATTGTC CAAACACAAT ATAGGCACGT TCCAATTATT TCAGGAAACG TATCATAAAC CCACTTATTC GGGCCTTCAT ATAGCGGGGC AGAAAAAGAA TTTTGAATTC CGCCTAAACG CTATGGACCG CGCCCTTAAA AACGGGATTC ACGACGTGGG CATAGGCATA TTGTTTGGTC TTTATGACTA TAAATTTGAA GTTATGGCCA TGCTTGAGCA TATATCCCAT TTGGAAAAGA CTTACGGCAT AGGCCCGCAC ACAATTTCCG TTCCCCGCAT TGAACCGGCG GACGGCTCTG ACCTAAGCCT TAAACCGCCG TACCAGCTTT CGGATTTGGA GTTTAAAAAA GTGCTGGCTA TATTAAGAAT AGCCGTTCCT TATACGGGAA TTATTTTAAG CACGCGCGAA AACTCCCAAA TGAGAACGGC GGCCATTGAA ATGGGCGTGT CACAGATGTC GGCAGGGTCA AAGACCAATC CCGGCGGATA TGAAGAAGGA TCCGCGGGCG CGCAATTTTC TTTAGGCGAC CACAGAACCT TAGAACAAGT GATTTTAGAC TTAGTTAAGC ATAACCATGT GCCGTCTTTT TGCACGGGAT GCTATCGTTT AGGCCGCGTG GGCAAAGATT TTATGGATCT GGCTAAACCC GGGCTTATAA AACACCATTG CCTGCCAAAC GCCATTTTTA CTTTTGCCGA ATACCTGCAT GACTTCGCGG GTGAGGAACT TAAACAAAAA GGTTTTGCCT TAATAGAAAA AACCGTTAAT GAGGAAATCA AGGACGAAAA CCTAAAAAAA CTGGCCCTTA AAAACCTTCA TGACATAAAA AACGGCAAAA GAGATATTTA CTTATAA
Protein sequence | MIINDAELSE LIKNSKAPTE KELNKILLKA KKLNGLNKDE VLSLLNVEDE KQLEQIYSTA KFIKEEIYGN RMVLFAPLYI SNLCSNECLY CAFRVSNKSL VRRALPQEEI EKEVIELLKQ GHKRILLVAG ESYPGGGLKY IFDSIDTVYK TKWNGQNIRR VNVNIAPLTE EEFKELSKHN IGTFQLFQET YHKPTYSGLH IAGQKKNFEF RLNAMDRALK NGIHDVGIGI LFGLYDYKFE VMAMLEHISH LEKTYGIGPH TISVPRIEPA DGSDLSLKPP YQLSDLEFKK VLAILRIAVP YTGIILSTRE NSQMRTAAIE MGVSQMSAGS KTNPGGYEEG SAGAQFSLGD HRTLEQVILD LVKHNHVPSF CTGCYRLGRV GKDFMDLAKP GLIKHHCLPN AIFTFAEYLH DFAGEELKQK GFALIEKTVN EEIKDENLKK LALKNLHDIK NGKRDIYL
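The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before the sequence is used elsewhere. A minimal sketch of that cleanup, with a simple GC percentage calculation, is shown below. The helper names `join_blocks` and `gc_percent` are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record, so its GC value is not expected to equal the GC content field reported for the whole gene.

```python
def join_blocks(display_seq: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(display_seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in the record above.
first_blocks = "ATGATAATAA ACGACGCCGA"
seq = join_blocks(first_blocks)
print(seq)              # ATGATAATAAACGACGCCGA
print(len(seq))         # 20
print(gc_percent(seq))  # 40.0
```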