Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppha_2004 |
Symbol | thiH |
ID | 6462958 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelodictyon phaeoclathratiforme BU-1 |
Kingdom | Bacteria |
Replicon accession | NC_011060 |
Strand | + |
Start bp | 2091942 |
End bp | 2093042 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642728203 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_002018833 |
Protein GI | 194337039 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGAGA TACCCGACTG GCTCACGGAC AATCGGGAGA CGATGGCGCT TGCCGCCATG CTTGCGCCCC CCTATGCATC GGATTCCCTC GAAGCGCTTG CTGCCGAATC GAGAGCGATC ACCCTGCGTC GTTTTGGACG CACCATGACG CTCTACGCCC CGCTCTACCT TTCGAACTAC TGTTCCAGCG GCTGCGTCTA CTGCGGTTTT GCCTCCGACC GAAAAACTCT GCGCCACCGC CTTGAACCCG ATGAGATCGT CCGGGAGCTG AAGGCCATGA AAAAGCTCGG CATCAGCGAC ATCCTTCTCC TCACCGGCGA ACGGACAGCC GCTGCCGACT TCGACTACCT CCGAAAAAGT GTTGAAATCG CTGCACAGGA GATGCAACGG GTCTCGGTTG AGGCCTTTCC TATGAGCGTC AGTGAATACC GCGCACTTGC CGACTGCGGC TGCACCGGCA TCACCATATA CCAGGAGACC TACAACCGGG AGCGTTACGA GGCGCTGCAC CGCTGGGGGC CAAAAAAAGA TTATATCGAC CGGCTTGAAA CCCCGGCAAG AGCCCTCGAA GGGGGCATAA AAAACGTCGG ACTCGGAGTG CTGTTCGGCC TCTCCGATCC GGTTGAAGAT GCTCTGGCCC TCTACCGGCA CCTTCGATAT CTTGGCAGAA CCTGGTGGCG TGCAGGCATG TCACTCTCCT TTCCCCGCAT GAGACCCCAG ACCGGCGGTT ATGAGCCCCC GTTCCCCGTT GATGACCACC TTCTCGCCCG CATGATCTTT GCCTTCCGCA TAGCGCTGCC GGATACGGAG CTGGTTCTCT CAACAAGGGA GAGCGCAGCT TACCGCGACG GCATGGCAGG ACTGGGCATT ACCCGAATGA GCATTGAAAG CCGCACCACC GTTGGCGGCT ACGATAACCC GGAGAACAAG GAAGAGGGAC AGTTTGAAAT TTTTGACGAC CGCACCGCCA AAGAATTTTG CACCGCGCTG CGCAAAAAAA ATATAGAACC TGTCTTTAAA AACTGGGAAC CAGCCTATAA TGGTCCGTCT GATGGCAACA AGACAACAAT GCATCATGGA GAAGCGGAAA CATGCCATTG A
|
Protein sequence | MREIPDWLTD NRETMALAAM LAPPYASDSL EALAAESRAI TLRRFGRTMT LYAPLYLSNY CSSGCVYCGF ASDRKTLRHR LEPDEIVREL KAMKKLGISD ILLLTGERTA AADFDYLRKS VEIAAQEMQR VSVEAFPMSV SEYRALADCG CTGITIYQET YNRERYEALH RWGPKKDYID RLETPARALE GGIKNVGLGV LFGLSDPVED ALALYRHLRY LGRTWWRAGM SLSFPRMRPQ TGGYEPPFPV DDHLLARMIF AFRIALPDTE LVLSTRESAA YRDGMAGLGI TRMSIESRTT VGGYDNPENK EEGQFEIFDD RTAKEFCTAL RKKNIEPVFK NWEPAYNGPS DGNKTTMHHG EAETCH
|
| |