Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1557 |
Symbol | thiH |
ID | 6375235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1680892 |
End bp | 1681962 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684047 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001959961 |
Protein GI | 189500491 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.190528 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.123415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGCC TCCCGACCTG GCTGACAAAT AAAAAGGCCT CTGATGACAT TGCTCCTCTG CTGCGAAATG ATTCCCCGGT CTCACTCGAA TCTCTGGCAG CTGAAGCCCG GGCTATCACG TTAAGCCGTT TCGGCCGGAC CATGTCGCTC TATGCGCCTC TTTATCTCTC GAATTTCTGC TCAAGCGGCT GCGCCTATTG CGGTTATGCT TCTGACCGGA CAACACGACG ACGTCGTCTT GAATCTGAAG AGGTCAGACA TGAACTTGAA AGCATGAAAA AGATGGGCAT ACACGATGTC CTTCTCCTGA CAGGTGAACG AACTTCCGCG GCAGAATTTG ACTATCTGAG GAGGTGTGTC GAAATCGCCT CGGGCATCAT GGCCAGAGTT TCGATCGAGG CTTTTCCGAT GAATGTCAAT GAATATCGAG CTCTTGCCGA CAGCGGCTGT ACCGGAGTCA CCATCTATCA GGAGACCTAT GATCCGGAAC AATACCGTCG CCTGCATCGC TGGGGGCCGA AACAGAACTT CCTTGAACGG CTGGAAATAC CTGAACGCGC GCTTCAGGCA GGCATAAAAA CAGTCGGTAT CGGGGCTCTT CTCGGTCTTT CCGAGCCGGT CGAGGAGGCG TTGAGAGTAC TGCGACATGT CAGACACCTC AGCCGTACCT ACTGGAGAGC AGGTCTGTCG GTATCATTCC CGCGCATTCG ACCACAGACT GGTGGGTTTC AACCTGAATT CACCGTGTCC GATCACTTTC TTGCAAGAAT GATTTTCGCG TTCCGTATAG GCTTACCGGA TGTTGAACTG GTGCTTTCAA CCCGTGAAAG TCCTGCGTTC CGCGACGGCA TGGCCGGTCT TGGAATCACA CGAATGAGTA TCGCCAGCAG AACAACTGTT GGCGGCTACC TTGAACCTGA CGGGAATGAA AAAGGCCAGT TTGAAGTCAA TGATGACCGG AATACGAAAA CGTTCTGCAA GGCCCTGGAA GCAAAAAACA TTGAACCTGT CTTTAAAAAC TGGGAACCGG TATACAACGG ACCAAACACT GAGTCAGAAC AAACAGGATA A
|
Protein sequence | MNGLPTWLTN KKASDDIAPL LRNDSPVSLE SLAAEARAIT LSRFGRTMSL YAPLYLSNFC SSGCAYCGYA SDRTTRRRRL ESEEVRHELE SMKKMGIHDV LLLTGERTSA AEFDYLRRCV EIASGIMARV SIEAFPMNVN EYRALADSGC TGVTIYQETY DPEQYRRLHR WGPKQNFLER LEIPERALQA GIKTVGIGAL LGLSEPVEEA LRVLRHVRHL SRTYWRAGLS VSFPRIRPQT GGFQPEFTVS DHFLARMIFA FRIGLPDVEL VLSTRESPAF RDGMAGLGIT RMSIASRTTV GGYLEPDGNE KGQFEVNDDR NTKTFCKALE AKNIEPVFKN WEPVYNGPNT ESEQTG
|
| |