Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plut_1566 |
Symbol | thiH |
ID | 3745413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium luteolum DSM 273 |
Kingdom | Bacteria |
Replicon accession | NC_007512 |
Strand | + |
Start bp | 1755815 |
End bp | 1756906 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637769599 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_375463 |
Protein GI | 78187420 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0773107 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGAAG TCCCCGCATG GCTGGTGGAT GAAGGGAGTA CTGCGGAAAT GCGCCGGATG CTCTCTTCCG ATTCCCCGGT CGACATTGAA ACCCTCGCCG CCCGTGCCCG CGCCATCACC CTGCGCCGAT TCGGACGAAC CATATCGCTC TATGCCCCGC TCTACCTGTC GAACCACTGC CCGAGCGGTT GCGCATACTG CGGGTTCGCA TCCGACAGGA CCACCCTGAG ACGGCGGCTT GAAGAGGATG AGATCAGAAG GGAGATTGCC GCCATGAAAA AGCTTGGCAT CCGGGACATC CTCCTCCTCA CCGGCGAACG GACTGCGGTG GCCGGGTTCG ACTACCTGCG CCGTGCCGTG GAGATCGCCG CAGAGGAAAT GCCGCGCGTG TCGGTGGAAA CCTTCCCGAT GAGCGTAGAA GAATACAGGG AACTTGCCAG ATGCGGCTGC ACCGGCGTCA CCATCTACCA GGAGACCTAT GACCGGGGGC GTTACGAAGA GCTCCACCGA TGGGGTCCCA AGAAAGATTT TCTCCACCGC CTCGAAACCC CTGAACGGGC GCTGGAAGGC GGCATCAAAA CCGTCGGTAT CGGAGCCCTG CTCGGGCTCT CCGAACCCGT CGAGGAAGCG CTCCGGCTCT ACCGCCATGC GCGCCATCTT GCCAAAACCT GGTGGCGTGC AGGCATTTCG GCCTCATTCC CGCGCATGCG CCCTGAACAG GGCGGCTGGC AGCCCCCATT CAATGTAAGC GACCATCAGC TCGCCCGTAT GATTCTGGCT TTCCGCATCG GTCTTCCAGA CATGGATCTT GCGCTCTCGA CCCGCGAACG GGCATCATTC CGCGACGGCA TGGCCGGACT CGGCGTAACG CGCATGAGCA TCGCCAGCAA AACAACTGTC GGCGGATACG ATGAGGGGGA AACCGGCGAG CGGGGACAGT TTGACATTTC CGACGAGCGG AGCGCCGGGG AGTTCTGTCA GGCACTGCGA AATCGGGGAA TTGAACCGGT CTTCAAGAAC TGGGACGGGG CATACAACGG ACCGGCAACA CAAATCATCC CTACCGGAGG GCTTAAGGAA ACCATCCCAT GA
|
Protein sequence | MKEVPAWLVD EGSTAEMRRM LSSDSPVDIE TLAARARAIT LRRFGRTISL YAPLYLSNHC PSGCAYCGFA SDRTTLRRRL EEDEIRREIA AMKKLGIRDI LLLTGERTAV AGFDYLRRAV EIAAEEMPRV SVETFPMSVE EYRELARCGC TGVTIYQETY DRGRYEELHR WGPKKDFLHR LETPERALEG GIKTVGIGAL LGLSEPVEEA LRLYRHARHL AKTWWRAGIS ASFPRMRPEQ GGWQPPFNVS DHQLARMILA FRIGLPDMDL ALSTRERASF RDGMAGLGVT RMSIASKTTV GGYDEGETGE RGQFDISDER SAGEFCQALR NRGIEPVFKN WDGAYNGPAT QIIPTGGLKE TIP
|
| |