Gene Information |
Locus tag | Clim_0664 |
Symbol | thiH |
ID | 6354278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | - |
Start bp | 737153 |
End bp | 738220 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642668291 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_001942726 |
Protein GI | 189346197 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
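
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(737153, 738220)
print(length)                     # 1068
print(protein_length_aa(length))  # 355
```

Under these assumptions the result matches the listed Gene Length (1068 bp) and Protein Length (355 aa). The strand field does not affect either length.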
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.428424 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACCTGC TACCTGGATG GCTGAGCGCC GGACATGATG AAACCCGCTT CAGGGCCATG CTTTCGCCGG ATTCGCCATA CACCCTCGAA CAGCTTGCGG GAGAGTCGAA AGCCATAACC CTCCGCAGAT TCGGACGCAC CATGTCGCTC TACGCGCCGC TCTACCTTTC GAATCACTGT TCGAGCGGCT GTGCCTACTG CGGCTTCGCC TCCGACAGAA AAACCCCTCG ACGCCGCCTC GAACAGGAAG AAATCCGAAA AGAACTCGGA GCCATGAAAC GCCTCGGCAT CAGCGATGTT CTGCTCCTGA CCGGAGAACG TACCAAAGCT GCGGATTTCG ACTACCTCAG AACCTCCGTC GCCACTGCCG CAGCCGACAT GCAGCGTGTT ACGGTAGAAG CCTTTCCCAT GAGCGTCGCC GAATACCGTG ACCTCGCTGA AGCGGGCTGT ACCGGCATAA CGATCTACCA GGAGACTTAC GATCCCCTGC GCTACGCCGA ACTCCACCGC TGGGGTCCGA AAAAAGATTT CAGGGAACGG CTTGAAACGC CATCGAGAGC CCTTGAAGGA GGCATCAAAA CCGTAGGCCT CGGCGTCCTG CTCGGACTTG CCGACCCGCA GGAGGACGCG CTCATGCTCT ACCGTCATCT TCGTTATCTC GGAAAAACCT ACTGGCGAGC CGGACTTTCG GTATCATTTC CGCGTATACG GCCGCAGACC GGCAGTTACG AGCCGCCCTT CCCGGTAAGC GATCACCTGC TGGCACGCAT GATCTTTGCC TTCCGCATAG CCCTTCCCGA TGTGGAACTT GTGCTCTCCA CCCGTGAAAG CCCGGCTTTC CGCGACGGCA TGGCCGGCAT CGGCGTCACC CGAATGAGTA TCGCAAGCCG CACGACGGTT GGCGGATACC TCGATGCAGA ATCCAGCGAC CGGGGACAGT TCGATGTCTT CGACGACCGC ACGGCAGAAG CGTTCTGCAG CGCTCTGCGC GAAAAAAACA TCGAACCGGT TTTCAAGAAC TGGGAACACG CCTATAACGG TCCGTCACAT CCGGAAGCGG AAAAATAA
Protein sequence | MNLLPGWLSA GHDETRFRAM LSPDSPYTLE QLAGESKAIT LRRFGRTMSL YAPLYLSNHC SSGCAYCGFA SDRKTPRRRL EQEEIRKELG AMKRLGISDV LLLTGERTKA ADFDYLRTSV ATAAADMQRV TVEAFPMSVA EYRDLAEAGC TGITIYQETY DPLRYAELHR WGPKKDFRER LETPSRALEG GIKTVGLGVL LGLADPQEDA LMLYRHLRYL GKTYWRAGLS VSFPRIRPQT GSYEPPFPVS DHLLARMIFA FRIALPDVEL VLSTRESPAF RDGMAGIGVT RMSIASRTTV GGYLDAESSD RGQFDVFDDR TAEAFCSALR EKNIEPVFKN WEHAYNGPSH PEAEK
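
The sequences above are displayed in space-separated blocks of ten characters, so they need light cleanup before any check against the record's other fields (for example GC content or the expected CDS shape). The following is a minimal sketch under stated assumptions: the helper names are hypothetical, the start codon set is a common bacterial subset chosen for illustration rather than the full list for translation table 11, and the record's own GC figure may have been computed differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_cds(
    seq: str,
    start_codons=("ATG", "GTG", "TTG"),  # assumed subset of bacterial start codons
    stop_codons=("TAA", "TAG", "TGA"),
) -> bool:
    """Rough sanity check: whole codons, a plausible start, and a terminal stop."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in start_codons
        and seq[-3:] in stop_codons
    )


# The first two blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGAACCTGC TACCTGGATG")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.5
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` or `looks_like_cds` would allow a comparison with the GC content and Gene Length fields. Note that the displayed sequence begins with ATG and ends with TAA, which suggests it is shown in coding orientation even though the gene lies on the minus strand.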