Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1268 |
Symbol | thiH |
ID | 3748306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | + |
Start bp | 1733993 |
End bp | 1735063 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637773806 |
Product | thiamine biosynthesis protein ThiH |
Protein accession | YP_379572 |
Protein GI | 78189234 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR02351] thiazole biosynthesis protein ThiH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAAA TTCCAGCTTG GCTCCATACC ACAAACGATG CAAACGCCCT TGCTTCTTTA CTTGCACCCA ATGCAACACG ATCGCTTGAA TCGCTTGCAG CAGAAGCATC GGCTATCACA CGCCGCCGTT TTGGACGCAC CATAACGCTC TACGCGCCGC TTTACCTTTC AAACCATTGC TCTAACGGCT GCGCTTATTG CGGCTTTGCT TCTGACCGAA CAACGCCGCG CCGCCGCCTT GAAATGGAGG AGATTCGCCG CGAAATAGCA GCTATGAAAG CACTCGGCAT TAGCGATATT TTGCTCTTGA CGGGAGAGCG CACACCTGCG GCTGATTTCG ACTATTTGCG CCAAAGCGTA GCACTTGCGG CTGAAGAAAT GCAGCGCGTT GCCGTTGAAG CCTTCCCCAT GAGCGTAGCC GAATATCGAG CCTTAGCGGA GAGCGGCTGC ACCAGCGTTA CCATTTACCA AGAGACCTAC AATCGCAAAC AATACGAAGC GCTTCACCGC TGGGGAGCAA AAAAAGATTT TCTCTATCGG CTTGAAACGC CTGCCCGCGC ACTTGAAGCC GGCATTAAGC ATGTAGGGCT TGGCGTACTC TTGGGACTTT CCGATCCAAT AGAAGATGCC CTTTGCCTCT ACCGCCATGT GCGCCATCTT GAACGGCGCT ACTGGCGAGC TGGATTTTCC ATCTCCTTTC CCCGCTTGCG CCCCGAAAGC GGCGGCTATC AACCACCATT TCCTGTTGAC GATCGCCAAC TTGCCCGCCT GATTATGGCG TTCCGCATTG CACTGCCAAA CATCGAATTA GTACTTTCCA CCCGCGAAAG TGCTCGCTTT CGCGATGGCA TGGCAACCCT CGGCATTACT CGCATGAGCG TTGAAAGCCG CACCACTGTT GGAGGCTATG CAGAAAACGA AACCATTAAA AGCAGTGCAG GACAGTTTGA AATTTGCGAT GACCGCAACG TTGAAGAGTT TTGTGCCGCT TTACGAACAC AGCAGATTGA GCCAATTTTT AAGAATTGGG AACGCGCTTA CAATGCGCCA TCAATGAGCT GCTTTTTATA A
|
Protein sequence | MAEIPAWLHT TNDANALASL LAPNATRSLE SLAAEASAIT RRRFGRTITL YAPLYLSNHC SNGCAYCGFA SDRTTPRRRL EMEEIRREIA AMKALGISDI LLLTGERTPA ADFDYLRQSV ALAAEEMQRV AVEAFPMSVA EYRALAESGC TSVTIYQETY NRKQYEALHR WGAKKDFLYR LETPARALEA GIKHVGLGVL LGLSDPIEDA LCLYRHVRHL ERRYWRAGFS ISFPRLRPES GGYQPPFPVD DRQLARLIMA FRIALPNIEL VLSTRESARF RDGMATLGIT RMSVESRTTV GGYAENETIK SSAGQFEICD DRNVEEFCAA LRTQQIEPIF KNWERAYNAP SMSCFL
|
| |