Gene Information |
Locus tag | Cpha266_1851 |
Symbol | |
ID | 4571193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2145318 |
End bp | 2147018 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 639766433 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_912291 |
Protein GI | 119357647 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
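The coordinate and length fields above are related by simple arithmetic, which the following sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cpha266_1851.
length = gene_length(2145318, 2147018)
print(length, protein_length(length))  # 1701 566
```

Both results agree with the Gene Length (1701 bp) and Protein Length (566 aa) fields.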
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCTCA TGACTACTTC TTCAGAGACC GGCTTTTGCA CCGGAAACCA GACTCCCGGA ATCGCTTCAG AAAAAATCTA TGCCGAAGGC ACGATCTTCC CGCTGAAAAT CGGCATGAGA CAGATAAAGC TCAGCAAAAC CTATACAACA AAGGATCGGG AGTTCTCTTC CTTTCCGCTC TACGATACCA GCGGCCCCTA CTCTGATCCC TCGATCGTTA CGGATCCAAA AAAAGGACTG CTTCCGATTC GGGATACGTG GGGATTCAAT GACGGAAAAA CCTCGCCCTC TAACGACTCC TCGCTTCCCG TAACCATCCC CGTGCGAAAC CCCCTCAAAG CAAAGGAGGG CGTTGGCATC ACGCAAATGC ACTATGCGCG CAAAGGAGTT ATCACTCCGG AAATGGAGTA TGTCGCCATA CGGGAGAACC AGTTGCTTGA AAGCCGGGCA TCTTCGTTTC ACGGCAATCA TAACAACGCG AAACCTGTGA CGCCTGAATT TGTCCGGCAG GAGATTGCAT GCGGAAGAGC GATCATACCG GCAAACATCA ATCATCCCGA ACTTGAGCCG ATGATTATCG GAAAAAACTT CCGGGTAAAA ATCAATGCCA ATATCGGCAA CTCCGCCATG GGATCGTCAA TTGAACAAGA GGTCGAAAAA GCTGTCTGGG CTTGCCGATG GGGAGCTGAC ACGATTATGG ATCTTAGTAC AGGAACCAGC ATTCACAAAA CCCGTGAGTG GATAGTGAGG AACTCTCCGG TCCCTGTGGG AACCGTACCA ATATACCAGG CACTCGAAAA AGTTGGCGGT GTTTCGGAAG CGCTCACCTG GGAGGTTTAT CGCGATACCC TGATCGAACA GGCTGAACAG GGCGTTGATT ACTTCACCAT TCACGCAGGA ATTCTCCTTG AGCATCTTCC CTATGCTGAA AAGCGCCTTA CCGGCATTGT ATCACGTGGA GGCTCAATCA TGGCGAAATG GTGTCGGGCA CATAAGACGG AAAACTTTCT TTTCACCCAT TTTGAAGATA TATGCCGCAT CCTCAAGACC TACGATATCG CCATTTCGCT CGGCGATGCC CTCAGGCCAG GCTCAATCGG CGATGCCAAC GATGAGGCCC AGTTTGGTGA GCTTAAAGTC CTGGGTTCCT TAACGCTCGT CGCATGGGAG CATGATGTCC AGGTAATGAT TGAAGGCCCG GGGCACGTTC CGCTTCATCT GGTGCTTGAA AATATGGAAA AACAGCTTGA ACTCTGTCAT GAAGCTCCAT TTTATACGTT AGGACCGCTT GTCACCGATA TCGCTGCAGG ATATGACCAC ATCAATTCAG CAATCGGCGG CGCTCTGATT GCAAGTTACG GCTGTTCCAT GCTCTGCTAC GTTACTCCCA AAGAGCATCT CGGACTTCCT GATAAAAACG ATGTTCGCGA AGGGGTTATC GCTCACAGAG TTGCAGCGCA TGCCGCAGAC CTTGCAAAAG GAAACCATGC CGCATGGTTG CGCGATGAAC TCATGAGCCG GGCCCGCTAC TCTTTTGCCT GGGAAGATCA GTTCAATCTC TCTCTTGATC CTGAAAAAAC CCGCGAAGTT TACCGCCAGA GTATGGCTTC AAGCGTAAAC CTGAATAAAA ACGCGGATTT TTGCACCATG TGCGGACCGG ATTTCTGTTC GATGAAAAAA TCACGGGAGT TAAACGGGTA A
|
Protein sequence | MTLMTTSSET GFCTGNQTPG IASEKIYAEG TIFPLKIGMR QIKLSKTYTT KDREFSSFPL YDTSGPYSDP SIVTDPKKGL LPIRDTWGFN DGKTSPSNDS SLPVTIPVRN PLKAKEGVGI TQMHYARKGV ITPEMEYVAI RENQLLESRA SSFHGNHNNA KPVTPEFVRQ EIACGRAIIP ANINHPELEP MIIGKNFRVK INANIGNSAM GSSIEQEVEK AVWACRWGAD TIMDLSTGTS IHKTREWIVR NSPVPVGTVP IYQALEKVGG VSEALTWEVY RDTLIEQAEQ GVDYFTIHAG ILLEHLPYAE KRLTGIVSRG GSIMAKWCRA HKTENFLFTH FEDICRILKT YDIAISLGDA LRPGSIGDAN DEAQFGELKV LGSLTLVAWE HDVQVMIEGP GHVPLHLVLE NMEKQLELCH EAPFYTLGPL VTDIAAGYDH INSAIGGALI ASYGCSMLCY VTPKEHLGLP DKNDVREGVI AHRVAAHAAD LAKGNHAAWL RDELMSRARY SFAWEDQFNL SLDPEKTREV YRQSMASSVN LNKNADFCTM CGPDFCSMKK SRELNG
|
| |
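As a consistency check between the gene and protein sequences, the sketch below translates a nucleotide string with the codon assignments of translation table 11 and computes GC content. It rests on some assumptions: table 11 shares its codon-to-amino-acid assignments with the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG), the input contains only A, C, G, and T, and the helper names are illustrative. Only the first 30 and last 21 bases of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_30 = "ATGACTCTCA TGACTACTTC TTCAGAGACC"  # start of the gene sequence
last_21 = "TCACGGGAGT TAAACGGGTA A"            # end of the gene sequence

print(translate(first_30))  # MTLMTTSSET
print(translate(last_21))   # SRELNG
```

The two outputs match the first ten and last six residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.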