Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2347 |
Symbol | |
ID | 4808981 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2798883 |
End bp | 2800163 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107754 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_001038742 |
Protein GI | 125974832 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000000937745 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAGA TAAGCTTGAG AGAATTAATT GAAATAATCA TTAAAGGTAA ATGGATTATA GTAGCTGTAA CTGCAGTTTG CATAGCTATT GGCATTGCAG TAAACGCATT TGTTATAAAG CCGGTTTATG TGGCACAAAC GACGCTGATG ATTTCATCCA TTAAAAAGAG TCAAATAAAA GAACAGGTAA AGTCAGGAGA TGGTGACGTT AACAATTTTT CATCCTTAAT CGATGACATA TTGCAGTACC CGGAGATGTC TGTTGATAAC TATAGGGAAC AAGTAAAAAA TCCTGTGATA CTGGAGTATA TTCGTGAAGA AATGGATATG AAAGATGTTC CCCTAAGTTC AATAGCATCT AAAATAACCT TAAATGAAAT AAAAGATACA GATTTGATTA CAATAAAGGT GACTGATGAG AATCCTGAGA CAGCAGCAAA GATAGCAAAT CTGGTTGGCG ACAGGTTTGC AAAACTTGTG TGCGAAACAA ACCAAAAACG GACTGAGAGT ACACTGGAGT TTATTGAAAA TCAGATGAAA AAAGAAAAAG AGAACATGGA AAAATTGTTG GGAGAATATA AAAGTTATCT GTTGCAATCC AGAGGACCTG AAGAGGTTAA GATGGAGCTG GATGCAAAAC TGGAAAAAAT GACTGAATAC AAAACCCAAT TATCGCAAAT CAAAATTGAT GAGAATGCCA CGAAAGCGTC TTTAGATACC GCAAAAAATT TGATAAACAA AACACCGCAA AAATTAGTTA CAGATAGTTC GCTTTTAACT AATCCGTTGC TTTCGGCAGT GATTAAAGAA AAAACAGGTA TTAGTTCGGA AGAACTGGCA AGCATGAAAA TGTCAACAGA ACAAATAAAT ATTATATACG TTGAATTGTC CAACATAATT AATGAATTGG AAATTCGGCT GTCAAACCTG GAAGCTCAGA GAATAAATAT CGAAAAGGTC ATTCAGGAAT GTCAGAAGGA AATTGAAAAT CTCCAGACAG AATATGCGGA AAAACAACAG GAGTACGAGA TTCTGAAAAA GGAGCTCGAC TTGTCGAAAG AAGTATATAA TGCATATCAG CAAAAATATA AAGAGTCAAT GATTATGCAG TCTGCAGAGA CAGGCAGATC AAGTGCGGTA ATAGTATCTG AGGCCATTCC GCCCGCTAAT CCTGTTGCTC CAAAAAAGGC TTTGAATGTG GCTGTTGCAG GAGTAGTGGG AGTCGGAATC AGTTTTGCTA TAATATTTAT AAAGGAATAT TTAATTAGAA GCAAACAGTG A
|
Protein sequence | MEEISLRELI EIIIKGKWII VAVTAVCIAI GIAVNAFVIK PVYVAQTTLM ISSIKKSQIK EQVKSGDGDV NNFSSLIDDI LQYPEMSVDN YREQVKNPVI LEYIREEMDM KDVPLSSIAS KITLNEIKDT DLITIKVTDE NPETAAKIAN LVGDRFAKLV CETNQKRTES TLEFIENQMK KEKENMEKLL GEYKSYLLQS RGPEEVKMEL DAKLEKMTEY KTQLSQIKID ENATKASLDT AKNLINKTPQ KLVTDSSLLT NPLLSAVIKE KTGISSEELA SMKMSTEQIN IIYVELSNII NELEIRLSNL EAQRINIEKV IQECQKEIEN LQTEYAEKQQ EYEILKKELD LSKEVYNAYQ QKYKESMIMQ SAETGRSSAV IVSEAIPPAN PVAPKKALNV AVAGVVGVGI SFAIIFIKEY LIRSKQ
|
| |