Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2694 |
Symbol | |
ID | 4810688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3179188 |
End bp | 3180786 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108113 |
Product | O-antigen polymerase |
Protein accession | YP_001039086 |
Protein GI | 125975176 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00327343 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTCATG ACAGTGTGCT TTTAAAGCCG GTTTTTCTTC TGGTTAAGCT TATAAATACT TCATATGAAA ACAGTATTTT AAAAAGGGTT GTCACTTCTG TCATCAACTT TTTTAAAACG CTTAACCGTT TTTATGAAAA CAGCGTTACT GCGCGCATAG CAAAATTGAT AACGGAAATG TTGTATAACA GTGCCATAAT TGGCTTTTTC ACAAGAAAAG GCAAAATGTC CAGGTGGTGG GAACATTCTC TGGCATACCG GATAATAAAC GGTTGCTTTG AGGTTCCCTC AAAGATTTTA AAGGGTTTTT ATCAGAAGCA TGAAGAAATA TTTTTGGAAA GTATTGCGGT AAAAGCGGCG AAGGCTTTGC TGCAAAGAAT TGAATATATA GTGGGGCTGT TTTTGATAGT CATTTTGGTT GTTCCCCATG AAAGATGGAA TAATGTTTAC AATGTGGCAA TAGCTTTGTT TCTTGTTTTT CTGATTTTTA TAAACACCAT AATTCAAAGA CACTGGATGT TTAATTTCAA GGCATTGGAC TGTACGCTCA TATTGTTTAT GACGGCTGTG GTTGTTTCTT TTGCCACTTC CATGAATATA TCAAGCAGTA TTAAATATAT GTTGTTTTAT GCAGCCTGCT TTCTGTTTGT TTTGATAATT GTCAGCACCG CAAAAAATGA AAATTCCCTT CAGACAATTA TTGAAATGAT GCTTTTTGGG GTTACGGCGA TAGGAATTTT CGGATTATGG CAGTCGATAA GCGGTGCCGT TGTTTTTGAC CCGTCCCTTA CTGACGTTGA GATGAATCAG GGCATGCCGG GAAGGATTTA TGCCACTATG AGCAACCCGA ACAATTTCGG TGAGATACTG GTGATGTCGC TTCCTTTTTA TGTAAGTGTG ATTTTGAATT CGAAAACTTT CTTCAAAAAA ATGATATATT TTATTATGGC GTTGCCGCCT CTGCTTGCCC TTTTTAATAC CGGTTCGAGG TCATCATGGA TAGGTTTTGC GGTTTCGGTA ATAGTGATAA CGTTTTTGCT CAATAAAAGG CTTTTGCCTT TTATAATATT GGGCGGGGTA ATGATGATAC CTTTCCTTCC CCAGTATGTT TACAACCGGA TTCTTACCAT TTTCAGAGCC GCACAGGATA CTTCAACCCA GACCAGGGTC AAGATTTTGC AGACTGTGGA ACCAATGTTC AAACACTATA TGCTGTCGGG GGTTGGCCTG GGTTCGGACA TATTCAGGCA ACTGATTCAG AATTACAAAT TGTATACAAA AGCAGTTCCG CCTCATACCC ATATTCTGTA TTTGCAGATA TGGATTGAAA TGGGGCTTGC CGGATTTGTC ACTTTCATGT GGTATATTTA CAGAACAATA AAGAATTCTG TTATTGGTAT ATATAATTCA AGCCTGACGC TTAAAACCAT ACTTGCCGCA GGGACGGCAG CTTTGTGCGG GATTTTGGTG ACGTCGCTGG TGGAGTACAG CTGGTATTAT CCAAGAGTGA TGGTTATGTT CTGGGTGCTT TTGGGGATAA TTGCCGCAGC GACATATTTG GCTGAGTTAA AGAACAGAAG GCTCTCGGAG ATGGACTGA
|
Protein sequence | MFHDSVLLKP VFLLVKLINT SYENSILKRV VTSVINFFKT LNRFYENSVT ARIAKLITEM LYNSAIIGFF TRKGKMSRWW EHSLAYRIIN GCFEVPSKIL KGFYQKHEEI FLESIAVKAA KALLQRIEYI VGLFLIVILV VPHERWNNVY NVAIALFLVF LIFINTIIQR HWMFNFKALD CTLILFMTAV VVSFATSMNI SSSIKYMLFY AACFLFVLII VSTAKNENSL QTIIEMMLFG VTAIGIFGLW QSISGAVVFD PSLTDVEMNQ GMPGRIYATM SNPNNFGEIL VMSLPFYVSV ILNSKTFFKK MIYFIMALPP LLALFNTGSR SSWIGFAVSV IVITFLLNKR LLPFIILGGV MMIPFLPQYV YNRILTIFRA AQDTSTQTRV KILQTVEPMF KHYMLSGVGL GSDIFRQLIQ NYKLYTKAVP PHTHILYLQI WIEMGLAGFV TFMWYIYRTI KNSVIGIYNS SLTLKTILAA GTAALCGILV TSLVEYSWYY PRVMVMFWVL LGIIAAATYL AELKNRRLSE MD
|
| |