Gene Cthe_2694 details

Gene Information

Locus tag: Cthe_2694
Symbol:
ID: 4810688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3179188
End bp: 3180786
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 39%
IMG OID: 640108113
Product: O-antigen polymerase
Protein accession: YP_001039086
Protein GI: 125975176
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
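
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3179188
end_bp = 3180786

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1599 532
```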


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00327343
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTCATG ACAGTGTGCT TTTAAAGCCG GTTTTTCTTC TGGTTAAGCT TATAAATACT 
TCATATGAAA ACAGTATTTT AAAAAGGGTT GTCACTTCTG TCATCAACTT TTTTAAAACG
CTTAACCGTT TTTATGAAAA CAGCGTTACT GCGCGCATAG CAAAATTGAT AACGGAAATG
TTGTATAACA GTGCCATAAT TGGCTTTTTC ACAAGAAAAG GCAAAATGTC CAGGTGGTGG
GAACATTCTC TGGCATACCG GATAATAAAC GGTTGCTTTG AGGTTCCCTC AAAGATTTTA
AAGGGTTTTT ATCAGAAGCA TGAAGAAATA TTTTTGGAAA GTATTGCGGT AAAAGCGGCG
AAGGCTTTGC TGCAAAGAAT TGAATATATA GTGGGGCTGT TTTTGATAGT CATTTTGGTT
GTTCCCCATG AAAGATGGAA TAATGTTTAC AATGTGGCAA TAGCTTTGTT TCTTGTTTTT
CTGATTTTTA TAAACACCAT AATTCAAAGA CACTGGATGT TTAATTTCAA GGCATTGGAC
TGTACGCTCA TATTGTTTAT GACGGCTGTG GTTGTTTCTT TTGCCACTTC CATGAATATA
TCAAGCAGTA TTAAATATAT GTTGTTTTAT GCAGCCTGCT TTCTGTTTGT TTTGATAATT
GTCAGCACCG CAAAAAATGA AAATTCCCTT CAGACAATTA TTGAAATGAT GCTTTTTGGG
GTTACGGCGA TAGGAATTTT CGGATTATGG CAGTCGATAA GCGGTGCCGT TGTTTTTGAC
CCGTCCCTTA CTGACGTTGA GATGAATCAG GGCATGCCGG GAAGGATTTA TGCCACTATG
AGCAACCCGA ACAATTTCGG TGAGATACTG GTGATGTCGC TTCCTTTTTA TGTAAGTGTG
ATTTTGAATT CGAAAACTTT CTTCAAAAAA ATGATATATT TTATTATGGC GTTGCCGCCT
CTGCTTGCCC TTTTTAATAC CGGTTCGAGG TCATCATGGA TAGGTTTTGC GGTTTCGGTA
ATAGTGATAA CGTTTTTGCT CAATAAAAGG CTTTTGCCTT TTATAATATT GGGCGGGGTA
ATGATGATAC CTTTCCTTCC CCAGTATGTT TACAACCGGA TTCTTACCAT TTTCAGAGCC
GCACAGGATA CTTCAACCCA GACCAGGGTC AAGATTTTGC AGACTGTGGA ACCAATGTTC
AAACACTATA TGCTGTCGGG GGTTGGCCTG GGTTCGGACA TATTCAGGCA ACTGATTCAG
AATTACAAAT TGTATACAAA AGCAGTTCCG CCTCATACCC ATATTCTGTA TTTGCAGATA
TGGATTGAAA TGGGGCTTGC CGGATTTGTC ACTTTCATGT GGTATATTTA CAGAACAATA
AAGAATTCTG TTATTGGTAT ATATAATTCA AGCCTGACGC TTAAAACCAT ACTTGCCGCA
GGGACGGCAG CTTTGTGCGG GATTTTGGTG ACGTCGCTGG TGGAGTACAG CTGGTATTAT
CCAAGAGTGA TGGTTATGTT CTGGGTGCTT TTGGGGATAA TTGCCGCAGC GACATATTTG
GCTGAGTTAA AGAACAGAAG GCTCTCGGAG ATGGACTGA
 
Protein sequence
MFHDSVLLKP VFLLVKLINT SYENSILKRV VTSVINFFKT LNRFYENSVT ARIAKLITEM 
LYNSAIIGFF TRKGKMSRWW EHSLAYRIIN GCFEVPSKIL KGFYQKHEEI FLESIAVKAA
KALLQRIEYI VGLFLIVILV VPHERWNNVY NVAIALFLVF LIFINTIIQR HWMFNFKALD
CTLILFMTAV VVSFATSMNI SSSIKYMLFY AACFLFVLII VSTAKNENSL QTIIEMMLFG
VTAIGIFGLW QSISGAVVFD PSLTDVEMNQ GMPGRIYATM SNPNNFGEIL VMSLPFYVSV
ILNSKTFFKK MIYFIMALPP LLALFNTGSR SSWIGFAVSV IVITFLLNKR LLPFIILGGV
MMIPFLPQYV YNRILTIFRA AQDTSTQTRV KILQTVEPMF KHYMLSGVGL GSDIFRQLIQ
NYKLYTKAVP PHTHILYLQI WIEMGLAGFV TFMWYIYRTI KNSVIGIYNS SLTLKTILAA
GTAALCGILV TSLVEYSWYY PRVMVMFWVL LGIIAAATYL AELKNRRLSE MD
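
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, which permits TTG as an initiation codon. The following is a minimal sketch of that translation in plain Python, not the method used to produce this record. The `translate` function and its `at_cds_start` parameter are hypothetical names introduced here, and the sketch assumes the input contains only A, C, G and T plus whitespace. It is applied to the first and last lines of the gene sequence shown above.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "TTG", "CTG", "GTG", "ATT", "ATC", "ATA"}


def translate(dna, at_cds_start=True):
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    If at_cds_start is true, a permitted initiation codon in the first
    position is read as M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and at_cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence (start of the CDS).
head = translate("TTGTTTCATG ACAGTGTGCT TTTAAAGCCG")
# Last 39 bases of the gene sequence (in frame, ends with the stop codon TGA).
tail = translate("GCTGAGTTAA AGAACAGAAG GCTCTCGGAG ATGGACTGA", at_cds_start=False)

print(head)  # MFHDSVLLKP
print(tail)  # AELKNRRLSEMD
```

Both fragments match the corresponding ends of the protein sequence listed above.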