Gene Cthe_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2638 
Symbol 
ID4808949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3119873 
End bp3121159 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content34% 
IMG OID640108051 
ProductO-antigen polymerase 
Protein accessionYP_001039030 
Protein GI125975120 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTAA AGGCAATTTA TTTTTTGGTT CTGATTGCAA GCCTTATAAA TATGGCAATA 
CTTCTTCGAA GCAGCACAAA CAAGAAAAGA ATTGCGGCGG ATTCCATATT TTTATTTTTT
GTACTGACAC TTCCCATTAT AGGCAGGTAT GAATCCCTGG CTCTTTTTAT AATATCGTTT
TTGTTAATGC TACGTAAAAT CCGAATAGAA AAAAACGCAT TGACGTTTAT ATATTTTCTT
ATTGTATTGG TGGGATTCCT GGGTATATTC AATTCCATTA ACATAATAAC TTCGTTCCGG
GAACTTCTTA TAGTCGCATC ATATTTTCTA ATCAGCATTA CAGCTACCGC ACTGGTTTAT
GAATTCAAGG AAGCGTTTGT TGAAAAAATC TTTCGATTTA TAATAGTTTC GGCCTTAAGT
GTATCATGTT TAAATCTTAT TGATTTAATC GGCCTCTTTA ATTTTAGAAA ATATATAATT
TTGTTGAAAA CCAACAACCA TTTTGCATAT TATATCGGAG GAATATTAAT AATTGTCATT
GTAATGCTGT ATGACAAAAA CAAAACTTTC AGGAAAATAA ATCTCTTGCC TGTTATCGTT
TTTGTTACTT CTTTAATTTC CGCAGGCAGC AGGGGCTCAT TGGTTGCTAT CGTTCTAACT
TTGGGAGTCT TCGCTGTTTA TAATAAAAAA TATAAAACAG TATTTATTAT GGCGTTGATT
TTTATTATAG TATATTTTAA CTTTGACAGC CTGCCCTACA ATATTCAAAG AACCCTGCTG
TCAATAACCA ACAAAACTGC CACTTTTTCA AATATAGAAA GGCTCGCCAT CTGGAGTGCA
AGCATACAGA TGATTAAAGA GCATCCTTTG GTGGGAGTTG GAATTGACAA CTGGAGGTAT
TATTATTTGC TGCCGGAATA TATGCAATCC ATAAATACCT ATCCCCACGC CCACAATTTT
TATCTTCAGA CGGGTAGTGA AATGGGAGTT ACAGGATTGA TTCTCTATCT TTTTATGTTT
GCCATAAATA TATACTACTC CATTTATGTT TTTAAAAGAA CAAAAAACGA AATAAGAAAA
TGCATCGCGC TCTCCTCAAT AATGTTTTTT GTCTTCACAA TTATATACTG TATGGTCAAC
AATCCTCTTT TCAACAGCAA ACCTGCCATG CTTTTTTATA TAATTGCGGG GTTAAATGCG
GGAATTTACA AACTGACCGG AGAGGAAGTG AAAATCCGTG AAAAAAAGGC TGACAATACT
GCAAATCAGA TCCGAATTCA GGGATAA
 
Protein sequence
MALKAIYFLV LIASLINMAI LLRSSTNKKR IAADSIFLFF VLTLPIIGRY ESLALFIISF 
LLMLRKIRIE KNALTFIYFL IVLVGFLGIF NSINIITSFR ELLIVASYFL ISITATALVY
EFKEAFVEKI FRFIIVSALS VSCLNLIDLI GLFNFRKYII LLKTNNHFAY YIGGILIIVI
VMLYDKNKTF RKINLLPVIV FVTSLISAGS RGSLVAIVLT LGVFAVYNKK YKTVFIMALI
FIIVYFNFDS LPYNIQRTLL SITNKTATFS NIERLAIWSA SIQMIKEHPL VGVGIDNWRY
YYLLPEYMQS INTYPHAHNF YLQTGSEMGV TGLILYLFMF AINIYYSIYV FKRTKNEIRK
CIALSSIMFF VFTIIYCMVN NPLFNSKPAM LFYIIAGLNA GIYKLTGEEV KIREKKADNT
ANQIRIQG