Gene Cthe_2346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2346 
Symbol 
ID4808980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2795812 
End bp2798841 
Gene Length3030 bp 
Protein Length1009 aa 
Translation table11 
GC content36% 
IMG OID640107753 
ProductO-antigen polymerase 
Protein accessionYP_001038741 
Protein GI125974831 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0470801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGAA GCAATAAAAA TTCGAATAAA GGGAAAGCAA AAAAGGCAGT AAAATCTATA 
AAAGAAGAAA ACAAGTACGG TAAATTTAGA ATGTTAATAT TAATATTAAA TCTAATTGTA
ATATTTTATT CACCTTTTGT AAGAGGGCTG TATTTTGAAG CAGAACAGCT GCCGGCAGAA
ATATTTGTAT TAGTAAGCTT TGCTGTATTC TGGATATTTA AGTATATGGA GAAAGAAAAA
AAATTCATAT CGACTCCGAT AGAGTACTGT TCATTTGGGT TGATGATTGC TTATTTTATA
TCGATACTTG GTTCTGTCAG CACAAGGCTT GCAATTTCAG AATGGCTGAA ATATTGTATG
TATTTTGCTG TGTTTTTCAT GATTACCGAT TTGGCTTCAA CAATGAAAGA CAAGCTTATA
GTATTATGGA CTGTTATTGC AGCATCAGTG GGTTTATGTG TTGTTGGCCT TGACAGTGCA
TCTGGAGGTA AACTTGTTGA TTGGCTGAAT AATGTGTTTG ATTTCCTGCA TATACCGGTT
GAATTTTTTG GTCTTTACGT TGAAGGACGT ATTCATTCGA CGATCCAATA TCCCAATGCA
CTGGCTGCGT ATCTTATGGC AGTATTCTTT GTAACTTTGA CGATATCCAT AATATCGTCA
AAAATTTGGC AAAGACTGAT TGCAGGAGTA TGCAGCTTTG TATTTGTAAC GACGATAATA
CTTACTTTGA GTCGTGGTGT TATGATATTA ATACCCATTG TGCTGATATT ATATCTTGTT
GTAATTCCTG AGGGCAGTAA GCTGAGAGCT TTTTTAATGG CTTTATGTGC TGCAGTTTCA
AGTGTAATAC CTGTGCTGTT TTCACCTTTG GCAGGCAGAA GCCGCTCGAA CCTTTGGCTG
GGAATTGCAT TGGGAATAAT AGTTTCCTTG ATTTTAACAG TGGCAGTGGA GTTTTTATTT
AGGCTCCGTT TAAAAGTAAT GCCAAGGCTA AAGCTAAAGC CTTATTTCTT GTTCATCCCT
GCTGCTTTGG TATTGGCCGG AATATTGATT GTTATAAGTA TCCCGAAAGA GCTGGAGCTT
GAAATCTATA ATCCCGAAAG GGGAACGAAG TATTCTTTTC AGAAGAGTAT TGCATTGAAG
CCGGGTAAAG AGTATAAGCT GTTACTTGAT GTGTCATATA TGAATAATGA TGGTGAAAAT
TCATTAACAG TATTGATTGG CAGCCGGGAT AAAAAGAATA TAATGTTCGG AGGAAACACA
AAACTTGCAG AGATTAACGA AAAGAACAGT GATACTCTGG AAATTCCGTT TACAGTGCCA
CAGGGCGGTT CATTGGTAGA TATTAGAATT ACCAATAACT CAGAGAAGTC AAAGGTATTA
ATTGATAATG CTAAAATTAT TGATGGCAAA ACCGGGAAAA GTGTAAAAAA TGTTAAACTT
CAATACAAAT TTATACCGAA TTCCGTAGCC TCAAGATTTG AAAATCTGAT GATATCAAGA
AGTTTTATTC AAAGACAGAT ATATTTAAAT GACGGATTTC AGATGTTCAA AGATAATTGG
CTGATTGGAG CCGGAGGAGG GGCATGGCCG AGCCTTGTTT TTGCATACCA GTCCTACCCG
TACTGGTCCA CGCAATCCCA TGCATATTTT CTGCAGGTTG CGGTAGAGAC AGGCATAATT
GGGTTGATAG TCCTGATAAT GCTGCTGTTA TCAATTGTTG TACAGTTTAT TACGGAATAT
AAATATAAAA AAGAAGAAGA TGTTAATTAC AGGATTTTGC AAGGGACACT GTTAACATCC
ATATTCGGAA TGTTTCTGCA TTCGTGCTTG GATTTTGATT TATCAATATC CTCAGTATTC
CTGCTGTTGT GGACATTAAT GGCATTGTTT AATTCTGGCT ACAGACACAA TAGGCCTGTT
GTAAAAGGAA ATGACGGCAC CGGTTCGAAG CCTGGTTTAT TTTACAGGCT AAACGAATTA
AAGCCGTTTA ATACCAACCC AATAGTAATG ACTGTTTTGT CCTTTGCAAT TATGATAATG
CCGGTATTAT TCGCGGCCGC TTCAAGTTTT GACCGCAAAT ACGAAAAATC CATGTCTGAA
GGCAACAGAG AAAATGCGCT GATATATATA AGAAGTGCGG AGTCATTAGA TACTTTTAAT
GCTGATTATA AAGTAAAATA TGCAAATTTG CTTTTATCCT CAGAAGGTCT TACGAAAGAA
GATTTTGAAA CTGCCAAAAA ATTGGTAAGT AGCGCGGAAA AAGCAGGCAA ATACAGCGCC
GAAACATTAC AGAATGCTGC CATACTATAT ATGAAGATGT CCATGTTTGA CAAAGGAATT
GAGCTTGTAG ACAGGGCTAT TGAGTTAAAG CCGTTTTATG AAGAAGGCTG GCAGCTTAAA
ATGAATATGC ACTACCAGCT TGCCCTTGCG TATTTAAAAA ATGATGAGCA TGAGAATGCC
AAGAAGCACC TCGATTTGGC ACTTAGTGTA ATCAGCAATG CGAAAGCAAA AAATGAAAGA
AATATGGATC CGTTTGCATT TAGTGAAAAG ACAATGGAGT ATCTGGAAAA AATGGTTTAT
ATGAAAGAAA ATTTCGACAA CCTGAATTTG GGACAGGTAG ATAAAGTTAA ATTCCAAAGC
ATAAATGAAA TGGATATAGA CTCCGACAAT ATACCTGATC AGTGGAACAT TGTCCAAAAA
GAAAGGGTGG AGTTAAGTAT CAGCGAGGGT AATATTTTAG TTAATAATAT AAATGACGAT
ACATTAGGCT CTTTTCAGAC CAGAAATATA AATTTTGAAG CCGGTAAGAA CTATAGGATT
GAATTAGCAC TCGACAATCA GGAAGACATT AATGTACTGT ATTTTGTTCC CGAATTGCAT
ACGAAATTTG TACAGCTTGA AAAAACAGGA GAAGGAAAAT ATTCGGCAAA TATTGAACTT
CCGTCAGATT ATAAAGCAGA GAATACTTTT ATTAGGTTCC GTTTTTCAAA GGATTCATCG
ATAAAAAGCT TGATTGTTAC TGAAATATAA
 
Protein sequence
MGRSNKNSNK GKAKKAVKSI KEENKYGKFR MLILILNLIV IFYSPFVRGL YFEAEQLPAE 
IFVLVSFAVF WIFKYMEKEK KFISTPIEYC SFGLMIAYFI SILGSVSTRL AISEWLKYCM
YFAVFFMITD LASTMKDKLI VLWTVIAASV GLCVVGLDSA SGGKLVDWLN NVFDFLHIPV
EFFGLYVEGR IHSTIQYPNA LAAYLMAVFF VTLTISIISS KIWQRLIAGV CSFVFVTTII
LTLSRGVMIL IPIVLILYLV VIPEGSKLRA FLMALCAAVS SVIPVLFSPL AGRSRSNLWL
GIALGIIVSL ILTVAVEFLF RLRLKVMPRL KLKPYFLFIP AALVLAGILI VISIPKELEL
EIYNPERGTK YSFQKSIALK PGKEYKLLLD VSYMNNDGEN SLTVLIGSRD KKNIMFGGNT
KLAEINEKNS DTLEIPFTVP QGGSLVDIRI TNNSEKSKVL IDNAKIIDGK TGKSVKNVKL
QYKFIPNSVA SRFENLMISR SFIQRQIYLN DGFQMFKDNW LIGAGGGAWP SLVFAYQSYP
YWSTQSHAYF LQVAVETGII GLIVLIMLLL SIVVQFITEY KYKKEEDVNY RILQGTLLTS
IFGMFLHSCL DFDLSISSVF LLLWTLMALF NSGYRHNRPV VKGNDGTGSK PGLFYRLNEL
KPFNTNPIVM TVLSFAIMIM PVLFAAASSF DRKYEKSMSE GNRENALIYI RSAESLDTFN
ADYKVKYANL LLSSEGLTKE DFETAKKLVS SAEKAGKYSA ETLQNAAILY MKMSMFDKGI
ELVDRAIELK PFYEEGWQLK MNMHYQLALA YLKNDEHENA KKHLDLALSV ISNAKAKNER
NMDPFAFSEK TMEYLEKMVY MKENFDNLNL GQVDKVKFQS INEMDIDSDN IPDQWNIVQK
ERVELSISEG NILVNNINDD TLGSFQTRNI NFEAGKNYRI ELALDNQEDI NVLYFVPELH
TKFVQLEKTG EGKYSANIEL PSDYKAENTF IRFRFSKDSS IKSLIVTEI