Gene Information |
Locus tag | Cthe_2346 |
Symbol | |
ID | 4808980 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2795812 |
End bp | 2798841 |
Gene Length | 3030 bp |
Protein Length | 1009 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640107753 |
Product | O-antigen polymerase |
Protein accession | YP_001038741 |
Protein GI | 125974831 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
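The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2795812
END_BP = 2798841
GENE_LENGTH_BP = 3030
PROTEIN_LENGTH_AA = 1009


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (3030 bp) and Protein Length (1009 aa).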
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0470801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAGAA GCAATAAAAA TTCGAATAAA GGGAAAGCAA AAAAGGCAGT AAAATCTATA AAAGAAGAAA ACAAGTACGG TAAATTTAGA ATGTTAATAT TAATATTAAA TCTAATTGTA ATATTTTATT CACCTTTTGT AAGAGGGCTG TATTTTGAAG CAGAACAGCT GCCGGCAGAA ATATTTGTAT TAGTAAGCTT TGCTGTATTC TGGATATTTA AGTATATGGA GAAAGAAAAA AAATTCATAT CGACTCCGAT AGAGTACTGT TCATTTGGGT TGATGATTGC TTATTTTATA TCGATACTTG GTTCTGTCAG CACAAGGCTT GCAATTTCAG AATGGCTGAA ATATTGTATG TATTTTGCTG TGTTTTTCAT GATTACCGAT TTGGCTTCAA CAATGAAAGA CAAGCTTATA GTATTATGGA CTGTTATTGC AGCATCAGTG GGTTTATGTG TTGTTGGCCT TGACAGTGCA TCTGGAGGTA AACTTGTTGA TTGGCTGAAT AATGTGTTTG ATTTCCTGCA TATACCGGTT GAATTTTTTG GTCTTTACGT TGAAGGACGT ATTCATTCGA CGATCCAATA TCCCAATGCA CTGGCTGCGT ATCTTATGGC AGTATTCTTT GTAACTTTGA CGATATCCAT AATATCGTCA AAAATTTGGC AAAGACTGAT TGCAGGAGTA TGCAGCTTTG TATTTGTAAC GACGATAATA CTTACTTTGA GTCGTGGTGT TATGATATTA ATACCCATTG TGCTGATATT ATATCTTGTT GTAATTCCTG AGGGCAGTAA GCTGAGAGCT TTTTTAATGG CTTTATGTGC TGCAGTTTCA AGTGTAATAC CTGTGCTGTT TTCACCTTTG GCAGGCAGAA GCCGCTCGAA CCTTTGGCTG GGAATTGCAT TGGGAATAAT AGTTTCCTTG ATTTTAACAG TGGCAGTGGA GTTTTTATTT AGGCTCCGTT TAAAAGTAAT GCCAAGGCTA AAGCTAAAGC CTTATTTCTT GTTCATCCCT GCTGCTTTGG TATTGGCCGG AATATTGATT GTTATAAGTA TCCCGAAAGA GCTGGAGCTT GAAATCTATA ATCCCGAAAG GGGAACGAAG TATTCTTTTC AGAAGAGTAT TGCATTGAAG CCGGGTAAAG AGTATAAGCT GTTACTTGAT GTGTCATATA TGAATAATGA TGGTGAAAAT TCATTAACAG TATTGATTGG CAGCCGGGAT AAAAAGAATA TAATGTTCGG AGGAAACACA AAACTTGCAG AGATTAACGA AAAGAACAGT GATACTCTGG AAATTCCGTT TACAGTGCCA CAGGGCGGTT CATTGGTAGA TATTAGAATT ACCAATAACT CAGAGAAGTC AAAGGTATTA ATTGATAATG CTAAAATTAT TGATGGCAAA ACCGGGAAAA GTGTAAAAAA TGTTAAACTT CAATACAAAT TTATACCGAA TTCCGTAGCC TCAAGATTTG AAAATCTGAT GATATCAAGA AGTTTTATTC AAAGACAGAT ATATTTAAAT GACGGATTTC AGATGTTCAA AGATAATTGG CTGATTGGAG CCGGAGGAGG GGCATGGCCG AGCCTTGTTT TTGCATACCA GTCCTACCCG TACTGGTCCA CGCAATCCCA TGCATATTTT CTGCAGGTTG CGGTAGAGAC AGGCATAATT GGGTTGATAG TCCTGATAAT GCTGCTGTTA TCAATTGTTG TACAGTTTAT TACGGAATAT AAATATAAAA AAGAAGAAGA TGTTAATTAC AGGATTTTGC AAGGGACACT GTTAACATCC 
ATATTCGGAA TGTTTCTGCA TTCGTGCTTG GATTTTGATT TATCAATATC CTCAGTATTC CTGCTGTTGT GGACATTAAT GGCATTGTTT AATTCTGGCT ACAGACACAA TAGGCCTGTT GTAAAAGGAA ATGACGGCAC CGGTTCGAAG CCTGGTTTAT TTTACAGGCT AAACGAATTA AAGCCGTTTA ATACCAACCC AATAGTAATG ACTGTTTTGT CCTTTGCAAT TATGATAATG CCGGTATTAT TCGCGGCCGC TTCAAGTTTT GACCGCAAAT ACGAAAAATC CATGTCTGAA GGCAACAGAG AAAATGCGCT GATATATATA AGAAGTGCGG AGTCATTAGA TACTTTTAAT GCTGATTATA AAGTAAAATA TGCAAATTTG CTTTTATCCT CAGAAGGTCT TACGAAAGAA GATTTTGAAA CTGCCAAAAA ATTGGTAAGT AGCGCGGAAA AAGCAGGCAA ATACAGCGCC GAAACATTAC AGAATGCTGC CATACTATAT ATGAAGATGT CCATGTTTGA CAAAGGAATT GAGCTTGTAG ACAGGGCTAT TGAGTTAAAG CCGTTTTATG AAGAAGGCTG GCAGCTTAAA ATGAATATGC ACTACCAGCT TGCCCTTGCG TATTTAAAAA ATGATGAGCA TGAGAATGCC AAGAAGCACC TCGATTTGGC ACTTAGTGTA ATCAGCAATG CGAAAGCAAA AAATGAAAGA AATATGGATC CGTTTGCATT TAGTGAAAAG ACAATGGAGT ATCTGGAAAA AATGGTTTAT ATGAAAGAAA ATTTCGACAA CCTGAATTTG GGACAGGTAG ATAAAGTTAA ATTCCAAAGC ATAAATGAAA TGGATATAGA CTCCGACAAT ATACCTGATC AGTGGAACAT TGTCCAAAAA GAAAGGGTGG AGTTAAGTAT CAGCGAGGGT AATATTTTAG TTAATAATAT AAATGACGAT ACATTAGGCT CTTTTCAGAC CAGAAATATA AATTTTGAAG CCGGTAAGAA CTATAGGATT GAATTAGCAC TCGACAATCA GGAAGACATT AATGTACTGT ATTTTGTTCC CGAATTGCAT ACGAAATTTG TACAGCTTGA AAAAACAGGA GAAGGAAAAT ATTCGGCAAA TATTGAACTT CCGTCAGATT ATAAAGCAGA GAATACTTTT ATTAGGTTCC GTTTTTCAAA GGATTCATCG ATAAAAAGCT TGATTGTTAC TGAAATATAA
|
Protein sequence | MGRSNKNSNK GKAKKAVKSI KEENKYGKFR MLILILNLIV IFYSPFVRGL YFEAEQLPAE IFVLVSFAVF WIFKYMEKEK KFISTPIEYC SFGLMIAYFI SILGSVSTRL AISEWLKYCM YFAVFFMITD LASTMKDKLI VLWTVIAASV GLCVVGLDSA SGGKLVDWLN NVFDFLHIPV EFFGLYVEGR IHSTIQYPNA LAAYLMAVFF VTLTISIISS KIWQRLIAGV CSFVFVTTII LTLSRGVMIL IPIVLILYLV VIPEGSKLRA FLMALCAAVS SVIPVLFSPL AGRSRSNLWL GIALGIIVSL ILTVAVEFLF RLRLKVMPRL KLKPYFLFIP AALVLAGILI VISIPKELEL EIYNPERGTK YSFQKSIALK PGKEYKLLLD VSYMNNDGEN SLTVLIGSRD KKNIMFGGNT KLAEINEKNS DTLEIPFTVP QGGSLVDIRI TNNSEKSKVL IDNAKIIDGK TGKSVKNVKL QYKFIPNSVA SRFENLMISR SFIQRQIYLN DGFQMFKDNW LIGAGGGAWP SLVFAYQSYP YWSTQSHAYF LQVAVETGII GLIVLIMLLL SIVVQFITEY KYKKEEDVNY RILQGTLLTS IFGMFLHSCL DFDLSISSVF LLLWTLMALF NSGYRHNRPV VKGNDGTGSK PGLFYRLNEL KPFNTNPIVM TVLSFAIMIM PVLFAAASSF DRKYEKSMSE GNRENALIYI RSAESLDTFN ADYKVKYANL LLSSEGLTKE DFETAKKLVS SAEKAGKYSA ETLQNAAILY MKMSMFDKGI ELVDRAIELK PFYEEGWQLK MNMHYQLALA YLKNDEHENA KKHLDLALSV ISNAKAKNER NMDPFAFSEK TMEYLEKMVY MKENFDNLNL GQVDKVKFQS INEMDIDSDN IPDQWNIVQK ERVELSISEG NILVNNINDD TLGSFQTRNI NFEAGKNYRI ELALDNQEDI NVLYFVPELH TKFVQLEKTG EGKYSANIEL PSDYKAENTF IRFRFSKDSS IKSLIVTEI
|
| |