Gene Cthe_0418 details


Gene Information

Locus tag: Cthe_0418
Symbol:
ID: 4808421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 525212
End bp: 527314
Gene Length: 2103 bp
Protein Length: 700 aa
Translation table: 11
GC content: 42%
IMG OID: 640105832
Product: polynucleotide phosphorylase/polyadenylase
Protein accession: YP_001036849
Protein GI: 125972939
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase)
TIGRFAM ID: [TIGR03591] polyribonucleotide nucleotidyltransferase
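
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions made for illustration, not statements from the record. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 525212
end_bp = 527314

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the computed values agree with the listed Gene Length (2103 bp) and Protein Length (700 aa).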


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATAAAA CTTTCAGCAT GGAGCTAGCC GGAAGAACAC TTACTATTGA AACTGGGAAA 
CTTGCTCAAC TGGCAAACGG TTCAGTATTG GTCAGATACG GTGATACTGT TGTACTCTCA
ACAGCTACCG CTTCGGCAAC ACCAAGAGAG GGAGTCGATT TTTTTCCTTT GAGTGTGGAT
TATGAAGAAA GATTGTATGC TGTAGGAAAA ATTCCCGGAG GTTTTATAAA AAGAGAAGGT
AAACCGTCGG AAAAGGCAAT ACTTACAGCC AGAGTTATAG ATAGACCCTT AAGGCCTTTG
TTCCCCAAGG ACTTGAGGAA TGATGTGGCT ATTGTAAATA CAGTTTTGTC CGTTGATCAG
GACAATTCAC CGGAACTTGC CGCTTTATTG GGATCCTCCA TTGCCGTGTC AATTTCGGAC
ATACCGTTTA ACGGTCCTGT CGGAGCAGTT ATTCTGGGGC TTATTGACGG TGAAGTGATT
ATAAATCCGA CCGAAAAACA AAAAGAAATA AGCCAGATGT ATGTTACTTT GGCAGGCACA
AGGAATAAAA TTGTCATGAT AGAGGCAGGA GCAAACGAGG TTCCCGATGA AGTCATGCTG
GATGCCATCA AAAAAGGACA TGAGGAAATA AAGAAAATTG TTGACTTTAT TGACGGAATT
GTAAAGGAAG TCGGCAAACC TAAATTTGAA TATGAATCTG CGGAAGTTCC CGAGGAAATA
TTCAATGCCG TCAGGGAATA TGCTTATGAC AAGATGAGGG AAGCCGTACT TGCTGTGGAC
AAGCAGGTAA GAGACAAAAA CATTGATGAT CTTACAAAAG AAATAACGGA GCATTTTGCG
GAAGTGTTCC CTGAAATGGA ACCAGCCATT AAAGAAGCAA TATACAAACT GGAGAAGAAA
GTTGTAAGGG AATATATTTT GGAAGAGGGC AGAAGAGTTG ACGGCAGAAG ACTTGACGAA
ATAAGGCCTT TGTCGGCTGA AGTCGGGCTG CTTCCGAGAG TTCATGGTTC AGGTCTTTTC
ACAAGGGGAC AGACTCAGGT GCTTTCAAGT GTTACCTTGG GCGCCATGGG GGATGTTCAG
ATACTGGACG GTATTGACAC TGAAGAAACC AAAAGATATA TGCATCACTA TAATTTCCCT
GGATTCAGCG TTGGCGAAGC TAAGAGTTCA AGAGGTCCGG GAAGAAGAGA AATCGGTCAC
GGAGCTCTAG CGGAAAGAGC ATTGGAGCCT GTGATTCCAA GTGAAGAAGA ATTCCCTTAT
ACAATAAGGG TGGTATCTGA AGTTCTTATG TCAAATGGTT CCACATCTCA GGGAAGCGTT
TGCGGAAGCA CTCTGGCACT TATGGATGCA GGTGTGCCTA TCAAAAAGCC TGTTGCGGGA
ATTTCTGCCG GTTTGGTTGT TGACGAAAAT AATCCCGACA GGTTTGTTAC TTTTATGGAT
ATCCAAGGCA TAGAGGATTT CTTTGGAGAT ATGGACTTTA AAGTTGCCGG AACGAAGGAT
GGAATAACAG CCATTCAGGT TGATATAAAG ATAGACGGAC TTACGGAGGA AATTATAAAA
CAGGCATTTG AACTTACAAG AAAAGGTCGT TTGTATATTA TTGACAATGT GTTGCTGAAG
GCTATTCCGG AACCGAGAAA ACAAATGTCA AAATATGCGC CTAAGATTAT TTCAACTACC
ATAAATCCGG ATAAAATCAG GGAAGTAATA GGCCCCGGAG GTAAAATGAT AAACAAGATA
ATTGACGAAA CCGGAGTAAA GATTGATATA AACGACGACG GTAGAGTTTA TATATTCAGT
TCGGATATTC AAGCCGGAAA AAGAGCTCGC AGTATGATAG AGGCAATAGC AAAAGATATT
GAGCCAGGCC AGGTATTTTT GGGCAGAGTT ATCAGAGTTA CATCCTTTGG AGCTTTTGTA
GAGTTTCTTC CGGGAAAAGA AGGGCTTGTG CATATAAGCA AGCTGGACAA GAAGAGAGTC
GAAAGGGTTG AAGACATAGT AAGAGTCGGA GATCAGATAC TTGTTAAGGT TATAGAAATT
GACAAGCAGG GACGTGTAAA TCTTTCAAGA AAGGATGCAA TGGAAGATGA ATGGGATAAA
TAG
 
Protein sequence
MYKTFSMELA GRTLTIETGK LAQLANGSVL VRYGDTVVLS TATASATPRE GVDFFPLSVD 
YEERLYAVGK IPGGFIKREG KPSEKAILTA RVIDRPLRPL FPKDLRNDVA IVNTVLSVDQ
DNSPELAALL GSSIAVSISD IPFNGPVGAV ILGLIDGEVI INPTEKQKEI SQMYVTLAGT
RNKIVMIEAG ANEVPDEVML DAIKKGHEEI KKIVDFIDGI VKEVGKPKFE YESAEVPEEI
FNAVREYAYD KMREAVLAVD KQVRDKNIDD LTKEITEHFA EVFPEMEPAI KEAIYKLEKK
VVREYILEEG RRVDGRRLDE IRPLSAEVGL LPRVHGSGLF TRGQTQVLSS VTLGAMGDVQ
ILDGIDTEET KRYMHHYNFP GFSVGEAKSS RGPGRREIGH GALAERALEP VIPSEEEFPY
TIRVVSEVLM SNGSTSQGSV CGSTLALMDA GVPIKKPVAG ISAGLVVDEN NPDRFVTFMD
IQGIEDFFGD MDFKVAGTKD GITAIQVDIK IDGLTEEIIK QAFELTRKGR LYIIDNVLLK
AIPEPRKQMS KYAPKIISTT INPDKIREVI GPGGKMINKI IDETGVKIDI NDDGRVYIFS
SDIQAGKRAR SMIEAIAKDI EPGQVFLGRV IRVTSFGAFV EFLPGKEGLV HISKLDKKRV
ERVEDIVRVG DQILVKVIEI DKQGRVNLSR KDAMEDEWDK
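
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon-to-amino-acid map used by translation table 11 (whose sense-codon assignments match the standard code; the table differs mainly in which codons may serve as starts) and translates the first 60-base line of the gene sequence. The names `CODON_TABLE`, `translate`, and `FIRST_LINE` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Codon map in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Assumption: the sequence starts at the first base of the start codon.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
FIRST_LINE = "ATGTATAAAA CTTTCAGCAT GGAGCTAGCC GGAAGAACAC TTACTATTGA AACTGGGAAA"
print(translate(FIRST_LINE))  # MYKTFSMELAGRTLTIETGK
```

The output matches the first 20 residues of the protein sequence listed above; passing the full gene sequence to `translate` would allow the same comparison for the whole record.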