Gene Moth_1056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1056 
Symbol 
ID3833320 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1086482 
End bp1088725 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content57% 
IMG OID637828984 
Productpolynucleotide phosphorylase/polyadenylase 
Protein accessionYP_429913 
Protein GI83589904 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1185] Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) 
TIGRFAM ID[TIGR03591] polyribonucleotide nucleotidyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000737281 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00038852 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAAGGGG TATTACGTAA AACCCTGGTT GTTGCCGGCC GGGATCTGAC CCTGGAAACC 
GGACGCCTGG CCAAACAGGC CGGGGGAGCA GTTATGGTTA GTTATGGTGG CACCATGGTC
CTGGTAACGG CAACAGCTTC AGAGGAACCC CGGGAAGGAA TCGACTTCTT TCCCCTTACC
GTTGACTACG AAGAGAGGCT CTATGCTGCC GGTAAAATCC CGGGAGGGTT TATTAAACGG
GAAGGACGGC CCAGCGAGAA GGCCATCCTC TCGGCCAGGT TGATCGACCG CCCCATCCGG
CCCCTTTTCC CTAAATTCTA CCGCAACGAT GTCCACGTGG TGGCTACGGT GATGTCGGTC
GACCAGGATT GTCCTCCCAA TGTGGCTGGT ATTATCGGGG CGTCGGCAGC CTTGATGCTC
TCGGCGATAC CCTTTGCCGG GCCCATTGGG GCGGTAAGTG TCGGATTAAT CGACAACCGC
CCGGTTATTA ACCCGACCCT GGAGGAAGAT TCCCGGAGCA GTCTAAATCT TACCGTGGCC
GGCACAGCCA ACGCCATCAT GATGGTAGAA GCCGGCGCTA AAGAAGTACC TGAAGACCTG
ATGCTGGAAT GTATAATGCA AGGTCATGAA GAGATCAAGC GGATTGTTGC CTTTATCAAT
GAATTTCGTG CAGAAGCCCT GGCCATGGGC CTGGCGAAGG AAAAACCGGA GCTAGTTGCC
CCCCAGCTAG ACCCGGCATG GGAGAGCCGG GTGCGGGAGA TTGCTACTCC CAGGCTTCGT
GAGGTTATTT ATCGCAGCCG TGACGAGAAG TGGAGCAAGC AGGAACGGGA TAAACAGTTA
GACGCCTGCC GGGAGGAAAT CAACAACTTG ATCCTGGAGG GCCAGGAAGA GGCCCTGGCA
GCCAACCCCG AACTCCCTGG CCTGATAAAA GAGCTGATTA CCAAGATCGA GAAGGAAATA
GTCCGCCGCA TGATCCTTAC GGAAGGTATC CGGGTGGATG GGCGCACCCT GGAAGAGATC
CGGCCCATCA CCTGCGAGGT GGGGGTCTTA AGCCGCACCC ATGGTTCCGG TCTCTTCACC
CGGGGAGAAA CCCAGGTTTT GACGGTCACT ACACTGGGGC CCATTAGCGA TGAGCAGATC
CTCGACGACC TGGGAGTCGA TGAATCGAAA CGCTATATGC ATCATTACAA TTTCCCGCCC
TACAGCGTCG GTGAAGCACG CCCCATCAGA GCCCCCGGGC GGCGGGAGAT TGGCCATGGA
GCCCTGGCCG AGCGAGCCCT GGAGCCCATG ATCCCCTCGG AAGAGGAATT CCCTTATGCT
ATTCGCCTGG TATCGGAAGT CCTGGGCTCC AACGGTTCAA CATCCATGGG CAGCGTCTGC
GGCAGCACCC TGTCCCTCAT GGATGCCGGG GTGCCCATCA AGGCCCCGGT GGCAGGGGTA
GCCATGGGCC TGGTAAAGGA GAACGACCAG GTAGCCATCC TTACGGACAT CCAGGGATTA
GAAGACGCCT TGGGAGATAT GGACTTTAAG GTGGCCGGGA CGAAAAAGGG TATAACAGCC
CTGCAGATGG ATATTAAAAT TGCCGGGATC GACAGATCCA TCCTGGAACG GGCTCTGGAA
CAGGCCCGCC GCGGCCGGCT CTTTATCCTC GATAAAATAC TGGCAACTAT CCCGGAGCCG
CGCAAGGAGC TTTCACCATA TGCCCCGAGG ATGCTGACTA TTACCATTGA CCCGGATAAA
ATTCGGGATA TTATCGGCCC GGGCGGTAAG ATCATTAAAA AGATCATTGA AGAGACCGGG
GTGGAGATTG ACGTCGAGGA TGACGGACGC GTCTTCATTG CCTCTACCGA TGCCGCCGCC
GGGGAGCGGG CACTAAAGAT TATTGAATCC TTGACCCAGG ATGTGGAAAC AGGTAAGGTA
TACAACGGCA AGGTAACTAG GGTGACTGAC TTTGGTGCCT TTGTAGAGGT AATACCCAGG
GTGCTGGGGA TGCCCGGTAA GGAAGGGCTG GTCCACATCT CCCAGTTGGC CAACGAACGG
GTAGAAAAAG TGGAGGATGT AGTCCAGGAA GGCGATTATA TCCTGGTCAA GGCCATCGGT
TTTGACCCCC AGGGCCGGCT GAAACTCTCA CGTAAAGAGG CCCTCAATGA GTCGACGGTC
GGGGAAGGGG GGCACCGCCA CTTTCGCCGG GCTGGACGGG AAGGCGGCCA TCGGGGCTTA
AACAACCGCA GACAATCTCG TTAG
 
Protein sequence
MQGVLRKTLV VAGRDLTLET GRLAKQAGGA VMVSYGGTMV LVTATASEEP REGIDFFPLT 
VDYEERLYAA GKIPGGFIKR EGRPSEKAIL SARLIDRPIR PLFPKFYRND VHVVATVMSV
DQDCPPNVAG IIGASAALML SAIPFAGPIG AVSVGLIDNR PVINPTLEED SRSSLNLTVA
GTANAIMMVE AGAKEVPEDL MLECIMQGHE EIKRIVAFIN EFRAEALAMG LAKEKPELVA
PQLDPAWESR VREIATPRLR EVIYRSRDEK WSKQERDKQL DACREEINNL ILEGQEEALA
ANPELPGLIK ELITKIEKEI VRRMILTEGI RVDGRTLEEI RPITCEVGVL SRTHGSGLFT
RGETQVLTVT TLGPISDEQI LDDLGVDESK RYMHHYNFPP YSVGEARPIR APGRREIGHG
ALAERALEPM IPSEEEFPYA IRLVSEVLGS NGSTSMGSVC GSTLSLMDAG VPIKAPVAGV
AMGLVKENDQ VAILTDIQGL EDALGDMDFK VAGTKKGITA LQMDIKIAGI DRSILERALE
QARRGRLFIL DKILATIPEP RKELSPYAPR MLTITIDPDK IRDIIGPGGK IIKKIIEETG
VEIDVEDDGR VFIASTDAAA GERALKIIES LTQDVETGKV YNGKVTRVTD FGAFVEVIPR
VLGMPGKEGL VHISQLANER VEKVEDVVQE GDYILVKAIG FDPQGRLKLS RKEALNESTV
GEGGHRHFRR AGREGGHRGL NNRRQSR