Gene Moth_0556 details


Gene Information

Locus tag: Moth_0556
Symbol:
ID: 3831456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 579345
End bp: 581033
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 60%
IMG OID: 637828497
Product: ribonuclease G
Protein accession: YP_429429
Protein GI: 83589420
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1530] Ribonucleases G and E
TIGRFAM ID: [TIGR00757] ribonuclease, Rne/Rng family
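
The start, end, gene length, and protein length values above are mutually consistent, and the arithmetic can be checked with a minimal Python sketch. It assumes the coordinates are 1-based and inclusive and that the gene span includes the stop codon; neither assumption is stated in the record itself, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(579345, 581033)
prot_len = protein_length_aa(gene_len)
print(gene_len, "bp")   # 1689 bp
print(prot_len, "aa")   # 562 aa
```

Under these assumptions the computed values match the listed Gene Length (1689 bp) and Protein Length (562 aa).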


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTAAAAG AGATTCTAAT TCAGGATGAT GCCGAGGAAA CGGCCGTGGC CCTCCTGGAA 
GACGGCCGGT TAATGGAAAT CTACCTGGAA AGGGATAGTA ACCAGCGCCT CGTTGGCAAT
ATTTATAAGG GCAGGGTGGC CAACGTCCTG CCAGGTATGC AGGCCGCCTT CGTGGATATC
GGCCTGGAGA AGAACGCTTT TCTCTTTGTT GATGATACCA GCGGTATTGA GGCCCTGGAA
GGTGAGATTG TTTCCCGTTC CCGGCGCCGC ATCAGCGACG TCGTCCGGGA GGGGCAGGAG
ATCCTGGTCC AGGTGGCCAA AGAACCCCAG GGTACCAAAG GGGCCCGGGT TACCACCCAG
ATAACCCTGC CGGGGCGCTA CCTGGTCCTC ATGCCCACGG TTAATTATAT AGGCGTATCC
CGCCGGATTA GCCATGAGGA AGAGCGGGAG AGGTTAAAAA ACCTGGCCCG GGCGGTAAAA
CCCCGGCGCA TGGGCCTGAT TGTCCGTACC GTAGCTGCCG GGGCGGGCCT GGAAGAACTC
CAGGCGGACT GCCAGAACCT TACCCGGACC TGGAAACGCA TCCGCCAGGC GGCCCGGCGG
AGCAAGGCCC CCCGCCTGGT CCATCACGAC GTTGAGCTTT CCCTGCGCAT CCTGCGGGAC
CTGTATGCTG ATGACGTCAA TCGCCTGGTG GTGAATTCCC CGGCCACCTA TGCCAAGGTT
CTCGAAGTCC TGGCCGGCCG GGCTCCCAAC CTGCGGCAGC GGGTGATTTT AAGGGAGAAC
GCCGACCTCT TTGCCGTCTA CGGGGTACAA AACCAGATCG AGCAGGCCCT GAAACGTAAA
GTCTGGCTCA GGTGCGGCGG CTACCTGATC ATCGACCAAA TGGAAGCCCT GACGGCCATT
GATGTCAATA CCGGCAAGTA TGTCGGCCGC CACAACCTGG CCGAGACGGT GCTGACCACC
AACCTGGAGG CAGCCGTCGA GGTGGCCCGC CAGCTGCGGT TGCGGAATAT CGGCGGCATC
ATTGTGGTCG ACTTCATCGA TATGGATAAC CCCCTCCACC GGGAGCAGGT CATCAAAGTT
CTGGAAGGTG AGTTGACCAA GGATAAAACA AAGACCCAGA TCCTGGGCTT CACCCGCCTG
GGGCTCCTGG AAATGACCAG GAAAAAGGCC CAGCAGCGGC TGGAGAGCGT GTTGCAGCAG
GATTGTCCTT ATTGCCATGG CACCGGCAAG GTCCTTTCAG CGGAAACGGT AACCCTCAAG
GCTCGTAAGG AGATCCTGCA ACTGGCGGCT GTCAGCAAGG CCCGGGCCAT CCTGGCAGAA
GCCAATCCAG CCGTGGCGGC GCCCCTCATC GGCGTCGGTG GCGCCAACCT GCGGACGCTG
GAGCGGCGGG CCGGTAAGAA ATTAATCATC AAGGGGAACG AAAATTTTCA CCTGGAGGAA
GTACGGCTGC GGGAGCTCTT TGACCGGGAA GAAATCGCCA ATCTCTCTAC GCCCGTAAAG
GTAGGCCAGG TCCTCCAGGT AACCATTGAA GGTGTTCATA CGGGCAACGG CGGCGACGGT
ATCGCCAGGG TAGCCGGTTT CGTCCTGGAT ATCCCCGGCG GTGCCGCCTA CCTGGGACGG
GAAGTACCGG TAGAGATAAC CCGGGTTTTC CGGACCTATG CCCGGGCGCG GCTACTGGCC
AATGCTTGA
 
Protein sequence
MLKEILIQDD AEETAVALLE DGRLMEIYLE RDSNQRLVGN IYKGRVANVL PGMQAAFVDI 
GLEKNAFLFV DDTSGIEALE GEIVSRSRRR ISDVVREGQE ILVQVAKEPQ GTKGARVTTQ
ITLPGRYLVL MPTVNYIGVS RRISHEEERE RLKNLARAVK PRRMGLIVRT VAAGAGLEEL
QADCQNLTRT WKRIRQAARR SKAPRLVHHD VELSLRILRD LYADDVNRLV VNSPATYAKV
LEVLAGRAPN LRQRVILREN ADLFAVYGVQ NQIEQALKRK VWLRCGGYLI IDQMEALTAI
DVNTGKYVGR HNLAETVLTT NLEAAVEVAR QLRLRNIGGI IVVDFIDMDN PLHREQVIKV
LEGELTKDKT KTQILGFTRL GLLEMTRKKA QQRLESVLQQ DCPYCHGTGK VLSAETVTLK
ARKEILQLAA VSKARAILAE ANPAVAAPLI GVGGANLRTL ERRAGKKLII KGNENFHLEE
VRLRELFDRE EIANLSTPVK VGQVLQVTIE GVHTGNGGDG IARVAGFVLD IPGGAAYLGR
EVPVEITRVF RTYARARLLA NA
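
The gene begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal, and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The following is a minimal Python sketch of that rule, applied to the first 60 bases of the gene sequence listed in this record. The function and variable names are hypothetical; the codon table is the standard one, which table 11 shares for elongation codons.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or an in-frame 5' fragment) using table 11 rules."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
fragment = "GTGCTAAAAG AGATTCTAAT TCAGGATGAT GCCGAGGAAA CGGCCGTGGC CCTCCTGGAA"
print(translate_cds(fragment))  # MLKEILIQDDAEETAVALLE
```

The output matches the first 20 residues of the protein sequence above.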