Gene Mpe_A1983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1983 
Symbol 
ID4783770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2123909 
End bp2126143 
Gene Length2235 bp 
Protein Length744 aa 
Translation table11 
GC content67% 
IMG OID640090553 
Productexoribonuclease II 
Protein accessionYP_001021176 
Protein GI124267172 
COG category[K] Transcription 
COG ID[COG0557] Exoribonuclease R 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
[TIGR02063] ribonuclease R 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0944364 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG TCGAGGGTAT CGTCCAAGGC CACCGCGACG GGCACGGCTT CGTCCAGCGC 
GACGACGGCG AATCCGACAT CTATCTGTCG CCGCAGGAAA TGCGGGCGGT TCTGCACCGC
GATCGGGTCC GCGTGCGGGT GATCCGCTAC GACCGCAAGG GCCGGCCTGA AGGGCGGGTG
CTCGAGATCC TGGAACGCCG CAAGGCGCCG ATCATCGGTC GCCTGCTGCA CGAGAGCGGC
ATCTGGCTGG TCGCCCCGGA GGACAAGCGC TACGGCCAGG ACATCATGAT CCCGAAGAAT
GGCATTGCCA ATGCCAGCGC GGGTCAGGTG GTCGCGGTCG AGCTGACCGA GCCGCCGTCG
ATGCATTCGC AGCCGCTGGG CCGCGTGACA GAGGTGCTCG GCGAGATCGA CGACGCGGGC
ATGGAGATCG AGATCGCGGT GCGCAAGTAC GAGGTGCCGC ACCGCTTCTC GGCCGAAACG
CTGGCGCAGA CGGCCAAGCT GCCCGACAAG GTGCGGCCAG CCGACAAGAG GCAGCGCATC
GACCTGACCG ACGTCCCGCT GGTCACCATC GATGGCGAGG ACGCCCGGGA CTTCGACGAC
GCCGTGTACT GCGAGCCGGT CACGATCGGC CGCAAGACCA AGGGGGCGGC GCCGAACGGC
TGGCGCCTGA TCGTCGCCAT TGCCGACGTG AGCCACTACG TGAAGCCCGG CGAGTCGCTG
GACGACGATG CCTACGAGCG CGCGACCTCG GTCTATTTCC CGCGGCGCGT GATTCCGATG
CTGCCGGAGA AGCTCAGCAA CGGGCTGTGC TCGCTGAATC CCAACGAGGA CCGCCTCGCG
ATGGTGTGCG ACATGGTGGT CGACGCCGCA GGCGAGGTAC ACGCCTACCA GTTCTTCCCG
GCCGTCATCA ACTCGCATGC GCGCTTCACG TACACGGAAG TCGCGACGAT CCTCGCCAAC
ACGCGCGGGC CAGAGGCGCA GAAGCGCAAG GAGTTGGTGC CGCACCTGCT GCACCTGCAC
GAGGTGTATC GCGCCCTGCT GAAGGCGCGC GCCACGCGCG GCGCGGTCGA CTTCGAGACC
ACCGAGACGC AGATCGTCTG CGATGAGAAC GGGCGAATCG AGAAGATTGT GCCGCGGACA
CGCAACGACG CACACCGCCT GATCGAGGAG GCGATGCTGG CGGCGAACGT GTGCTCGGCC
GATTTCATCG CGTCACACAA GCACGCTTCG CTGTATCGCG TGCACGAAGG CCCGACGCCC
GAGAAGCGCG CGATCCTGCA GACTTATCTG CGTGCGCTCG GCCTCGGCTT GTCGATCGGC
GACGATCCCC GGCCCGGTGA GTTCCAGGCG ATCGCGCAGG CGACGAAGGA CCGCCCTGAC
GCCACCCAGA TCCACTCGAT GCTGCTGCGC TCGATGCAGC AGGCCATCTA CACGCCGACC
AACAGCGGTC ACTTCGGCCT GGCCTATCCG GCCTACACCC ACTTCACCAG CCCGATCCGC
CGCTACCCGG ACCTGCTGGT GCACCGTGTG ATCAAGGCTC TGCTGGGCAG CAAGAAGTAC
CACCTGCAGG TCACCGAGCT CGCCAGCTCA GCCGTCCACA CGCGCAAGGT TCGGCCGACC
GCGGCCTCCA AGGCGCAGCA GGCCAGCAAG CTGGTCGTCA AGCGGACCGC CGAGGCGATG
GCCTGGGAAG CGGCCGGCGC GCACTGCAGC GCGAACGAGC GGAGGGCCGA CGAGGCGTCG
CGCGATGTCG AGGCTTGGCT GAAATGCAAG TTCATGCGTG AGCACCTCGG CGAGGAGTTC
GGCGGCGCGG TCACCGCAGT CACGACCTTC GGGTTGTTCG TCACGCTCGA CGAGCTGTAC
GTCGAGGGCC TGGTGCACAT CACCGAACTC GGCGGAGAGT ACTTCCGCTT CGACGAAGCG
CGACAGGAGC TGCGCGGCGA GCGCACCGGC GTGCGCTACG TCGTCGGCAG CCGGGTGCGG
GTGCAAGTGA GCCGCGTCGA TCTCGACGGG CGCAAGATCG ACTTCCGACT GGTCCACGAA
TCGGGGCTGA ACCCGCGGGC ACCGCGTGAC AAGGCGGCTT CGGCCGTCGA GGAACTGCAG
GTCGTGAAGG ACGTCGACCG CGAGCACAAG CGGGCGACCA AGAAGGCGGT CAGCAAGACT
CCCCGCAAGC CTGCGCGCAG CGCTGGGGCG GACACCGCGG CAGCGCCGCG CAAGTCGAAG
CGTTCGAGGC GGTAG
 
Protein sequence
MSEVEGIVQG HRDGHGFVQR DDGESDIYLS PQEMRAVLHR DRVRVRVIRY DRKGRPEGRV 
LEILERRKAP IIGRLLHESG IWLVAPEDKR YGQDIMIPKN GIANASAGQV VAVELTEPPS
MHSQPLGRVT EVLGEIDDAG MEIEIAVRKY EVPHRFSAET LAQTAKLPDK VRPADKRQRI
DLTDVPLVTI DGEDARDFDD AVYCEPVTIG RKTKGAAPNG WRLIVAIADV SHYVKPGESL
DDDAYERATS VYFPRRVIPM LPEKLSNGLC SLNPNEDRLA MVCDMVVDAA GEVHAYQFFP
AVINSHARFT YTEVATILAN TRGPEAQKRK ELVPHLLHLH EVYRALLKAR ATRGAVDFET
TETQIVCDEN GRIEKIVPRT RNDAHRLIEE AMLAANVCSA DFIASHKHAS LYRVHEGPTP
EKRAILQTYL RALGLGLSIG DDPRPGEFQA IAQATKDRPD ATQIHSMLLR SMQQAIYTPT
NSGHFGLAYP AYTHFTSPIR RYPDLLVHRV IKALLGSKKY HLQVTELASS AVHTRKVRPT
AASKAQQASK LVVKRTAEAM AWEAAGAHCS ANERRADEAS RDVEAWLKCK FMREHLGEEF
GGAVTAVTTF GLFVTLDELY VEGLVHITEL GGEYFRFDEA RQELRGERTG VRYVVGSRVR
VQVSRVDLDG RKIDFRLVHE SGLNPRAPRD KAASAVEELQ VVKDVDREHK RATKKAVSKT
PRKPARSAGA DTAAAPRKSK RSRR