Gene Mpe_A3046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3046 
Symbol 
ID4784968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3237556 
End bp3240396 
Gene Length2841 bp 
Protein Length946 aa 
Translation table11 
GC content70% 
IMG OID640091617 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001022234 
Protein GI124268230 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.490165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG ACCAGCCCAC CGACTACCGC GCCACGCTGA ACCTGCCCGA CACGCCCTTC 
CCGATGCGCG GCGACCTGCC CAAGCGCGAA CCGGGCTGGG TGAAGCAGTG GGAGCAGCAG
GGCACCTACC AGCGCCTGCG CGACGCCCGC GTCGGCCGGC CGCGCTTCGT GCTGCACGAC
GGCCCGCCCT ACGCCAACGG CCAGCTGCAC ATCGGCCACG CGCTGAACAA GGTGCTGAAG
GACATGATCG TCAAGGCGCG TCAGCTGGCC GGCTACGACG CGCTCTACGT GCCGGGCTGG
GACTGCCACG GCCTGCCGAT CGAGAACCAG ATCGAGAAGC TGCACGGCCG CGGCCTGGGG
CGCGACGAGG TGCAGGCGAA GAGCCGTGCC TATGCCACCG AGCAGATCGA GCAGCAGCGG
GCCGACTTCA AGCGCCTGGG CGTGCTCGGC GCCTGGGACC AGCCCTACCG CACGATGGAC
TTCGGCAACG AGGCCGGCGA GATCCGCGCG TTGAAGCGCG TGATGGAGCG CGGCTTCGTC
TACCGCGGCC TGAAGCCGGT GTACTGGTGC TTCGACTGCG GCAGCTCGCT GGCCGAGTTC
GAGATCGAGT ACGAGAACAA GTCCTCGCCG ACGGTGGACG TGGCCTTCCT GGCCGCCGAG
CCGCAGAAGC TCGCCGCCGC CTTCGGCCTG CCGGCGCTGG CCAAGGACGC GTTCGCGGTG
ATCTGGACCA CGACGCCATG GACCCTTCCC GCCAACCAGG CGCTGAACCT CAACCCCGAG
CTCGAGTACG CGCTGGTCGA CACCGAGCGC GGCCTGCTGC TGCTGGCCAA TGCGCTGGTG
GAGAAGTGCC TGGCGCGCTA CGGCCTGGCC GGCACGGTGC TCGCCACGAC CGCCGGCCAG
GCGCTCGAGG GCCTGGAGTT CCACCATCCG CTGGCCCACG TGCACCCGGG CTACGCGCGC
CGCAGCCCGG TCTACCTGGC CGACTACGCC ACCGCCGAGG ACGGCACCGG CATCGTCCAC
TCGGCGCCGG CCTACGGCGT GGAGGACTTC AACTCCTGCA TCGCGCACGG GATGAAGCAC
GACGAGATCC TGAACCCCGT GCAGGGCAAT GGCGTGTACG CGGCCGAGCT GCCGCTGTTC
GGCGGCCAGT TCATCTGGAA GGCCAACCCG CTGATCGTGC AGGCGCTGCA GGATGCCGGC
CGGCTGATGG CGACCGCAAA GCTCGAGCAC AGCTACCCGC ACTGCTGGCG CCACAAGACG
CCGGTGATCT ACCGCGCCGC GGCGCAGTGG TTCGTGCGCA TGGACGAGGG CGAGGGCGTG
TTCACCGTCG ACAAGGCGCC GAAGACGCTG CGCCAGACCG CGCTGGCGGC GATCGACGCC
ACCGCCTTCT ACCCCGAGAA CGGCCGCGCC CGACTGCGCG ACATGATCGC CAACCGGCCC
GACTGGTGCA TCAGCCGCCA GCGCAACTGG GGCGTGCCGC TGCCTTTCTT CCTGCACAAG
GTGAGCGGCG AGCTGCACCC CGACACGCTG GCGCTGATGG ACCGCGCCGC CGCGCTGGTG
GCGCAGGGCG GCGTGGAGGC CTGGTCGCGG CTCGACCCGC GCGAGTGGCT GGGCGAGGCA
GCCGGCGATT ACGCCAAGAG CACCGACATC CTCGATGTGT GGTTCGACTC CGGCTCGACC
TTCTTCCACG TGCTGCGCGG CAGCCATGCC GGCGCCGGCC GCGACGACGG CGGGCCCGAG
GCCGACCTCT ACCTCGAGGG CCACGACCAG CACCGCGGCT GGTTCCACAG CTCGCTGCTG
ATCGCCTGCG CGATCGAGGG CCGTGCGCCC TACCGCGGCC TGCTGACGCA CGGTTTCGCG
ACCGACGGCC AGGGCCGCAA GATGAGCAAA TCGCTCGGCA ACACCGTGGT GCCGCAGTCG
GTGAGCGAGA AGCTGGGTGC CGAGATCATC CGGCTGTGGG TCGCCAGCAC CGACTACTCG
GGCGACCTGA ACATCGACGA CAAGATCCTC GCACGCGTGG TCGACGCCTA CCGGCGCATC
CGCAACACGC TGCGCTTCCT GCTCGCCAAC ACCAGCGACT TCGACCCGGC GACCGACGCG
GTGCCGGACG AGCAGTTGCT GGAGATCGAC CGCTACGCGA TCGACCGCGC GGCGCAGCTG
CAGGCCGAGA TCCTGGCGCA CTACGAGGTC TACGAATTCC ACCCGGTGGT CGCGAAACTG
CAGGTCTACT GCAGCGAAGA CCTCGGTGCG TTCTACCTCG ACGTGCTGAA GGACCGGCTC
TACACCACCG CCCCGAAATC GCTGGCGCGG CGCAGCGCGC AGACCGCGCT GCACCGCATC
ACCGGCGCGA TGCTGCGCTG GATGGCGCCG TTCCTGAGCT TCACCGCCGA GGAGGCCTGG
CCGATCTTCG CGCCGGGCGT GTCGCCGTCG ATCTTCACGC AGACCTATAC CCCCTTCGCG
CCCCCCGATG CCGCGCGCCT GGACAAGTGG GCCCGCGTGC GCGAGATCCG CGATGCCGTC
AACAAGGAGA TCGAGGCCGT CCGCACCGCC GGCGCGGTGG GCGCCTCGCT GCAGGCCACG
GTGGCGGTCG GCGCGCCGGC CGACGACCTG GCGCTGCTGC AGTCACTGGG CGAGGACCTG
AAGTTCGTGT TCATCACCTC GGCCGCCACC GCGGCGGCGG CCGACGCGCT GACGGTCGCG
GTCACGCCGA GCAGCGCCGC CAAGTGCGAA CGCTGCTGGC ACTACCGCGA CGACGTCGGC
GCCGACCCGG CCCACCCGAC GATCTGCGGC CGCTGCACCA ACAATCTCTA CGGTGCCGGC
GAAAGCCGCA CGGTGGCCTG A
 
Protein sequence
MSTDQPTDYR ATLNLPDTPF PMRGDLPKRE PGWVKQWEQQ GTYQRLRDAR VGRPRFVLHD 
GPPYANGQLH IGHALNKVLK DMIVKARQLA GYDALYVPGW DCHGLPIENQ IEKLHGRGLG
RDEVQAKSRA YATEQIEQQR ADFKRLGVLG AWDQPYRTMD FGNEAGEIRA LKRVMERGFV
YRGLKPVYWC FDCGSSLAEF EIEYENKSSP TVDVAFLAAE PQKLAAAFGL PALAKDAFAV
IWTTTPWTLP ANQALNLNPE LEYALVDTER GLLLLANALV EKCLARYGLA GTVLATTAGQ
ALEGLEFHHP LAHVHPGYAR RSPVYLADYA TAEDGTGIVH SAPAYGVEDF NSCIAHGMKH
DEILNPVQGN GVYAAELPLF GGQFIWKANP LIVQALQDAG RLMATAKLEH SYPHCWRHKT
PVIYRAAAQW FVRMDEGEGV FTVDKAPKTL RQTALAAIDA TAFYPENGRA RLRDMIANRP
DWCISRQRNW GVPLPFFLHK VSGELHPDTL ALMDRAAALV AQGGVEAWSR LDPREWLGEA
AGDYAKSTDI LDVWFDSGST FFHVLRGSHA GAGRDDGGPE ADLYLEGHDQ HRGWFHSSLL
IACAIEGRAP YRGLLTHGFA TDGQGRKMSK SLGNTVVPQS VSEKLGAEII RLWVASTDYS
GDLNIDDKIL ARVVDAYRRI RNTLRFLLAN TSDFDPATDA VPDEQLLEID RYAIDRAAQL
QAEILAHYEV YEFHPVVAKL QVYCSEDLGA FYLDVLKDRL YTTAPKSLAR RSAQTALHRI
TGAMLRWMAP FLSFTAEEAW PIFAPGVSPS IFTQTYTPFA PPDAARLDKW ARVREIRDAV
NKEIEAVRTA GAVGASLQAT VAVGAPADDL ALLQSLGEDL KFVFITSAAT AAAADALTVA
VTPSSAAKCE RCWHYRDDVG ADPAHPTICG RCTNNLYGAG ESRTVA