Gene Mpe_A1137 details


Gene Information

Locus tag: Mpe_A1137
Symbol:
ID: 4785712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1213037
End bp: 1215862
Gene Length: 2826 bp
Protein Length: 941 aa
Translation table: 11
GC content: 67%
IMG OID: 640089700
Product: putative valyl-tRNA synthetase
Protein accession: YP_001020333
Protein GI: 124266329
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
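
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1213037
end_bp = 1215862

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2826 bp) and Protein Length (941 aa) fields.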


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.835272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0253174
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC TCGCGAAATC CTTCGAACCC GGCCCGATCG AGGCCAAGTG GGCCCCCGTG 
TGGGAGCAGC GAGGCCTGTT CGCGCCGACG CTCGACGACG GCAAGTCCTC GTTCGCCATC
CAGCTGCCGC CGCCCAACGT GACCGGCGTG CTGCACATGG GCCACGCGTT CAACCAGACC
ATCATGGACG CGTTGACGCG CTACCACCGC ATGCGCGGCG ACAACACGCT GTGGGTGCCG
GGCACCGACC ACGCTGGCAT CGCGACGCAG ATCGTCGTGG AGCGCCAGCT GGAGCAGCAG
GGCCAGAGCC GTCACGACCT CGGCCGCCAG AGCTTCGTCG CCAAGGTGTG GGAGTGGAAG
GAGCACTCGG GCTCGACGAT CACGCAGCAG ATGCGCCGGG TCGGCGCCAG CGTGGACTGG
CGGCACGAGT ACTTCACGAT GGACGAGAAG CTGTCGCCGG TGGTGACGGA CACCTTCGTG
CAGCTCTATG AGCAGGGCCT GATCTACCGC GGCAAGCGGC TGGTGAACTG GGACCCGGTG
CTGAAGTCGG CAGTCTCGGA CCTGGAGGTG GAGAGCGAAG AGGAAGACGG TTTCCTGTGG
CACATCCGTT ACCCGCTGGC CGACGGCAGC GGCGAGTTGG TGGTGGCGAC GACCCGGCCC
GAGACCATGC TCGGCGACAC CGCGGTGATG GTGCACCCCG AGGACGAGCG CCATGCCGGC
CTGATCGGAA AGCAGGTGAC GCTGCCGCTG TGCGGTCGCA CGATCCCGGT GATCGCCGAC
GACTACGTCG ACCGTGCCTT CGGCACCGGC GTGGTCAAGG TCACGCCGGC CCACGACTTC
AATGACTACG CGGTCGGCCA GCGCCACGGG TTGCCGGTGA TCGGCATCCT GACGCTGGAC
GCCAAGGTCA ACGACCTTGC GCCGGAGGCC TATCGCGGGC TGGACCGCTT CGTCGCGCGC
AAGAAGGTGG TCGCCGACCT CGAGACCCAG GGCTTCCTGG TCGAGGTGAA GAAGCACAAG
CTGATGGTGC CGCGCTGCGC GCGCACCGGC CAGGTGGTCG AGCCGATGCT GACCGACCAG
TGGTTCGTCG CGGTCAGCAA GGCCGGCCCC GACGGCAGGA GCATCGCGCA GAAGGCGATC
GACGCGGTTG CCTCCGGCGA GGTGAAGTTC GTGCCGGAAA ACTGGGTCAA CACCTACGAC
CAGTGGATGA AGAACATCCA GGACTGGTGC ATCTCGCGCC AGCTCTGGTG GGGCCACCAG
ATCCCGGCGT GGTACGGCAG CGGCGGCGAG CTGTTCGTCG CGCGCAGCGA GGACGAGGCG
CGGACCAAGG CGCGCGCGGC CGGCTACGTC GGCGCGCTGA CGCGCGACGA GGACGTGCTC
GATACCTGGT ACTCGTCCGC GCTGGTGCCG TTCTCCTCGC TCGGCTGGCC GGCGAGGACG
AAGGAACTCG AGCTGTTCCT GCCCTCGAGC GTGCTGGTCA CCGGCTACGA AATCATCTTC
TTCTGGGTCG CCCGGATGAT CATGATGACG ACCCACTTCA CCGGCCGCGT GCCGTTCCGC
ACCGTCTACA TCCACGGCAT GGTGCGTGAC AGCGAGGGCA AGAAGATGAG CAAGTCCGAA
GGGAACGTGC TCGATCCGGT GGACCTGATC CAGGGCGTGG ACCTCGATAC GCTGGTGAAG
AAGAGCACCA CCGGCCTGCG CAAACCCGAG ACCGCGCCGA AGGTCGCCGC ACGCGTGAAG
AAGGAGTTCC CCGAGGGCAT GCCCGCCTAT GGCGCCGACG CGCTGCGCTT CACGATGGCC
AGCTACGCCA GCCTGGGCCG CAACATCAAC TTCGACACCA AGCGCTGCGA GGGCTATCGC
AACTTCTGCA ACAAGCTCTG GAACGCCACG CGCTTCGTGC TGATGAACTG CGAGGGTCAG
GACTGCGGCT TTGCGGATCA CACGGCCGAG CAGTGCGTGC CGGGCGGCTA CCTCGACTTC
TCGAACGCCG ACCGCTGGAT CACCGGCGAG CTGCAGCGCA TCGAGGCGGC GGTCGAGAAG
GCGTTCGCCG AGTTCCGCCT CGACAACGTC GCCAACGCCG TCTACAGCTT CGTGTGGGAC
GAGTACTGCG ATTGGTATCT GGAAATCGCC AAGGTGCAGA TCGCGGTCGG CGACGACGCC
GCGAAGCGCG CGACCCGCCG CACATTGATC CGCGTGCTGG AGACCGTCTT GCGCCTGCTG
CATCCGCTGA CGCCCTTCAT CACCGAGGAA CTGTGGCAGG CCGTCGCACC GATCGCGCAA
CGCAAGGTCG CCGGCAGCGA CGCGTCGATC GCGACGGCAA GCTACCCGCA GCCGCAACTG
GAACGCGTCG ACGCCCAAGC CGATGCCTGG GTGGCGAAGC TGAAGGCACT GGTCGGCGCC
TGCCGCAACC TGCGCTCGGA GATGAGCCTC TCACCGGCCG AGCGCGTGCC GCTGCTGAGC
TTCGGAGATG CCACCTTCAT CACCCAGGCC ACGCCGCTGC TGAAGGCGCT GGCCAAGCTC
GGCGACGTGC GGGTCATCGA CAGCGAGTCT GAGTTCGTGC AGGCCACCGC GGCCGCGCCG
GTGTCTGTGC ACGGCGCGAC ACGGCTGGCG CTGCACGTGG AGGTCGATGT CGAGGCCGAG
CGCGAGCGGC TGTCGAAGGA GATCGCCCGG CTCGAGGGCG AGATCGTCAA GGCCGAAGCC
AAGCTGGGCA ACGAGAGCTT CGTGGCGCGC GCTCCAGCGA CCGTGGTGGC GCAGGAGCGC
CAGCGCCTCA CAGATTTCAG CGCGACGCTG GATCGCTTGC GGGCTCAGCG TTCGCGCTTG
GGCTGA
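
The sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. As a rough sketch of how a GC content figure like the one in the Gene Information section could be recomputed, the helper below (`gc_fraction` is a hypothetical name, not something defined by the database) counts G and C over the cleaned sequence. Only the first row of the gene sequence is used here for brevity; the full sequence would be pasted in the same way.

```python
def gc_fraction(raw_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(raw_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, copied with its original block spacing.
first_row = "ATGACCGAAC TCGCGAAATC CTTCGAACCC GGCCCGATCG AGGCCAAGTG GGCCCCCGTG"

print(f"GC content of first row: {gc_fraction(first_row):.0%}")
```

A single row will not necessarily match the gene-wide figure, since GC content varies along the sequence.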
 
Protein sequence
MTELAKSFEP GPIEAKWAPV WEQRGLFAPT LDDGKSSFAI QLPPPNVTGV LHMGHAFNQT 
IMDALTRYHR MRGDNTLWVP GTDHAGIATQ IVVERQLEQQ GQSRHDLGRQ SFVAKVWEWK
EHSGSTITQQ MRRVGASVDW RHEYFTMDEK LSPVVTDTFV QLYEQGLIYR GKRLVNWDPV
LKSAVSDLEV ESEEEDGFLW HIRYPLADGS GELVVATTRP ETMLGDTAVM VHPEDERHAG
LIGKQVTLPL CGRTIPVIAD DYVDRAFGTG VVKVTPAHDF NDYAVGQRHG LPVIGILTLD
AKVNDLAPEA YRGLDRFVAR KKVVADLETQ GFLVEVKKHK LMVPRCARTG QVVEPMLTDQ
WFVAVSKAGP DGRSIAQKAI DAVASGEVKF VPENWVNTYD QWMKNIQDWC ISRQLWWGHQ
IPAWYGSGGE LFVARSEDEA RTKARAAGYV GALTRDEDVL DTWYSSALVP FSSLGWPART
KELELFLPSS VLVTGYEIIF FWVARMIMMT THFTGRVPFR TVYIHGMVRD SEGKKMSKSE
GNVLDPVDLI QGVDLDTLVK KSTTGLRKPE TAPKVAARVK KEFPEGMPAY GADALRFTMA
SYASLGRNIN FDTKRCEGYR NFCNKLWNAT RFVLMNCEGQ DCGFADHTAE QCVPGGYLDF
SNADRWITGE LQRIEAAVEK AFAEFRLDNV ANAVYSFVWD EYCDWYLEIA KVQIAVGDDA
AKRATRRTLI RVLETVLRLL HPLTPFITEE LWQAVAPIAQ RKVAGSDASI ATASYPQPQL
ERVDAQADAW VAKLKALVGA CRNLRSEMSL SPAERVPLLS FGDATFITQA TPLLKALAKL
GDVRVIDSES EFVQATAAAP VSVHGATRLA LHVEVDVEAE RERLSKEIAR LEGEIVKAEA
KLGNESFVAR APATVVAQER QRLTDFSATL DRLRAQRSRL G
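
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal illustration that builds a codon table from the standard NCBI compact layout (bases ordered T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons. The function name `translate` is illustrative, and the sketch assumes an ATG start, which is the case for this gene; alternative start codons are not handled.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_sequence: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(raw_sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGACCGAAC TCGCGAAATC CTTCGAACCC"))
```

Applied to the first 30 bases, this yields the first block of the protein sequence shown above, MTELAKSFEP.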