Gene Mpe_A2193 details

Gene Information

Locus tag: Mpe_A2193
Symbol:
ID: 4784798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2348733
End bp: 2350757
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 71%
IMG OID: 640090761
Product: hypothetical protein
Protein accession: YP_001021384
Protein GI: 124267380
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
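
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming 1-based inclusive coordinates and a single untranslated stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2348733, 2350757)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the computed lengths agree with the listed 2025 bp and 674 aa.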


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.189847
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGCA CGATCCTCCT CACCGCCTCG CTGCTCACCG GCCTGCTGAT GCTGCAGGCC 
TGCGGCACGC CTGCGGCGCG CTCGCCGGCC GCCGGCACGC TGCTGCCGAA CGAGAACCTC
GTCGTGCAGG GCATTCCGGC CATCCCGGTG TCGCTGGTGC AGCAGGTCGA GCGTTACACC
GATTTTCGCG GCCATGCCCT TGCCGACTGG CATCCGGCAC GCGACGAGAT GCTGGTCTCG
CACCGCCCGG CCGGCGCCAA CACCGCGCAG CTGTTCCGTG TGCCCGGCCC GTTGGCGGCC
CCGGTCGCGC TGACCGACTT CGCCGACCCG GTGACCGAGG CGAGCTACGA GCCAGTGCTG
GGCCGCTACC TCGTGTTCTC GCGCAGCCCC GGCGGCAACG AGGCCGAGCA GCTGTATCGG
CTCGATGCCG GCAAGCGCCA ACCCACGCTG CTGACCGATC CCGACCAGCG CCACGACGCC
ATCGGCTGGC TGCACCAGAG CGCGCAGTTG CTTTACACCG CGGTACCGCT GGACCGCACC
GCCGAGGGCG GCAGCCGCGC AACGCCCACC ACCACGCTGT GGCTGGTCGA TCCCCTGCAG
CCGTCGTCGC GTCGCCGCAT CGCCGAGCTG CCCGGCACCG GCTGGTTCGG TGGCGAGATC
TCGCGTGACG ACAAGCAGTT GGCGATCACG CGCTACCGCT CGGCGACCGA TTCGCAGGTC
TGGCTGATCG ACCTCGCCGA CGGCACGCCG ACGCAGTTGC TGCCGGCCGT CGGCGAACCC
TTGCAGGCCA CGCACTTCCC GGTCGGCTAC TCGCCGGACG GCGCCTCGCT CTACGTGCTG
AGCGACCGCG CCGGCGAGTT CCGCGAGCTG ATGCGCTACG ACTTCGCGAG CCGCGCGCTG
ACGCGGCTCA CGGCGGCGAT TCCCTGGGAC GTCGGCGCGG CCCAGCTCAG CGACGACGGC
GCGCTGGCGG CGCTGAAGAT GAACGTCGAC GGTCGCGAGG AACTGCGCGT CGTCGACACC
CGCACGCTGC GCGAACGAGC GCTCCCCGCG CTGCCGGCCG GCGGCGTGAA TGCGGTGCGC
TTCCAGCCGC GCACCCACCG GCTCGCGGTG GCGATCGACA GCGCCCAGGG GCCGGGCCGC
CTGTTGGCGC TCGACGTCGA CCGCGGCACG GTGCAGCCGT GGACACGGCC CCACGTGCCG
CCGGGCCTGG ACACCACCAG CTTCCCCGAC CAGCGCATCG TGCGCTGGAC CAGCTTCGAC
GGCCTGAGCA TGTCGGCCGT GCTCAGTCCG CCGCCGGCGC GCTTCACCGG CAAGCGGCCG
GTCGTGGTGC TGGTGCACGG CGGCCCCGAA GCGCAAGCGA CGATGGGCTT CCTGGGCCGC
TGGAGCTACC TTGTCAACGA GCTCGGCGTC GCCATCGTCG AGCCCAACGT GCGCGGTTCC
TCGGGCTATG GCAAGACCTT CCTCGCGCTC GACAACGGCA TGAAGCGCGA GGACGCCGTC
AGGGACCTCG GCACGCTGCT CGACTGGATC GCCACGCAGC CGGACCTCGA CGCGGGCCGC
GTGCTGGTGG TCGGCGGCAG CTACGGCGGC TACATGAGCC TGGCCGCGAG CGTGCACTTC
GCCGACCGCA TCGCCGGTGC GATCGACATC GTCGGCATCT CCAGCTTCGT GAGCTTCCTG
AACAACACCG AGAGCTATCG GCGCGACCTG CGCCGCGTGG AATACGGCGA CGAGCGCGAT
CCGGCGATGC GCGACTTCCT CGAACGCATC TCGCCGCTGA ACAACGCACA GAAGATCCGC
AAGCCGCTGT TCGTGATCCA GGGTCGCAAT GACCCGAGGG TGCCGTGGAC CGAGGCCGAA
CAGATCGTCG AACGCGTCCG GCAGACCGGC ACGCCGGTCT GGTACCTGCT CGCCGAGAAC
GAGGGGCACG GCTTTCGCCG CAAGGAGAAC GCCGACTACC AGTTCTACGC GATGCTGCTC
TTCATGCAGG AGACGCTGCT GAAGTCGGCC GGGCGCCAAG ACTGA
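
The GC content reported for this gene is the fraction of G and C bases in the sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace; `gc_content` is an illustrative name, and only the first row of the sequence is used here, so the result describes that row rather than the whole gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First row of the gene sequence above.
FIRST_ROW = "ATGCCCCGCA CGATCCTCCT CACCGCCTCG CTGCTCACCG GCCTGCTGAT GCTGCAGGCC"
print(f"{gc_content(FIRST_ROW):.0%}")
```

Passing the full gene sequence instead of `FIRST_ROW` gives the value to compare against the GC content field.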
 
Protein sequence
MPRTILLTAS LLTGLLMLQA CGTPAARSPA AGTLLPNENL VVQGIPAIPV SLVQQVERYT 
DFRGHALADW HPARDEMLVS HRPAGANTAQ LFRVPGPLAA PVALTDFADP VTEASYEPVL
GRYLVFSRSP GGNEAEQLYR LDAGKRQPTL LTDPDQRHDA IGWLHQSAQL LYTAVPLDRT
AEGGSRATPT TTLWLVDPLQ PSSRRRIAEL PGTGWFGGEI SRDDKQLAIT RYRSATDSQV
WLIDLADGTP TQLLPAVGEP LQATHFPVGY SPDGASLYVL SDRAGEFREL MRYDFASRAL
TRLTAAIPWD VGAAQLSDDG ALAALKMNVD GREELRVVDT RTLRERALPA LPAGGVNAVR
FQPRTHRLAV AIDSAQGPGR LLALDVDRGT VQPWTRPHVP PGLDTTSFPD QRIVRWTSFD
GLSMSAVLSP PPARFTGKRP VVVLVHGGPE AQATMGFLGR WSYLVNELGV AIVEPNVRGS
SGYGKTFLAL DNGMKREDAV RDLGTLLDWI ATQPDLDAGR VLVVGGSYGG YMSLAASVHF
ADRIAGAIDI VGISSFVSFL NNTESYRRDL RRVEYGDERD PAMRDFLERI SPLNNAQKIR
KPLFVIQGRN DPRVPWTEAE QIVERVRQTG TPVWYLLAEN EGHGFRRKEN ADYQFYAMLL
FMQETLLKSA GRQD
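
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below illustrates this on the first and last codons only. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative, not a database tool.

```python
# Codon table in NCBI order: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence above.
print(translate("ATGCCCCGCA CGATCCTCCT CACCGCCTCG"))  # MPRTILLTAS
print(translate("GGGCGCCAAG ACTGA"))                  # GRQD
```

Both fragments match the corresponding ends of the protein sequence listed above.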