Gene Mpe_A1942 details

Gene Information

Locus tag: Mpe_A1942
Symbol:
ID: 4786703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2077910
End bp: 2079364
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 70%
IMG OID: 640090512
Product: leucyl aminopeptidase
Protein accession: YP_001021135
Protein GI: 124267131
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
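
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon. The record does not state either convention, but both match the reported values.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is counted in the gene length but not in the protein.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2077910, 2079364)
print(length)                     # 1455, matching "Gene Length"
print(protein_length_aa(length))  # 484, matching "Protein Length"
```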


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.189087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0962979
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACTTTC GCATTCTCGC CACGTCGACC TCCAGCCTTT CGAAGGTCAG CGCCGATGCG 
CTCTTGATCG TGGTGGCCGG CGATCCGAAG CAGCAAGGCC TGGACGATCC CCTGGATCGC
GTGCTGTCCG ATGCGATCGC GGCCGGCGAC CTGGCCTTCA AGGCCGGTCG CACGCTCTAT
CTGCACCGTC CGCTGGGCGT GAAGGCGGCG CGCGTGGTGT TCGCCGTGGC CGGCGATGCC
GGGCCCAAGG CCTTCAAGTC CGCCGTGGCC GCGGGCTTGG GCCTGATCAA GCCGCTGGGT
GTGAAACACC TGGCAGTGGC CGCAAGCGGC GCCATCGAGG ACTCCCATGC CGAGGCCGCC
GCGGCGGCAG CGTCCGACGC CACCTACAGC TACCGCCACA CCAAGCCGAG CGCGAGCCCC
GCGCCGGCGC TGCAGAAGAT CACCCTGCTG GTCGACAAGG CGGAAGCCAA GGCGGCGCAG
ACCGGACTGT CGCGCGGCGC GGCGATCGCC CTTGGCGTGG CGCTGGCGCG CGAGTGCGCC
AACCGGCCCG GCAACCACTG CACGCCGAGC TTCCTGGCGG CCGAGGCGCG CAAGCTCGCC
AAGCTGCCGC GGATCAAGGT CGACGTGCTC GACCGCAAGG CCTGCGAGAA GCTCGGCATG
GGCTCCTTCC TCGCCGTGGC GCAGGGCTCC GACGAGCCCC CGAAGTTCAT CGTGCTGCGC
TATGACGGCG CGTCGCGGAG CGATGCACCG GTGGTGCTGG TGGGCAAAGG CATCACCTTC
GACACCGGCG GCATTTCCAT CAAGCCGGCG GCCGAAATGG ACGAGATGAA GTACGACATG
GGTGGCGCCG CCAGCGTGCT CGGCAGCTTC CGCGCTGTCG CGGAACTTCA GCCGCAGGTC
AATGTGGTGG GCCTCATCCC GAGCTGCGAG AACATGCCCA GCGGGCGCGC CATCAAGCCG
GGCGACGTGG TGACCTCGAT GTCGGGCCAG ACGATCGAGG TGCTCAACAC CGACGCCGAG
GGGCGCCTGA TCCTGTGTGA CGCGCTGACC TACGCCGAGC GCTTCAAGCC GGCCGTGGTG
ATCGACATCG CCACGCTGAC CGGCGCCTGC GTCATTGCGC TCGGTCATCA CCGCAGCGGC
CTGTTCAGCG CCGACGATGC GCTCGCCGAT GCGCTGCTCG ACGCCGGCAG CGCCGGCCTC
GACCCGGCTT GGCGCATGCC GCTCGACGAC GAGTACGAGG AAGCGCTGCG CAGCAACTTC
GCCGACATGG GCAATGTCGG TGGCCGCGCC GGCGGGGCCA TCACCGCGGC GATGTTCCTC
AAGAAGTTCA CAGCCAAGTA CCGCTGGGCG CACCTCGACA TCGCCGGCAC GGCCTGGAAA
TCCGGCGCCG CCAAGGGAGC GACCGGCCGG CCGGTGCCGC TGCTCACGCA CTTCGTGCTG
TCGCGCACCC GTTGA
 
Protein sequence
MDFRILATST SSLSKVSADA LLIVVAGDPK QQGLDDPLDR VLSDAIAAGD LAFKAGRTLY 
LHRPLGVKAA RVVFAVAGDA GPKAFKSAVA AGLGLIKPLG VKHLAVAASG AIEDSHAEAA
AAAASDATYS YRHTKPSASP APALQKITLL VDKAEAKAAQ TGLSRGAAIA LGVALARECA
NRPGNHCTPS FLAAEARKLA KLPRIKVDVL DRKACEKLGM GSFLAVAQGS DEPPKFIVLR
YDGASRSDAP VVLVGKGITF DTGGISIKPA AEMDEMKYDM GGAASVLGSF RAVAELQPQV
NVVGLIPSCE NMPSGRAIKP GDVVTSMSGQ TIEVLNTDAE GRLILCDALT YAERFKPAVV
IDIATLTGAC VIALGHHRSG LFSADDALAD ALLDAGSAGL DPAWRMPLDD EYEEALRSNF
ADMGNVGGRA GGAITAAMFL KKFTAKYRWA HLDIAGTAWK SGAAKGATGR PVPLLTHFVL
SRTR
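
Both sequences above are printed in blocks of 10 residues separated by spaces, so the whitespace has to be removed before they can be used elsewhere. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and this is not necessarily how the database derives its reported value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACTTTC GCATTCTCGC CACGTCGACC TCCAGCCTTT CGAAGGTCAG CGCCGATGCG "
seq = clean_sequence(first_line)
print(len(seq))  # 60 (six blocks of 10 bases)
print(round(gc_fraction(seq), 2))
```

Running the same two helpers over the full gene sequence should give a length equal to the Gene Length field and a GC fraction close to the reported GC content.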