Gene Mpe_A3087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3087 
Symbol 
ID4786660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3282565 
End bp3283656 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID640091658 
Productputative zinc protease protein 
Protein accessionYP_001022275 
Protein GI124268271 
COG category[R] General function prediction only 
COG ID[COG4324] Predicted aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.222342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGC GTGCACGGTC CTGGTGGTTC ACTCTTGTCG CGGTCGGTGC TGCGCTGGCG 
CTGCTGGGCG GTTGCGGCAG CGTCGCCTAC CTCTCGCAAT CGGTGCAGGG CCACCTCGGC
GTGATGCGCG CCGCGAAGCC GGTCGACGAC TGGCTGGCCG ACGCCGGCAC GCCGGCCGTG
CTGCGCGAGC GCCTGCTGCT GAGCCAGCGC ATCCGCGACT TCGCTGTCCA GGAGCTGGGC
CTGCCCGACA ACGCCAGCTA CCGCCGCTAT GCCGACCTCG GCCGGCCTGC CGTGGTGTGG
AACGTGGTCG CGGCGCCCGA GCTGTCGCTG AGGTTGAAGA CCTGGTGCTT CCCGGTGGTC
GGCTGCGTCG GCTACCGCGG TTACTTCGAC CGCGGCGCGG CCGACGCGCT GGCGGCCGAG
TTGCTGTCCC AGGGCCAGGA GGTCAGCGTC TACGGTGTGC CTGCCTATTC CACGCTCGGC
AAGCTGCCCG GTGACTTCTT TGCCGATCCG CTGCTCAATA CCTTCATCGG CTACCCCGAA
GGCGAGCTGG CACGGCTGAT CTTCCACGAG CTGGCGCACC AGGTGGCCTA TGCAAAGGAC
GACACCGAGT TCAACGAAAG CTTCGCGACC GCGGTCGAAC GCCTCGGTGG CGAGCGCTGG
CTCGCGCAAC GGGCCGATGT GTCGGCACGC GAGGAGTACG AGCGCTACGA CGCACGCCGC
CGCGACTTCC GCACGCTCGT GCTCGCCACC CGCACGCAGC TCGACGCGCT GTACCGCGGG
CCCGGCAGCG AAGCCGACAA GCGTGCCGGC AAGGCCACGT TGATGGCGCA GATGCGCGCC
GAACACGCGC GCCTCAAGGC AGGTCCGTGG GCTGGCTACG GCGGCTACGA CGCCTGGTTC
GCGCGGGCCA ACAACGCCAG CCTGGGGGTG CAGTCGGCCT ACAACGCGCT GGTGCCGGGC
TTCGAGGCAC TGTTCGCCGC CGAGGGTCGC GACTTCGCGC GTTTCTACGC CGAGGTGCGG
CGCCTCGCCA GCTTGCCGCA GGCCGAACGC CGCGCCACAC TCGGAGCCGG CCGTCAGCTG
CCGCCGCCCT GA
 
Protein sequence
MLLRARSWWF TLVAVGAALA LLGGCGSVAY LSQSVQGHLG VMRAAKPVDD WLADAGTPAV 
LRERLLLSQR IRDFAVQELG LPDNASYRRY ADLGRPAVVW NVVAAPELSL RLKTWCFPVV
GCVGYRGYFD RGAADALAAE LLSQGQEVSV YGVPAYSTLG KLPGDFFADP LLNTFIGYPE
GELARLIFHE LAHQVAYAKD DTEFNESFAT AVERLGGERW LAQRADVSAR EEYERYDARR
RDFRTLVLAT RTQLDALYRG PGSEADKRAG KATLMAQMRA EHARLKAGPW AGYGGYDAWF
ARANNASLGV QSAYNALVPG FEALFAAEGR DFARFYAEVR RLASLPQAER RATLGAGRQL
PPP