Gene Mpe_A1428 details

Gene Information

Locus tag: Mpe_A1428
Symbol:
ID: 4783710
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1536279
End bp: 1537748
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 71%
IMG OID: 640089994
Product: TldD family protein
Protein accession: YP_001020625
Protein GI: 124266621
COG category: [R] General function prediction only
COG ID: [COG0312] Predicted Zn-dependent proteases and their inactivated homologs
TIGRFAM ID:
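
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names gene_length and protein_length are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1536279, 1537748)
print(length)                  # 1470
print(protein_length(length))  # 489
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.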


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACA CGCTCGCCGC CACCGTTCGC CGCGCCACCG CCGCCCTGCC CGATTCGGTC 
ACGCACTGGA CCGCACGCGG CGTCAGCGAG CGATCCGAAC AGCTCGGCGT GCGCCAGGGC
GTTGCCGAAA CGCCGAAGCG AGTGCGTGAC GAGGGTGTGA TGCTCACCGT CGTCGACGGT
GGCGTGGGCC ATGCGGCCAC CGCCGACACG TCCGGAGCCG GCTTGCGTAC CGCCCTGCTG
CGCGCCCACG CGCTGGCCCG GGCCGGCGCC GGCAAGACGG TGTTCACGCC CGCGGAGCTG
CCGCGCCCGC GCGACAGCGG CCAATACGCC AGCCGCGTCG AGCGACCGCA CCACACCCTG
ACGCTGGGCG ACAAGCTCGA GCTGCTGATG CGCGTGAACG CGGCCTGCCG GATCGACGAG
CGCATCGTCG ACGCGCAGAC CAGCCTGTGG AGCGTCGAGA CCGATCAACT TTTCCTGAGC
AGCGACGGCG CCCACATCGA GCAGCGCTTC GCCTACCTCA CGCCGGCCAT CCAGGTGACG
GCGGTCGACG GCGGCGTGAC GCAGGTGCGG TCCAGCGCCG GACAGTACAA CGGCTACTGC
CAGCAGGGCG GGCTGGAAGT GCTGGACCGC GCCCGCTTCG AGCAGGACGG CCCGCGCGTC
GCGCGCGAGG CGCTGGAGCT CGTGGCCGCA CCCCACTGCC CGAGCGGCCG CATGGACCTG
CTGCTGATGC CGGACCAGAT GATGCTGCAG ATCCACGAGA GCATCGGCCA TCCGCTGGAG
CTGGACCGCA TCCTCGGCGA CGAGCGCAAC TTCGCCGGCA CGAGCTTCGT CACGCTCGAC
CTGTTCGGCC ACTACCGCTA CGGCAGCGAG CTGCTGAACG TGAGCTTCGA TCCGTCACGT
GCGCACGAAT TCGCCGGCTT CGCTTTCGAC GACGACGGCA CGCCGGCCGA GCGCCGCATG
CTGATCGAGC GCGGCGTGCT GCTGCACCCG CTGGGCGGCA GCCTGTCGCA GGCGCGCGCC
CGGGCCGCCG GCCACGACGT GGGCGGCGTG GCCACCACGC GGGCCTGCAG CTGGAACCGC
GCGCCGATCG ACCGCATGAG CAATCTCAAC GTCGAACCCG GGCACAGCAC GCTCGACGAA
CTGATCGCCG CGGTCGACCA CGGCGTGGTG ATGCACACCA ACTGCTCGTG GAGCATCGAC
GACTCGCGCA ACAAGTTCCA GTTCGGCTGC GAGTACGGCC GCATGATCCG CCACGGCCGG
CTGGCCGAGG TGGTGCGCAA CCCGAACTAC CGCGGCGTGA GCGCGACCTT CTGGCGCTCG
CTGGCCGGGG TGGGCGATGC GTCGACCTGC GAGGTGATGG GCACGCCCTT CTGCGGCAAG
GGCGAGCCCT CGCAGGTGAT CCGGGTCGGG CACGCCTCTC CGGCCTGCCT GTTCACCGGC
GTCGACGTGT TCGGCGGGGT CGACGGATGA
 
Protein sequence
MLDTLAATVR RATAALPDSV THWTARGVSE RSEQLGVRQG VAETPKRVRD EGVMLTVVDG 
GVGHAATADT SGAGLRTALL RAHALARAGA GKTVFTPAEL PRPRDSGQYA SRVERPHHTL
TLGDKLELLM RVNAACRIDE RIVDAQTSLW SVETDQLFLS SDGAHIEQRF AYLTPAIQVT
AVDGGVTQVR SSAGQYNGYC QQGGLEVLDR ARFEQDGPRV AREALELVAA PHCPSGRMDL
LLMPDQMMLQ IHESIGHPLE LDRILGDERN FAGTSFVTLD LFGHYRYGSE LLNVSFDPSR
AHEFAGFAFD DDGTPAERRM LIERGVLLHP LGGSLSQARA RAAGHDVGGV ATTRACSWNR
APIDRMSNLN VEPGHSTLDE LIAAVDHGVV MHTNCSWSID DSRNKFQFGC EYGRMIRHGR
LAEVVRNPNY RGVSATFWRS LAGVGDASTC EVMGTPFCGK GEPSQVIRVG HASPACLFTG
VDVFGGVDG
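
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction could be computed from the result. The helper names clean_sequence and gc_fraction are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above (illustration only).
fragment = clean_sequence("ATGCTCGACA CGCTCGCCGC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.7
```

Applying the same two steps to the complete gene sequence gives a value that can be compared with the GC content field reported in the Gene Information section.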