Gene Mpe_A1875 details

Gene Information

Locus tag: Mpe_A1875
Symbol:
ID: 4786755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2002014
End bp: 2004179
Gene Length: 2166 bp
Protein Length: 721 aa
Translation table: 11
GC content: 69%
IMG OID: 640090445
Product: putative ABC transporter
Protein accession: YP_001021068
Protein GI: 124267064
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID: [TIGR03375] type I secretion system ATPase, LssB family
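
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function name cds_lengths is illustrative only and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa, leftover bases).

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute a residue to the protein.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # whole codons and any leftover bases
    return span, codons - 1, remainder    # drop the stop codon from the count


# Values copied from the record above.
span, residues, remainder = cds_lengths(2002014, 2004179)
print(span, residues, remainder)  # 2166 721 0
```

Under these assumptions the result agrees with the listed gene length (2166 bp) and protein length (721 aa).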


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.147039
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTAC CGCCCTCTTC CACCGCCACG ACCCGCCTGC GCGAAGACCT GATCCACCCC 
GACCCGCTGC TGGACTGTCT GCTGGAAGTC TGCCGCGTGC ACGGCGTGGC CGCATCGCGC
GCATCGCTGT CGGCGGGCCT GCCGCTGGTC GACGGCCGCC TGACGCTGGC CTTGGCCGAA
CGTGCTGCCG ACCGGGCGGC GATGACCGCG CGTCTGCAGC GCATGGCGCT CGATGCGGTC
GATACGGCCA CGCTCCCCGC CATCGCCGTG CTGCACGACG AGCGCGCCTG CGTGCTCCTT
GGCCGCGATG CCTCCGGCGC CTGGCAGGTG CTGATGCCGG AGACCGGCGC CGGTGCCGTG
ACGCTGGACG CCGACCAGCT GGCCGAGCGC TACAGCGGCG TGATGCTGTT CGTGCGGCCG
CAGTTCCGCT TCGACGAACG TGCGCCCGAG GTGCGCGCCA CCCGCGCCGG CCACTGGTTC
TGGGGCGCCG TGCTGGCGCA GCGGAACGTG TACCGCGACG TGCTGTGGGC GGCACTGCTG
ATCAACCTGT TCGCGATCGC CTTCCCGCTG TTCTCGATGA ACGTCTACGA CCGCGTGGTG
CCCAACCATG CGGAGGAGAC GCTGTGGGCG CTGGCGATCG GGGTGCTGAT CGTCATCAGC
GCCGATCTGT TCATGCGCAC GCTGCGCAGC CATTTCGTCG ACGAGGCCAG CGCCCGCATC
GACGTGCAGA TCTCGGCCAC GCTGATGGAG CGCGTGCTGG GGATGCAGCT CTCTCACAGG
CCGGCGGCAG TCGGCTCGTT CGCGTCCAAT CTGCGCGGCT TCGAGCAGGT GCGCGACTTC
ATCGCCTCGA GCACCGTCAC GGCGCTGATC GACCTGCCGT TCGCCCTGCT GTTCATCGGC
GTGATGTTCT GGCTGTCGCC CTGGCTCGCT GCGCCCGTGG TAGTCGCCTT CGTGCTGGTC
GTGGTGGTCG GCTACATCCT GCAGCACCGG CTGCACGAGC TGTCGCAGGC CACCTGGCAG
GCCGGCGCCC AGCGCAACGC GACGCTGATC GAGAGCCTGA CCGCGATCGA GACCGTCAAG
ACCCAGGGTG CCGAAAGCGT GATCCAGGCC CGCTGGGAGC AGACCAATGC CTACCTGGCC
GGCATCAACA TGCGCATGCG CGGCCTGTCG TCCACCGCGC TGTCGGCCAC GGCCTGGCTG
ACGCAGTTGG TGAGCGTGTC GCTGATCGTC ATCGGCGTCT ACCTGATCGG CGACCGCCAG
CTGACGATGG GTGCGCTGAT CGCCGCCACG ATGCTCAGCG GCCGTGCGCT GGCGCCGGCC
GGGCAGATCG TCGGCCTGCT GCTGCAGTAC CAGGGGGCGG TGACGGCGCT CGAATCGCTG
GAGAAGATCA TGGCCCAGCC CGTCGAGCGG CCGGCCGGCA ATGCCTTCAT CCACCGGCGC
GAACTGCGCG GCGAGATCGA GTTCCGGGAC GTCCATTTCG CCTACCCGGG ACGTTCCGAC
AGTGCGCTCG ACGGCGTGAG TTTCAAGATC GCCGCCGGCG AACGGGTGGC GCTGATCGGC
AAGGTCGGCT CCGGCAAGAG CACGATCGAG AAGCTGATGC TGGGCCTGTA CGCCCCCACC
GCGGGCGCCG TGCTGCTCGA CGGCATCGAC CTGCGCCAGC TCGACCCGGC CGACGTGCGA
CGCAACCTCG GCTACGTGTC GCAGGACGTG ACGCTGTTCT TCGGCAACCT GCGCGAGAAC
ATCGCCTTCG GTCTGCCCTA CGCCGATGAC GAGGCCATCG TCGCGGCGGC CGAGGTGGCC
GGACTCAGCG AGTTCGTCAA CCGCCATCCC CGCGGCTTCG ACATGCCGGT GGGCGAACGC
GGCGAATCGC TGTCGGGCGG CCAGCGCCAG AGCGTCGGAC TGGCCCGCGC AGTGCTGCAC
AACGCGCCGA TCCTGCTGCT CGACGAGCCG ACCAGCGCCA TGGATTTCGC CACCGAAGCT
CACGTCACGT CGCGCCTCGC TTCGATCGCC AGCAACAAGA CGGTGGTGCT GGTCACGCAC
CGCACCTCGC TGCTGCCGAT GGCCACGCGG CTGATCGTCG TCGACCAGGG ACGTGTTGTC
GCCGACGGGC CGCGTGAGCA GATCATGCAG GCGCTTGCCG CCGGTCGCAT CACGAGGGCG
GCCTAG
 
Protein sequence
MTLPPSSTAT TRLREDLIHP DPLLDCLLEV CRVHGVAASR ASLSAGLPLV DGRLTLALAE 
RAADRAAMTA RLQRMALDAV DTATLPAIAV LHDERACVLL GRDASGAWQV LMPETGAGAV
TLDADQLAER YSGVMLFVRP QFRFDERAPE VRATRAGHWF WGAVLAQRNV YRDVLWAALL
INLFAIAFPL FSMNVYDRVV PNHAEETLWA LAIGVLIVIS ADLFMRTLRS HFVDEASARI
DVQISATLME RVLGMQLSHR PAAVGSFASN LRGFEQVRDF IASSTVTALI DLPFALLFIG
VMFWLSPWLA APVVVAFVLV VVVGYILQHR LHELSQATWQ AGAQRNATLI ESLTAIETVK
TQGAESVIQA RWEQTNAYLA GINMRMRGLS STALSATAWL TQLVSVSLIV IGVYLIGDRQ
LTMGALIAAT MLSGRALAPA GQIVGLLLQY QGAVTALESL EKIMAQPVER PAGNAFIHRR
ELRGEIEFRD VHFAYPGRSD SALDGVSFKI AAGERVALIG KVGSGKSTIE KLMLGLYAPT
AGAVLLDGID LRQLDPADVR RNLGYVSQDV TLFFGNLREN IAFGLPYADD EAIVAAAEVA
GLSEFVNRHP RGFDMPVGER GESLSGGQRQ SVGLARAVLH NAPILLLDEP TSAMDFATEA
HVTSRLASIA SNKTVVLVTH RTSLLPMATR LIVVDQGRVV ADGPREQIMQ ALAAGRITRA
A
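
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch, not the tool used to produce this record, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names (clean_sequence, gc_fraction, translate) are made up for illustration. The codon table is written out in the standard TCAG order; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which this sketch does not model.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate whole codons from position 0; '*' marks a stop, 'X' an unknown codon."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(0, len(dna) - 2, 3)
    )


# First line of the gene sequence above.
first_line = "ATGACGCTAC CGCCCTCTTC CACCGCCACG ACCCGCCTGC GCGAAGACCT GATCCACCCC"
print(translate(clean_sequence(first_line)))  # MTLPPSSTATTRLREDLIHP
```

The printed residues match the first 20 residues of the protein sequence above. Running gc_fraction on the full cleaned gene sequence gives a value that can be compared with the GC content listed in the record.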