Gene Mpe_A2068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2068 
Symbol 
ID4784643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2212500 
End bp2214872 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content71% 
IMG OID640090636 
Producthypothetical protein 
Protein accessionYP_001021259 
Protein GI124267255 
COG category[V] Defense mechanisms 
COG ID[COG0577] ABC-type antimicrobial peptide transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.847092 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCC TCGACCGCAA GCTGCTGCGC GACCTGCGCC TGATGTGGAG CCAGGCGCTG 
ACGATCGCGC TGGTGGTCGC TGCCGGCATC GCCGGCTTCA TCGCCAGTCT GTCGGCAGTG
GACTCGCTGG CCGCCGCGCG CGACGACTAC TATGCCGCCG GCCGCTTCGC CGACGTGTTC
GCGAACCTGA AGCGCGCGCC CGACTCGTTG CAGGTCACGC TGCGCGAGCA GCCCGGCGTG
GCCGACCTGG AAGCCGGCGT CGAGACGATG GCGCGCATCA CGTTGCCCGG CGTGCCAGAT
CCCATCATCG GTCAACTCGT CGGCCTGGAC CTGGAACGCC CACAACGGCT CAACGTGGTG
AGCGTCGCCG CCGGCGCGGC GCTGCAGCCC TTCGACGCCG ACGGGGGCGC ACGCAGCATC
CCGGTGCTGG TGTCGCGCGG CTTTGCGCAG GCGCGCGGGC TGAAACCCGG CGCCACGCTG
CAGGCGCAGA TCAACGGCCG CTTGCGCACC CTGCGCGTGA GCGGGCTGGC GCTGTCGCCG
GAGTTCGTGT TTGCGGGCCT CTGGGGCATG CCGGACCAGC GCGGATACGG CATCTTCTGG
GTCGACCGCC GCGTCCTCGC CGCCGCCATC GACATGGACG GCGCGTTCAA CCGCCTCGCG
GTGCGGCTGG CGCCGGGCGC CGACACCACC GCGACGATGG CTGCGCTGGC GCGCCGCCTG
GCGCCCTACG GCGGCCGCGA TGTGCACGGC CGCGCCGACC AGGCTTCGCA CCAGATGCTG
GACAACGAGA TCAAGGCGCA GCACGTCATC GGCACCGTGG TGCCGGCGAT CTTCCTCGGC
GTCGCGGCCT TCCTGCTGCA CGTGGTGATC TCGCGCCTGG TCGGCACACA GCGCGAACAG
ATCGCGGCGT TGAAAGCGCT GGGCTATACC GACCGCGCGA TCGGCATGCA CTACCTGAAG
CTGGTTCTGG CGATCGTGAT CGTCGGTCTG GGCCTGGGTC TGCTGCTGGG CCGCTGGATG
GGAACGATGC TCACGGGACT CTACGCCGAG CTGTTCCAGT TCCCGCGCTT CGAACACCGC
CTCGCGCCCT GGCTGGCGGC GACCAGCGCG GCGGTGGCGC TGCTCACCGC GGTGCTGGGC
ACGCTGAGCG CGGTGCTGGC CACGGTGCGC TTGCCGCCGG CCGAGGCGAT GCGCCCGCCG
GCGCCCGACC GCTACCGCCG CACCTTGCTG GAGCGGCTGG GCGTGCAGCA CATGGCGCCG
GCGCTGCGCA TGGTCATCCG CAACATGGAG CGGCGGCCCC TGCGCACCGC CTCCAGCATC
ACCGGCATCG CCGCCGCGGT GGCCATCACG ATCATGGGCA ACTTCTTCCG CGACGCGATC
GACGTCATCG TGCACACACA GTTCGAGCTC GGCCTGCGCG GCGACGTCAC GGTCTGGGCT
GTCGAGCCGG TCGATGACGC GGCGCGCCTC GAGCTGATGC GGCTGCCTGG CGTGCGCCAG
GTCGAATCGA CGCGCTTCGT GCCAGTGACG CTGGTGCACG GCCACCGCCA CGAACGCGGC
CTGATCCGCG GGTATGCCGC CCGGCCCGCG CTGTACCGCG TGGTCGATCT CGACGGGGCG
GTCATACCGC TCGCCGGCGA TGGCGTCGTG CTCTCCGACC GGCTGGCCGA CAAGCTCGGG
CTGCGGGTCG GCGACACCGT GCAGGCCGAA CTGCTGACCG GCGAGCCGCG CACGCTGGCG
CTGCGCGTCG ATGCCACGGT GCGCGAGATG ATGGGCCTCA ACGCCTACAT GGAGCGCGGT
GCTCTCAACC GCGCGCTCGG CGACAGCGAC GTCTCCACCG GCTGGGTGCT AGGCGTCGAG
CCGGGCCGCG AGGCCGCGCT GCTGGAGGCC AGCAAAGCGC TGCCCCGCGC AGCCGGTGCC
TTCAGCAAGG CGACGATGCT GCGCAACATG CAGGAGATCA GCGCGCGCAA CGTGCGCATC
ACGAGCACGG TACTGACGCT GTTCGCCGCG ATGATCTCGA TCGGCGTGGT CTACAACAAC
GCGCGCATCG CGCTCGCCGA GCGCGGCTGG GAGCTGGCCT CGCTGCGCGT GCTCGGCTTC
ACGCGCGCCG AGGTCTCGGC GCTGCTGCTG GGCGAGTTGG CGCTGGCGAT TGCGCTTGCG
CTGCCGCTCG GCATGGCGCT CGGCTGGGCA CTGGTGCACG GCGTCAACGA GCTGCTTCGC
TCGGACCAGT TCCTGTTCCC CGCCACGATC CGCCCGCGCA CGTACGCCTG GGCCGCGTTG
TGTGTCGCGG TTGCCGGAGT GGGCAGCGCG CTCGTGGTGC GCCGGCGCAT CGACCGGCTC
GACCTGGTGG CGGTGCTGAA GACGAGGGAG TGA
 
Protein sequence
MKALDRKLLR DLRLMWSQAL TIALVVAAGI AGFIASLSAV DSLAAARDDY YAAGRFADVF 
ANLKRAPDSL QVTLREQPGV ADLEAGVETM ARITLPGVPD PIIGQLVGLD LERPQRLNVV
SVAAGAALQP FDADGGARSI PVLVSRGFAQ ARGLKPGATL QAQINGRLRT LRVSGLALSP
EFVFAGLWGM PDQRGYGIFW VDRRVLAAAI DMDGAFNRLA VRLAPGADTT ATMAALARRL
APYGGRDVHG RADQASHQML DNEIKAQHVI GTVVPAIFLG VAAFLLHVVI SRLVGTQREQ
IAALKALGYT DRAIGMHYLK LVLAIVIVGL GLGLLLGRWM GTMLTGLYAE LFQFPRFEHR
LAPWLAATSA AVALLTAVLG TLSAVLATVR LPPAEAMRPP APDRYRRTLL ERLGVQHMAP
ALRMVIRNME RRPLRTASSI TGIAAAVAIT IMGNFFRDAI DVIVHTQFEL GLRGDVTVWA
VEPVDDAARL ELMRLPGVRQ VESTRFVPVT LVHGHRHERG LIRGYAARPA LYRVVDLDGA
VIPLAGDGVV LSDRLADKLG LRVGDTVQAE LLTGEPRTLA LRVDATVREM MGLNAYMERG
ALNRALGDSD VSTGWVLGVE PGREAALLEA SKALPRAAGA FSKATMLRNM QEISARNVRI
TSTVLTLFAA MISIGVVYNN ARIALAERGW ELASLRVLGF TRAEVSALLL GELALAIALA
LPLGMALGWA LVHGVNELLR SDQFLFPATI RPRTYAWAAL CVAVAGVGSA LVVRRRIDRL
DLVAVLKTRE