Gene Mpe_A0006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0006 
Symbol 
ID4787255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp7643 
End bp8881 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content62% 
IMG OID640088553 
Productrestriction modification system, type I 
Protein accessionYP_001019203 
Protein GI124265199 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCG ATTGCTACGT GGATGCCGGG GTTCGTGTTG TCCGCGGCAC CAACTTGACT 
GGCGGCCGAT CGTTCTCGGG TGAGTTTGTA TTCATCACGC CAGAGAAGGC TGTAGAACTC
AATTCGGCGA ACCTGTCGCC GAATGACTTG GTCTTCCCTC ATCGTGGCGC TATTGGCGAA
GTTGGCATCG TCCCGGAAGA CGGCGAGAGG TACGTTCTGT CCTCAAGCCT GATGAAGCTG
ACATGCGATG TGGCCCGTGC ACACCCGGAC TTCGTCTACT ACTTTTTCAA GTCTGCGATT
GGGCGCTTCG AACTTCTCAA GAACTCATCG CAGGTCGGCA CGCCGGGTAT TGGCCAGCCA
CTGACATCAC TCAAACAAAT CAAGCTGAGG CTGCCGCCAG TCGGCGAGCA GGTAGCGATT
GCGGCCGCTC TGCGTGCTCT CGACGACCGC ATCGCCCTCC TGCGCGACAC CAACGCCACC
CTCGAAGCGA TCGCGCAGGC GCTGTTCAAG TCGTGGTTCG TCGACTTCGA TCCCGTTCGC
GCCAAGAGCC AAGGCCTCGC CCCGGCCGGC ATGGACGAAG CCACGGCGGC CCTGTTTCCA
GAGGGGGTCG AGGAGTCTGC TTTGGGGCCA GTGCCCAGGG GGTGGCGCGC TGCAACGTTG
GCAGAAACCT TCGAGATCAA TCCCTCGCGC AGCCTTCCGA AGGATTCAGA GGCGAAGTAC
CTCGAGATGG CCGGTGTGCC GACCACGGGC CATTGCGCCG AGTCGATCGC GGTGCGTGCC
TTCGGGTCCG GCACCAAGTT TCGGAACGGC GACACGCTGC TGGCGCGCAT CACGCCCTGC
CTCGAAAACG GCAAGACGGC GTTCGTCGAT TTCCTCGTGG AAGATGAGAT CGGCTGGGGA
TCGACAGAGT TCATCGTGCT GCGGCCTAAG GCGCCGCTGC CCGATTACTT CGCCTATCTG
CTGTGCAGAC ACGCACCGTT TCGCGAGTTT GCCGAGCGCA GCATGTCAGG GACGAGTGGA
CGTCAGCGGG TGCAGAACGA TGTGCTCGCG ACCTATCGGA TTGCCGTGCC GCCATCAGCA
GTTGCAGAAG CTTTCGGCGC GCTGATCAAT CCACTGCGGC ACGCGATCAC GAGCAACCAT
GCGAGGGGAG CAACCCTTGG CGCGCTGCGT GATGCGCTGT TGCCTCGTCT GATCTCCGGC
CAACTCCGCC TGCCTGACGC TGTGGCGCTG GCCGCCTGA
 
Protein sequence
MKSDCYVDAG VRVVRGTNLT GGRSFSGEFV FITPEKAVEL NSANLSPNDL VFPHRGAIGE 
VGIVPEDGER YVLSSSLMKL TCDVARAHPD FVYYFFKSAI GRFELLKNSS QVGTPGIGQP
LTSLKQIKLR LPPVGEQVAI AAALRALDDR IALLRDTNAT LEAIAQALFK SWFVDFDPVR
AKSQGLAPAG MDEATAALFP EGVEESALGP VPRGWRAATL AETFEINPSR SLPKDSEAKY
LEMAGVPTTG HCAESIAVRA FGSGTKFRNG DTLLARITPC LENGKTAFVD FLVEDEIGWG
STEFIVLRPK APLPDYFAYL LCRHAPFREF AERSMSGTSG RQRVQNDVLA TYRIAVPPSA
VAEAFGALIN PLRHAITSNH ARGATLGALR DALLPRLISG QLRLPDAVAL AA