Gene Mpe_A0281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0281 
Symbol 
ID4786890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp306357 
End bp307454 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content69% 
IMG OID640088833 
Productputative SMF protein 
Protein accessionYP_001019478 
Protein GI124265474 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0175905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAGC GTGACGATCT CGCAGCGTGG CTGCGCTTGC TCGAAACTCC GTCGATCGGG 
AGGGATACCG CGCGCCGCCT GCTGGCGGCT TTCGGTTCCC CTCAGGCCGT CTTTGACGCT
CCGCCCGTCG CCCTGAGAGA GCTGCTGACG GCGGAAAAGG CCGCGGCGTT GCGGTCGCCG
CCCCCCTCGC TCGACGTACT GATCGAAGCG ACTTGGCAGT GGCTTGACGC AGGGGAGGAG
CGCCACGTGG TGGCGCTCGG CGACCCGGCC TACCCCCGCG CTTTGCTCGA GACCGCCGAC
CCACCCCTGC TGATCCATGC CGTGGGCCGC TTGGCGTTGC TGAATGCACC GAGCGTGGCC
GTGGTTGGCA GCCGCAATCC GACGCCGCAG GGCGCCGAGA ACGCCCGCGC CTTCGCCACC
GCCCTGAGCC ACGCCGGCCT GACCGTTGTA TCCGGGCTGG CGCTCGGCAT CGATGGCGCC
GCCCACGACG GTGCACTGGC AGGCGAGGGG TCGACGATCG CGGTCGTCGG CACCGGGCTC
GATCGCGTCT ATCCGAAGCG GCATCTGAAG TTGGCTCACC GGATTGCCCG CGATGGCCTG
ATGGTGAGCG AGTACGCGCC GGGAACGCCC CCGATTGCCG CACACTTCCC ACTGCGCAAC
CGCCTGATCG CCGGCCTGAC CCGAGGTACG CTGGTGGTCG AGGCCGCGTT GCAGTCGGGC
TCGCTGATCA CCGCACGGCT GGCGCTCGAA GCCGGTCGGG AGGTCTTCGC GATTCCGGGC
TCGATTCACG CGCCCCAGTC ACGCGGCTGT CACGCGCTGA TCAAGCAGGG CGCCAAACTG
GTCGACAGCG CGGCGGACAT CCTTGAGGAA CTGCGGTGGT TCGACGCGCC AGACAGGCCC
TCGCCCACCA CGTCAAGCCC GTCCGTAGAA GACCCGGTAC TGGCCGCCCT CGGCCACGAT
CCCGTCACTC TTGACGCACT GAGCGCCCGG ATCGGGTGGC CCCCAGCCGA ATTGAGCGCT
CGCTTACTGG CACTTGAACT GAGTGGCGAC GTGGTTCGTT TGCCAGGCCA GTTGTTCCAG
CGCCTCGTAC AGGCCTGA
 
Protein sequence
MIERDDLAAW LRLLETPSIG RDTARRLLAA FGSPQAVFDA PPVALRELLT AEKAAALRSP 
PPSLDVLIEA TWQWLDAGEE RHVVALGDPA YPRALLETAD PPLLIHAVGR LALLNAPSVA
VVGSRNPTPQ GAENARAFAT ALSHAGLTVV SGLALGIDGA AHDGALAGEG STIAVVGTGL
DRVYPKRHLK LAHRIARDGL MVSEYAPGTP PIAAHFPLRN RLIAGLTRGT LVVEAALQSG
SLITARLALE AGREVFAIPG SIHAPQSRGC HALIKQGAKL VDSAADILEE LRWFDAPDRP
SPTTSSPSVE DPVLAALGHD PVTLDALSAR IGWPPAELSA RLLALELSGD VVRLPGQLFQ
RLVQA