Gene Mpe_B0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0052 
Symbol 
ID4787655 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp42320 
End bp43669 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content67% 
IMG OID640092461 
ProductDNA-directed DNA polymerase 
Protein accessionYP_001023066 
Protein GI124262596 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00154189 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCATAC AGTACGGTGT CTGGCGGATG TACGCCCTCG TCGACGGCAA CAACTTCTAC 
TGCTCGGTCG AGCGCGTTTT CCGGCCGTCG CTGACGGGCC GGCCCGTGGT GGTCATGGGC
TCGAACGACG GCTGTGTCAT TGCCCGGTCC AACGAGGCGA AGGCGCTGGG CGTGAAGATG
GGGGAGCCCT GGTTTCAATG TCGACACCTC GAGAAGGACC AGGGCCTCGT GGCGCTGTCG
GCCAACTTCA CCCTCTACGG CGACATGTCC GAGAGGATGA TGACCGTGGC CGGTTCCTTC
GCGCCACGCC AAGAGGTTTA CAGCATCGAC GAGTGCTTCC TGGACTTCGC CGGCATGCCG
GGCGGACCAG GTTCCCTCGT CGAAACTGGC CGCGCGCTGC GTGAGAAGGT TCTTCGGTGG
ACCGGCATTC CCACCTGCGT CGGCTTCGGC CCGACCAAGA CGCTCGCGAA GCTGGCGAAC
CACATCGCCA AGTCGGCCGA GCGCAAGCCG GGGTCCTACC CTGAGCAGCT TGCTCAGGTC
TGCAATCTGG GCGAGCTGGC AGATGAAGTC CGGCGGGAGC TCCTGGTCAA GACGGAGGTC
AAGGAAGTCT GGGGCGTAGG GCCGCGCATC GGCGCCCAGC TCAACGCGGC TGGAGTCTCG
ACCGTGCTCG ACCTAGTCCG GCTGGACCCC GCGGCTGTCC GGCGACGCTT CTCGGTGGTG
CTCGAGAAGA CCGTGCTCGA ACTGCGCGGC ATCAGCTGCC TGACGCTCGA AGACGCGCCG
GCGAGCCGGC AGCAGATCAT GTGTTCGCGC TCCTTCGGGC GGCCGGTGCT CGAGATGGAG
GGACTGGTCG AGGCGTTGAG CGACTTCGCG GCGCGCGCGG CCGAGAAGCT GCGCAAGCAG
GAGAACCTCG CCGGCGCCGT GCACGTCTTC ATCACCACGA GCCCCTTCCG CAAGGAGGAC
CGGCAGTACA GCCGTTCGGT CACCGTGCCG CTGGTCCGAC CGACAGGAGA CACGCGGGTG
CTGGTCAGCG CCGCCATCCT CGCTCTGCGG GCGGTGTTCA AGCCTGGCTA CCGCTACGCG
AAAGCGGGGG TCATGCTCAT GGAACTGCAG CCCGAGTCGG TGCACCAGGC CGAGCTCGAC
CTGGGAGAGC CCGAGCCCGG CACTGTCCCG CGCGACCGCA GCAAGCTGAT GTCCGCGGTC
GACGCTGTGA ACCGGCGCCA CGGCCGCGGC TCACTAATGG TGGCGAGCGC CGGTCTCGCA
ACTGCGCGAC GCGAGTTCGT GCCGAAACAG GAGCGGCGAA CGCCCCACTA CACAACGTCT
TGGGACGACA TGCCGGTGGC GCGCGCCTGA
 
Protein sequence
MSIQYGVWRM YALVDGNNFY CSVERVFRPS LTGRPVVVMG SNDGCVIARS NEAKALGVKM 
GEPWFQCRHL EKDQGLVALS ANFTLYGDMS ERMMTVAGSF APRQEVYSID ECFLDFAGMP
GGPGSLVETG RALREKVLRW TGIPTCVGFG PTKTLAKLAN HIAKSAERKP GSYPEQLAQV
CNLGELADEV RRELLVKTEV KEVWGVGPRI GAQLNAAGVS TVLDLVRLDP AAVRRRFSVV
LEKTVLELRG ISCLTLEDAP ASRQQIMCSR SFGRPVLEME GLVEALSDFA ARAAEKLRKQ
ENLAGAVHVF ITTSPFRKED RQYSRSVTVP LVRPTGDTRV LVSAAILALR AVFKPGYRYA
KAGVMLMELQ PESVHQAELD LGEPEPGTVP RDRSKLMSAV DAVNRRHGRG SLMVASAGLA
TARREFVPKQ ERRTPHYTTS WDDMPVARA