Gene Mpe_A1533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1533 
Symbol 
ID4783551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1653393 
End bp1654793 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content75% 
IMG OID640090100 
Producthypothetical protein 
Protein accessionYP_001020730 
Protein GI124266726 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0110393 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.601822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAGC AGCACATGCT GCTCGGCGCC AACCGCGCGG CGCACGCCCT CGGGGTGCGA 
GCCGGGCAGA GCGTGGCCAC GGCCTTGTCC CTGCTGCCGC AGCTCGTCGT GTTCCCGCGT
GACCGGCCGC GCGAGGCCGC GCTGGTCGAG CGACTGGCCC TGGCGCTCGC TGCGCTCACG
CCCCACCTGA GCCTCATTCC GGACGGGGTG CTGCTCGAAG TGCAGAGCAC GCTGCGGCTG
TTCGGCGGCA TCCATGCGCT GCTGCACCGG GCACAAGCCC TGGCGCGCAC GAGCGGGGTG
CAGGTGCGGA CGGGCTGTGC GCCCACCCCC GGCGCCGCCT GGCTGTTCGC CACCAGCGGC
CTGGCCCGGC GACATGCACT GCAGGCCCGG AGCAGTGCGC AGCAGCTGGA TCGGCTGCCG
GTAGCGAGCC TGGCGCGTCT GCTGCCGCCC AGCCTCCACC AGAGCGAGTT GCTGCAGGCG
CTGGGGATCC GGACCTTCAG GGCGCTGCGC GCGCTGCCAC GTGCCGGTCT GCAGCGCCGG
CTCGGCGTGG ACCTGGGCCG GGCGCTCGAC CGCTGCTATG GCGATGCCCC AGACCCGAGG
CCGTGGTTCG TGCCGCCCGA GCACTTCCTG GCTCGACGCG AACTGCTGCA GCGCGCCGAC
GAGGCTGCGG TGCTGATGGC CGCCGTCGAG GCTCTGCTGC CGGCACTGCG CGGTTGGCTG
CAGCTGCGCT GGCAGGCCGT CACGGTGCTG GCACTGAGAC TGCGCCATGA GCACGGCCGC
GAGCCCTGCC CCGACACCCG GCTGCGACTG CAGCTGTCGG CCCCGAGCCG CGACATCGCC
CAGCTCGCGC TGCTGTGGCG CGAGCGCCTG CAGCGCCATG TGCTCGCCGC GCCGGTCTAC
GAGCTCGCGC TCGAACTGGA AGCCGCCGTG CCGCACGGCG GCACGCCTGG CGAGCTGCTG
CCCGGACCAG GCCGGCAGGA CGGCGAACAC GCCGCGCTGC TTGACCGACT GACCGCGCGG
CTGGGGAGTG ATCACGTGCG GCGCTGGGTG CCGCGGGCCG ATCACCGACC CGAGCACGCG
CAGCGTGTCC GGTCGGTCGG CGAACCGCCC CCAGCGCCCT TTGTGACCAC GGCGCCCGCT
CCCACCGCCC CCCGACCACT CTGGTTGCTG CCGTTGCCGC TGCCGCTGGC CAGCGATGCC
CTGGGCCGGC CGCGGCATGG TGGGCCCCTG CGGGTGTGCT CGCGGGCCGA GCGCATCGAG
GCCGGCTGGT TCGACGGCGC GCTGGTGCGG CGCGATTACC ACGTGGCCGA GGGCCCCGAC
CACCGGCTGC GCTGGATCTA CCGCGAGCAC CCCGGCGACG CCTCGTCGGG GGGCTGGTTC
CTGCACGGCT GGTTCGGGTG A
 
Protein sequence
MTEQHMLLGA NRAAHALGVR AGQSVATALS LLPQLVVFPR DRPREAALVE RLALALAALT 
PHLSLIPDGV LLEVQSTLRL FGGIHALLHR AQALARTSGV QVRTGCAPTP GAAWLFATSG
LARRHALQAR SSAQQLDRLP VASLARLLPP SLHQSELLQA LGIRTFRALR ALPRAGLQRR
LGVDLGRALD RCYGDAPDPR PWFVPPEHFL ARRELLQRAD EAAVLMAAVE ALLPALRGWL
QLRWQAVTVL ALRLRHEHGR EPCPDTRLRL QLSAPSRDIA QLALLWRERL QRHVLAAPVY
ELALELEAAV PHGGTPGELL PGPGRQDGEH AALLDRLTAR LGSDHVRRWV PRADHRPEHA
QRVRSVGEPP PAPFVTTAPA PTAPRPLWLL PLPLPLASDA LGRPRHGGPL RVCSRAERIE
AGWFDGALVR RDYHVAEGPD HRLRWIYREH PGDASSGGWF LHGWFG