Gene Mpe_A1192 details


Gene Information

Locus tag: Mpe_A1192
Symbol:
ID: 4787038
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1289040
End bp: 1290605
Gene Length: 1566 bp
Protein Length: 521 aa
Translation table: 11
GC content: 65%
IMG OID: 640089757
Product: putative recombinase
Protein accession: YP_001020390
Protein GI: 124266386
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
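
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not say so, but the listed gene length is consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count implied by a CDS length, optionally excluding the stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


gene_len = span_length(1289040, 1290605)
print(gene_len)                  # 1566, matching "Gene Length"
print(protein_length(gene_len))  # 521, matching "Protein Length"
```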


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAATCGG ACGTCCCACG AACCGAAGGG CAGGAGGTGC CCATCCGCAT GCGCGCGGCC 
GAGTACGTGC GCATGTCGAC CGAGCACCAG CAGTACTCCA CCGAGAACCA GGGCGACAAG
ATCCGCGAGT ACGCCGCGCG GCGCGGCATC GAGATCGTTG CGACCTACGC AGACGAGGGT
AAGAGCGGGC TGCGCATCGA CGGACGCCAA GCGCTGCAGC GCCTGATTCA TGACGTGGAG
AACAAGCGCG CCGACTTCCA GATCATCCTC GTCTACGACG TCAGCCGCTG GGGACGCTTC
CAGGACGCCG ACGAGAGCGC GTACTACGAG TACATCTGCC GGCGTGCTGG TATCCAAGTC
GCCTACTGCG CCGAACAGTT CGAGAACGAC GGCTCGCCGG TGTCGACGAT CGTCAAGGGT
GTCAAGCGGG CGATGGCTGG CGAGTACAGC CGCGAGCTCT CGGCAAAAGT GTTTGCTGGC
CAGTGCCGAC TGATCGAGCT GGGTTTTCGT CAGGGCGGGC CAGCCGGGTA CGGCCTGCGC
CGCACGCTGG TTGATGTACA GGGGGCGGCT AAGACGGAGC TCTCCCGCGG GGAGCAGAAG
AGCCTGCAGA CCGACCGCGT CATCCTGACG CCCGGCCCCG AAGAAGAGGT TCGGATCGTC
AACCAGATCT ACCGCTGGTT CATTGACGAC GGTCTGGTTG AATCGGAGAT CGCTGGCCGG
CTCAACGGCA TGCGCTTGCG CACGGACCTC GGCCGCGAGT GGACCCGGGC AACGGTGCAC
GAGGTGCTGA CCAACGAGAA GTACGTCGGC CACAACATCT ACAACCGGGT CTCGTTCAAG
CTGAAGAAGG TGCGGGTGGT GAATCCGCCC GACATGTGGA TCCGGCGGGA GGGCGCGTTC
ACCGGCATCG TGCCGCCCGA CGTGTTCTAC ACCGCCCAGG GGATCATCCG GGCACGGGCG
CGTCGCTACA CCGACGAGGA GCTGATCGAA CGGTTGCGCG GTCTGTATCG CAACCGCGGC
TTCCTGTCCG GACTGGTGAT CAACGAGGCC GAAGGCATGC CCTGCGCCGC TGTCTATGCG
CACCGCTTCG GCAGCCTCGT CCGTGCCTAT CAGCTGGTGG GCTATACACC CGACCGCGAC
TACCGATACG TGGAGATCAA CCGCCAGCTG CGCCGCATGC ACCCTGACCT GGTGCAGCAG
ATCGAGCGCG AGATCGCCAA CCTCGGCGGC TCCTTCGAGC GAGACCCGGC CACCGACGTG
CTGTGGGTGA ACCGCGAGTT CACGGTGTCC ATCGTCCTTG CACGGTGCCA CGCCGTCGAC
AGTGGCCACA ACCGTTGGAA GATTCGACTC GACACCAGCC TCGCGCCGGA CATCTCCGTG
GCGGCCCGGC TCGATGACGG CAACCAGGCA GTGCGGGACT TCTACCTCCT CCCGCGGGTT
GACTTCGGCC CGTCGCGCAT CAGCCTTGCC GAGCAGAACA CCGCTGAGCT CGAGAGTTAT
CGCTTCGAGA CGCTCGACTA CCTCTATGGA ATGGCCGCCC GGGCGCGGCT GCGGGTGGCG
GCGTGA
 
Protein sequence
MQSDVPRTEG QEVPIRMRAA EYVRMSTEHQ QYSTENQGDK IREYAARRGI EIVATYADEG 
KSGLRIDGRQ ALQRLIHDVE NKRADFQIIL VYDVSRWGRF QDADESAYYE YICRRAGIQV
AYCAEQFEND GSPVSTIVKG VKRAMAGEYS RELSAKVFAG QCRLIELGFR QGGPAGYGLR
RTLVDVQGAA KTELSRGEQK SLQTDRVILT PGPEEEVRIV NQIYRWFIDD GLVESEIAGR
LNGMRLRTDL GREWTRATVH EVLTNEKYVG HNIYNRVSFK LKKVRVVNPP DMWIRREGAF
TGIVPPDVFY TAQGIIRARA RRYTDEELIE RLRGLYRNRG FLSGLVINEA EGMPCAAVYA
HRFGSLVRAY QLVGYTPDRD YRYVEINRQL RRMHPDLVQQ IEREIANLGG SFERDPATDV
LWVNREFTVS IVLARCHAVD SGHNRWKIRL DTSLAPDISV AARLDDGNQA VRDFYLLPRV
DFGPSRISLA EQNTAELESY RFETLDYLYG MAARARLRVA A
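
Both sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any downstream use. The following is a minimal sketch, assuming the text has been copied into a Python string; `clean_sequence` and `gc_fraction` are hypothetical helper names. It strips the layout whitespace and computes the G+C fraction of whatever nucleotide string it is given.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Only the first displayed line of the gene sequence is used here, so the
# result is not expected to equal the whole-gene GC content in the record.
first_line = "ATGCAATCGG ACGTCCCACG AACCGAAGGG CAGGAGGTGC CCATCCGCAT GCGCGCGGCC "
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Pasting the full gene sequence in place of `first_line` should allow the result to be compared against the "Gene Length" and "GC content" fields.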