Gene Mpe_A3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A3335 
Symbol 
ID4786434 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3541424 
End bp3543145 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content76% 
IMG OID640091908 
Productputative DNA repair protein 
Protein accessionYP_001022523 
Protein GI124268519 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGC GCATCGCCCT GCGCGACTTC GTCATCGTGC CCGCGCTCGA GCTCGACCTG 
CAGGCCGGGT TCACCGCGCT CACCGGCGAG ACCGGTGCCG GCAAGTCCAT CCTGGTCGAT
GCGCTGCAGC TCGCGCTCGG CCACCGCGGC GACGCCGGCG TGGTGCGCGA GGGCGCGACG
CGCGCCGAGA TCAGCGCCGA GTTCGACACG CCGGCCTCGC TGCGGCCCTG GCTGGCCGAT
GCCGGCTTCG CCGATGAAGA GGACGGTGGC GACGACCCCC TGCTGCTGCG CCGCACGCTC
GACGCCCAGG GCAAGAGCCG TGCGTGGATC AACGGCAGCC CGGCCACCAT CGGCCAGCTG
CGCGAGGCGG GCGAGCACCT GGTCGACATC CACGGGCAGC ACGCCTGGCA GAGCCTGACG
CGGCCCGACG CGGTGCGCGG CCTGCTCGAC GGCTATGCCG GCCTCGACAC CGCGCCGCTG
GCCGCCGCCT GGAGCGCGTG GCGCAGCGCA AGCGAACAGC TCGCCGACGC GCGCAGCCGC
CAGGCCGACC TGGAACGCGA GCGCGAGCGG CTGCAATGGC AGATCGCCGA GCTCGACAAG
CTGGCCCCCG GCGCCGACGA ATGGCCGGAG CTCAACGCCG AGCACCACCG CCTGAGCCAC
GCGCAGGCCA TCATCGACGC GGTGCAGGCG GCGCTGGCCG CGATCAGCGA GGACGACGAC
TCGGCCGACG CGCGCAGCGC CCAGGCGATC GGTGCGCTCG AAGGTGTGCT CGACCACGAC
CCGGCGCTGC GCGATCCCTT CGAGGTGCTG AAGAGCGCGC AGGCCCAGCT GCAGGACGCC
GCCCACACGC TGCACGCCAC GCTGCGCCAC GGCGAGCTGG ACCCGGAGCG CCTGCAGGCG
CTCGACGAGC GGCTGGCGGC GTGGATGGGC CTCGCGCGGC GCTTCCGCCG GCCGGCCGAG
GAATTGCCGG CACTGCGGGC CGCCTGGCGC GACGAGCTCA CCCGGCTCGA CGCCGCGACC
GACCTCGACG CGCTGGAGGC CCGGGTGCAG ACCACGCGCC AGGCCTACGA CACCGCGGCG
CAGGCCGTCA GCTCGCAACG CAAGGCCGCG GCACCGCGGC TCGCCGCGTC GGTGACGGCG
GCGATGCAGC AGCTCGGCAT GGCCGGCGGG CGCTTCGAGG TGGCGCTGGA GCCGTTGGCC
GAGCCGCAGC GGCACGGCCG CGAGGCGGCA GAGTTTCGCG TGGCGGGGCA CGCCGGCAGC
ACGCCGCGCG CGCTGGCCAA GGTGGCCTCG GGCGGCGAGC TGTCGCGCAT CGCGCTGGCG
ATCGCGGTCA CCACCAGCGA GCTCGGCGAG ACCGGCACGC TGATCTTCGA CGAGATCGAC
GCCGGCGTCG GCGGCAGCGT GGCCGACGCC GTGGGGCAGT TGATGCGCCG CCTGGGCCGC
GACCGGCAGG TGCTGTGCGT GACCCACCTG CCGCAGGTGG CCGCCTGCGC CGACCACCAC
TGCGTGGTCA GCAAGGCCGC GCAGGGCGGC AGCACGCGCA GCCAGGTCGA CCCGGTCACC
GGCGAGGCGC GGGTGCAGGA GATCGCCCGC ATGCTGGGCG GCGGCGCGGC CGGCGGCACC
AGCCTGGCCC ATGCGCAGGC GCTGCTGGCG GCCGCGACAT CGGCCCCGCC GGCGCCTCCG
GCCGCGGCCA GTGGTGGTGG ACGCAAGCGG CAGCGGGCAT GA
 
Protein sequence
MLKRIALRDF VIVPALELDL QAGFTALTGE TGAGKSILVD ALQLALGHRG DAGVVREGAT 
RAEISAEFDT PASLRPWLAD AGFADEEDGG DDPLLLRRTL DAQGKSRAWI NGSPATIGQL
REAGEHLVDI HGQHAWQSLT RPDAVRGLLD GYAGLDTAPL AAAWSAWRSA SEQLADARSR
QADLERERER LQWQIAELDK LAPGADEWPE LNAEHHRLSH AQAIIDAVQA ALAAISEDDD
SADARSAQAI GALEGVLDHD PALRDPFEVL KSAQAQLQDA AHTLHATLRH GELDPERLQA
LDERLAAWMG LARRFRRPAE ELPALRAAWR DELTRLDAAT DLDALEARVQ TTRQAYDTAA
QAVSSQRKAA APRLAASVTA AMQQLGMAGG RFEVALEPLA EPQRHGREAA EFRVAGHAGS
TPRALAKVAS GGELSRIALA IAVTTSELGE TGTLIFDEID AGVGGSVADA VGQLMRRLGR
DRQVLCVTHL PQVAACADHH CVVSKAAQGG STRSQVDPVT GEARVQEIAR MLGGGAAGGT
SLAHAQALLA AATSAPPAPP AAASGGGRKR QRA