Gene Mpe_A1623 details

Gene Information

Locus tag: Mpe_A1623
Symbol:
ID: 4787247
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1751455
End bp: 1752855
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 67%
IMG OID: 640090191
Product: integrase or site-specific recombinase
Protein accession: YP_001020820
Protein GI: 124266816
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
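
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this non-spliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1751455
end_bp = 1752855
gene_length_bp = 1401
protein_length_aa = 466

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("expected protein length (aa):", expected_protein_length)
print("matches record:", span_bp == gene_length_bp
      and expected_protein_length == protein_length_aa)
```

Under these assumptions the span is 1401 bp, which is 467 codons: 466 residues plus one stop codon, in agreement with the listed gene and protein lengths.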


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.262588
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTCG TGCGACGCCT GGACGAAGCT GCGCAAGCCA TCCCCTTGCG GGCGCGATCA 
CTCACGGATC TCCAGCTCAA GGCCCTGAAG CCGCGGGACA AGGCCTACAA GGTCAGCGAC
CGGGACGGGC TGTACGCCTA CGTGGCGCCG TCGGGGACCG TCAGCCTGCG GTACGACTAC
CGCATCGGCA GGCGCCGCGA GACCCTGACC CTCGGTCGCT ACGACGCCAC CGCTCCGGCG
CGGGTGCCTC GGTCCCTGGA CGTGCTCGAG TACGGCATGG GCCTGTCCCT GGCCGAGGCG
CGACTGCTGC TGACCAAGGC GAAGCGCGCG CTCGAGCAAG GCGTGTCGCC TTCGCGCGCG
AAGGCCGAGC AGAAGGCGGC GGAATCCGAC GCCCTCACGT TCGGCAAGTG GGCCGAGCGC
TACTTCGAGT TCAAGGGTGA TCCCAAGAGC AAGGGCGAGC AGCTGGCCGA CAGTACGCTC
GCCATGCGTC GCTCAACCTA CAAGCGCGCG CTGGAGAAGC CCCTGGGCAA ACTGATGCTG
GAGGAGATCA CACCCAACCG GCTGGCGGCC CTGTGCGACG ACATCAAGGC GCAGCGTGGC
CCGGCGGTGG CGGTGCACGC CCGCGAGATC GTCCTGATGG TCTACCGACA CGTCCAGCGC
AAGGGGATCG AAGTGCCGAA CCCGGCCGAA CGGGTGCAGG CGAGCGCCAT CGCCCGCTTC
GAGCCGCGCG ATCGTGCGCT GTCTCCGAGC GAGCTGCGCC TGTTCCTCGC GGCGCTGGAC
CAGTGCGCGA CGATGCCGAC GCTGCGGCTG GCCGTGCGCT TTGTGCTGCT GACCGGCGTT
CGCAAGGGCG AGTTCATCGG TGCGACGTGG GACGAGATCG TCTTCGACAC CGAGACCTGG
ACGATTCCGT CGCGCCGGAT GAAGGGTGGC AGGGCCCACG TGGTCTATCT CAGCGACCAG
GCGATGGACA TCCTGACGAC GCTGCGGTCG TGCTTCTCGG CAAGCCGCTA CCTTCATCCC
GGCCGGTATG ACAGCGACCT GCCGATCAGC GACGCGACCC TGAACCGGGT CATCGCGATG
GCGATCCGTG GCATTCAGGC GACCGCTCCG GAGTTTCAGC CGTTCACCGT GCACGACCTG
AGGCGGACCT TCAGCACCTC GCTGAACCGG GCCAAGTTCG ACGAGCGCTG GATCGAGATG
GCGCTGGCGC ACGTGCCCCG GAACCGCATC GCCGCGACGT ACAACGTGGC CCGCTATGCG
GCCGAGCGCC GGATCATGAT GCAGGCCTGG GCCGACATGC TCGACCTTTG GGAGAAGGGC
GAGTCGGCCA AGGAAGTGAT CTTGAAGGCG AAGCAGGCAG CCTCCGAGGT GACCGACTTC
GAGTTGGAAG ACGATCTTTG A
 
Protein sequence
MSVVRRLDEA AQAIPLRARS LTDLQLKALK PRDKAYKVSD RDGLYAYVAP SGTVSLRYDY 
RIGRRRETLT LGRYDATAPA RVPRSLDVLE YGMGLSLAEA RLLLTKAKRA LEQGVSPSRA
KAEQKAAESD ALTFGKWAER YFEFKGDPKS KGEQLADSTL AMRRSTYKRA LEKPLGKLML
EEITPNRLAA LCDDIKAQRG PAVAVHAREI VLMVYRHVQR KGIEVPNPAE RVQASAIARF
EPRDRALSPS ELRLFLAALD QCATMPTLRL AVRFVLLTGV RKGEFIGATW DEIVFDTETW
TIPSRRMKGG RAHVVYLSDQ AMDILTTLRS CFSASRYLHP GRYDSDLPIS DATLNRVIAM
AIRGIQATAP EFQPFTVHDL RRTFSTSLNR AKFDERWIEM ALAHVPRNRI AATYNVARYA
AERRIMMQAW ADMLDLWEKG ESAKEVILKA KQAASEVTDF ELEDDL
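
The sequences above are printed in blocks of ten characters, so the spaces must be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. It assumes the sense codon assignments of translation table 11 match the standard genetic code (the two tables differ in their permitted start codons); the function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these codon-to-residue assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from above.
first_line = "ATGTCAGTCG TGCGACGCCT GGACGAAGCT GCGCAAGCCA TCCCCTTGCG GGCGCGATCA"
last_line = "GAGTTGGAAG ACGATCTTTG A"

print(translate(first_line))
print(translate(last_line))
```

The first line translates to MSVVRRLDEAAQAIPLRARS and the last line, which begins in frame at base 1381, translates to ELEDDL before the TGA stop codon. Both agree with the start and end of the protein sequence above. Passing the full gene sequence to `gc_percent` would allow the listed GC content to be checked in the same way.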