Gene Mpe_A1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1087 
Symbol 
ID4783690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1161610 
End bp1162818 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content67% 
IMG OID640089649 
Productprophage CP4-like integrase 
Protein accessionYP_001020283 
Protein GI124266279 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0686158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.733302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGA CTGACTTGAA ACTGCGGACG CTGACTCAGT CGGGAAAGCA CTTCGATGGC 
GGCGGGCTCT ACCTGGAGGT GACGGCCGCG GGCGGTCGTT ACTGGCGCAT GAAGTACCGC
CATGGTGGGA AAGAGAAGCG TCTGGCGTTC GGCGTTTATC CAGAGGTCAC GCTGCGTGCC
GCGCGCGATC GCCGCGACGA AGCCCGCAGG GTGCTCGACC AGGGCGGCGA TCCGGGCGAG
CTGCGCAAGG CAGCCAAGGC GCAGGCCGCG CACGAGGCGT CCAACACGTT CGAGGCCGTG
GCGAGGGACT GGCTCACGCA CCAGGCCGAT AGCTGGGAGG CCGTCACCCT GGCTCGCATC
AAGGCGGCTT TCAAGGCGGA CGTGTTCCCG CAGCTCGGCG CGCGGCCCAT GGCGCAGATC
AAGCCGCGCG AGGTGGCGAC CGTCGTCAAG GCGATCGAGG CGCGTGGAGC TGGCGACATG
GCGGCGCGCG TGCTGCAGCG GATCCGGGCC GTCTTCCGAT TCGCCGTGGT GCATGAGCGC
ATCGACTCCA ATCCGATGCT TGACCTGCAG CCCGGCGAGC TGCTGAAGCC GCGCCAGGTG
CGGCACCGCG CCGCGCTGGC CGATCGTGAT CTGCCGGTGT TTCTGGAGAA GCTGGCGGCC
TATGACGGCG ACGTATCCAC CTCGGCAGCC CTGCGACTGC TGATGCTCAC CGCCGTCCGA
CCTGGCGAGC TGCGCGGCGC GCGGTGGGAC GAGATCGACA TGGATGCAGC CGAGTGGCGC
ATTCCAGCCG AGCGCATGAA GATGCGCTCC CCTCACGTGG TTCCGCTGTC TCGGCAAGCG
CTCGATGTGC TCCAGTTGAT GCAGCCGCTC AGCGGCGAGC GCGAGCTGGT GTTCCCAAGT
CCCTACTACC CGGGCAAGCC GCTGAGCGAA AACACGCTGA ACAGTGCGCT GGCACGCATG
GGCTACAAGG GCCTCGCCAC GGCACATGGC TTCCGGGCGC TGTTCTCGAC GGTGGCCAAT
GAGTCGGGCC ATTCACCCGA CGTGATCGAG CGCCAGCTCG CGCACGTGGA GCGCAATGCG
GTGCGAGCCG CCTATCACCG CTCGACCTAC CTGAAGGATC GTGCGCAGCT AATGCAGTGG
TGGGCCGACT ACCTTGATGG TCGACGCAGC GGCAAGGTGG TCCCGCTGTC ATCTGCTCGC
GTGGCCTGA
 
Protein sequence
MKLTDLKLRT LTQSGKHFDG GGLYLEVTAA GGRYWRMKYR HGGKEKRLAF GVYPEVTLRA 
ARDRRDEARR VLDQGGDPGE LRKAAKAQAA HEASNTFEAV ARDWLTHQAD SWEAVTLARI
KAAFKADVFP QLGARPMAQI KPREVATVVK AIEARGAGDM AARVLQRIRA VFRFAVVHER
IDSNPMLDLQ PGELLKPRQV RHRAALADRD LPVFLEKLAA YDGDVSTSAA LRLLMLTAVR
PGELRGARWD EIDMDAAEWR IPAERMKMRS PHVVPLSRQA LDVLQLMQPL SGERELVFPS
PYYPGKPLSE NTLNSALARM GYKGLATAHG FRALFSTVAN ESGHSPDVIE RQLAHVERNA
VRAAYHRSTY LKDRAQLMQW WADYLDGRRS GKVVPLSSAR VA