Gene Mpe_A1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1837 
Symbol 
ID4786776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1974078 
End bp1976213 
Gene Length2136 bp 
Protein Length711 aa 
Translation table11 
GC content68% 
IMG OID640090407 
Productbacteriophage replication protein 
Protein accessionYP_001021030 
Protein GI124267026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCAGC GATTCAGCCC GGCGCCGATG CGCCCAGGCG AAACCCTGGG CGAGTATCGA 
CAGCTCGACG GCGAAGCCCT CGCGCTGCGC CAGGACGAGC GCGACGCGCA GCAGCGCCTG
AGCACGCTGC CGGCGTCGTG GATAGATCGC CTGTACAACC GCTGGCGCCG CTGGCGTCTC
GCCGATCGGG CGCCAGCGAA TCTCGACTTG CTGCAGCGCG TGCGCAGCAT CCGCGCGGCG
ACGGCGGCCG GGCTCAATCC CGACGCGAAC GACACGGAGC TGTGCACGCT GGCCGACCTC
ACCGCGCGCG ACATGGAGCG CCGTCTCGGG CAGCGCGAGC AGATCACGCG GGCCACCATC
GCTCCGGCCG TCTCTCCGTG GTCGGTCGAT TGGGTCGAGC GCTGCGCCAC GCTCGCGGCC
TTCGCTGAAG CGCAGCACTG GCTGGAGCGC CGCGGCGTCG TGCATCGCGT GCGCGGTGCC
ATCGTGCCTT TCCTGCGCCG CGTCTGCTGC GCGCGCTGGT GGCGCCGCGT GCTGCGCAAG
GTGCACGCGC GGGCGGTCGA ATCGACGGCG CGCGCGATCG GCCTCGTGCA CAAGCGGGCA
GGGTGCTACG TGAGTCACGA CGGGCTCAAC CGCCGCACCG GCCAGCGCGT GCGCAACGAA
CGGGCGCTGG AGTCGGTCGC GGCCATCAAT GAGCACGGGC AGGCCTACAC CCTCGCCGAG
CTGGCGGCCC GTGGCCCAGC GAACCGCGAG ATTCGTCGGC ATGAGCTGAT GACGCGCATT
GCGGGGTTTG AGCTGATCGC GAAGGACTGC GACGACGAAG CGTTTTTCGT CACGGTCACG
TGTCCATCGC GCATGCATGC CTATCGAACC AAGGGCGATG GGTACGGCGT GGAGCGCAAC
CCAAAGCACG ACGGCACAAC GCCGGACGAG GCTCAGCGCC ACCTGTCGAA GCAGTGGCAA
AAGTGCCGCT CGGCGGCCGA TCGCGCTGGG CTACAGTGGT ACGGATTCAG GATCGCAGAG
CCCAACCACG ATGGAACTCC ACATTGGCAT TTCTTGTTGT TCTTCCCCAA GATGGCGATA
CCTGGGGGAA GTGGTGCCGC CGTTCGTGGG CGCGACGGTG GATGTGTCGC TGACGGCTCA
CCCGCCGCGC GATCCGCTCT AGCCTCCAGC GCCCGAGCTG GTCTCGCGCG GCCTGGATGT
AGGGTGGCGG TGCGGCTCTT GCGGCGCTAT TTTCTGTGGC AAGCTGACCC GGCTGAGAGA
GGGGCCCGGA AGCATCGTGT TGAAGTAGTC AAGATCGATT GGACCCAGGG CTCGGCGGCC
GGCTACATCG CAAAGTACGT CGCGAAGAAC ATCGACGGAT ACAAGGTCGA GAAAGACCTG
TACGGGAACG ATGCGCTCAC CAGCTCGCGC CGCGTTGATG CCTGGGCCTC GACGTGGCGC
ATTCGACAGT TTCAACAGAT CGGCGGTGCG CCGGTCGGCA TCTGGCGCGA GCTGCGTCGC
CTGCACCCAG ACCAAGCCCA GGCCGCGGCC GGCGTCGCCT TCATGCTCGA TGCGGTCAAC
GTCACCAGCG GCGCCGAGAA GATCGACGAA GCGCACGACA TTGAGCGGCG CGAGACCGCG
GCGCACGGCT GGGCCGCGTA CACAGAGCTG CAGGGCGGCC CGCGCGTGCC GCGGCGCTCG
CTGCGCGTGC GCCTGCTGCG CGAGCAGACC GGCGAAGTAG GGCGCTACGG CGAACTCATG
GCGCCGCGGT CGATCGGCGT CGAGACAACA GAGATTCGCC TAGAGCGCGT GCCGGCCTTC
GGCATTGTCA AGGCCTTCGA TCGTCGGCGG ACCGTGCTCG CGGAAGTGGA GAGCGAGCGG
TGCGCCTGGA TGATCGTGCC CAAGGGAACC GAGGGCGCAG CCGCATTGCG GTTTGCATTG
CCGGGCGGCG AAGCCGCGCG GCCTTGGAGT CCTGTCAATA ACTGTACGCG GCCGGACCTA
TACAAGCGTC ATCCGGCGGC GCTGTTCGGG CCTTCGATAG AGCGCACGCG CAAGCGCGGG
CGTTGGCATG CATGGGACCG GGGTCGGTCA CACCCAAAGG AAGCCGACCA TGAGCGAAAC
CACCCCCCAC CGCAGCCCGA AGATCGTCGC GGCTGA
 
Protein sequence
MRQRFSPAPM RPGETLGEYR QLDGEALALR QDERDAQQRL STLPASWIDR LYNRWRRWRL 
ADRAPANLDL LQRVRSIRAA TAAGLNPDAN DTELCTLADL TARDMERRLG QREQITRATI
APAVSPWSVD WVERCATLAA FAEAQHWLER RGVVHRVRGA IVPFLRRVCC ARWWRRVLRK
VHARAVESTA RAIGLVHKRA GCYVSHDGLN RRTGQRVRNE RALESVAAIN EHGQAYTLAE
LAARGPANRE IRRHELMTRI AGFELIAKDC DDEAFFVTVT CPSRMHAYRT KGDGYGVERN
PKHDGTTPDE AQRHLSKQWQ KCRSAADRAG LQWYGFRIAE PNHDGTPHWH FLLFFPKMAI
PGGSGAAVRG RDGGCVADGS PAARSALASS ARAGLARPGC RVAVRLLRRY FLWQADPAER
GARKHRVEVV KIDWTQGSAA GYIAKYVAKN IDGYKVEKDL YGNDALTSSR RVDAWASTWR
IRQFQQIGGA PVGIWRELRR LHPDQAQAAA GVAFMLDAVN VTSGAEKIDE AHDIERRETA
AHGWAAYTEL QGGPRVPRRS LRVRLLREQT GEVGRYGELM APRSIGVETT EIRLERVPAF
GIVKAFDRRR TVLAEVESER CAWMIVPKGT EGAAALRFAL PGGEAARPWS PVNNCTRPDL
YKRHPAALFG PSIERTRKRG RWHAWDRGRS HPKEADHERN HPPPQPEDRR G