Gene Mpe_A0941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0941 
Symbol 
ID4787323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp993992 
End bp995317 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content64% 
IMG OID640089502 
Producthypothetical protein 
Protein accessionYP_001020138 
Protein GI124266134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0564159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCG GAAAGAACAT GACCAACCGA CGCAGCGTGC TGCGCACCGG CTTCGGTGGT 
GCCGTGGCCC TGGCCGCCGC CCTGGTGGCC ACCCCCGTGA TGGCGAAGGA GCTCGAGGCG
GGCTTCGTCA TCAACAAGGG CAACTTCGAC AGCATCAAGA ACGACACCTT CGAGGGCAAG
ACGATTGCCA GCATGGTGCC GGAGAAGCTG GAATGGATGA TCAGGAACTA CGACCTGACC
ATCAAGCTGG CCAACTCGAA GAAGATCACG ATGGACCCGA AGTACGTCCA GGCGACCAAG
GACGGCGCCA AGGACGTGAA GTTCAACCCG GCCGACCGCA CCGTGTCGGG CTGGAAGGCG
GGCATGATGT TCCCGCCCGA GTCCATCAAG ATGGACGACC CGCATGCGGG TGACAAGGTG
ATCTGGAACC TGCGCGCCGC CACCTACGGC GCCACGATGG ACCTGCGCGA CATCGCCTGG
GCCTTCCTCG ACGCGAAGAA GGGCTACGAG CGCGTGCAGG CCTTCCAGTC GCGGCGCTAC
TACATGGAGG GCCGTCTCGA CGGTGGTCCG GTCTCCGAAG GCGACGGCAC GATCGCGCAG
AAGACCTACT TCGTCGCGAC CTACCCGCAG GACATCCGGG GCCTGGGGCT CTTCTCGGTC
CGCTACAACC AGGCCGACTC GAAGAAGCCC GACGACTCCT ACGCCTACCT GAAGTCGGTG
CGCCGCACGC GCCGTCTCTC CGGCGGTGCG TGGATGGACC CGATCGGCGG CACCGACCAG
CTGTATGACG ACTGGGACAT CTGGGACGCC GCGCCCACCA AGTACAAGTC GAACAAGCTG
ATCGAGAAGC GCTGGGTGCT GGCCATCGCC CACAGCCCGG AGATGAGCGT CGACGTCAAG
CAGCCCTGGA CCGAGCCGGC CAAGCGCTTC CCGCGCATCG GCATCGACAT CCCGCCGCAC
TTCAACCCGG CGCCGGACAT CGGCTGGGAG CCGCGCGAGG TGTACGTGGT GGAGGGTACC
TGCCCCGACG AGCACCCGTA CAGCAAGAAG GTGGTGTACA TGGAAGTCGA CTTCCCGCGC
CCCTACCTCG GCTATGCGCT CGACCGCAAG GGCGAGTTCT GGAAGATGTT CATCTTCCAG
AACCGGCCTG ACGTCGGCGA CGACGGCTAC AAGGCGGTCA TGCCCGTCAT CGGCCACATC
ATCGACGTGA AGCGCGGCCA CGCCACCAAC TGGTCGTCGA ACATGAAGTC CAACCCCAAG
GGCGTCAAGG AAACCGACGT GGCGCTGAAC ATCCTGGAAG AGGTCGCGAC CGGCACCGGG
AAGTAG
 
Protein sequence
MKTGKNMTNR RSVLRTGFGG AVALAAALVA TPVMAKELEA GFVINKGNFD SIKNDTFEGK 
TIASMVPEKL EWMIRNYDLT IKLANSKKIT MDPKYVQATK DGAKDVKFNP ADRTVSGWKA
GMMFPPESIK MDDPHAGDKV IWNLRAATYG ATMDLRDIAW AFLDAKKGYE RVQAFQSRRY
YMEGRLDGGP VSEGDGTIAQ KTYFVATYPQ DIRGLGLFSV RYNQADSKKP DDSYAYLKSV
RRTRRLSGGA WMDPIGGTDQ LYDDWDIWDA APTKYKSNKL IEKRWVLAIA HSPEMSVDVK
QPWTEPAKRF PRIGIDIPPH FNPAPDIGWE PREVYVVEGT CPDEHPYSKK VVYMEVDFPR
PYLGYALDRK GEFWKMFIFQ NRPDVGDDGY KAVMPVIGHI IDVKRGHATN WSSNMKSNPK
GVKETDVALN ILEEVATGTG K