Gene Mpe_A2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2048 
Symbol 
ID4784625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2189101 
End bp2190225 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content74% 
IMG OID640090618 
Producthypothetical protein 
Protein accessionYP_001021241 
Protein GI124267237 
COG category[R] General function prediction only 
COG ID[COG5621] Predicted secreted hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.356094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGACA AGCCCCTGCC GCTGTCGCGC CGCGCGCTGC TGCTGGCGTG TGCGCCGGGT 
TTCGGGCTGG GCGTGGGCAC CGCACGCGCG GCGCCGGTGG TGCGGGCCGG CACGCCCATC
GTGCTGCCGC GCGACTTCGG CTCGCACCCC GAGTACCGCA CCGAATGGTG GTATGTGACC
GGCTGGCTCG AGGCCGCCGC AACGCCCGAA CCCTTCGGCT TCCAGATCAC CTTCTTCCGG
TCGCGCACCG ACGTGGCGGC CGCCGACCAC CCGAGCCGCT TCGCCGCGCG GCAGTTGCTG
TTCGCGCATG CCGCGCTCAC CGATCCGCTG GCCGGCCGGC TGCGCCACGA CCAACGCATC
GCCCGCGCCG GCTTCGACAT CGCCGAGGCC GCCACCACCG ACACCGACGT GCGGCTGCGC
GGGTGGCAAC TGGTACGCGA CGGCAAATCC TCTCCGAGCG CGCACCACTA CCGCGCCGTG
GTCGATGCCG GCACGCAGGG CTTCTCCTTC GACCTGCGCC TCGACGCGAC GCAGCCGGTG
CTGCTGCAGG GCGACGCCGG CTATTCGCGC AAGGGCCCGG CGCCCGAACA GGCGAGCCAC
TACCTCAGCG AGCCGCAGCT CGCCGTGGGC GGCACGCTGC GCGTCGAGGG CCGCGCGCTC
GCGGTCAGCG GCCGCGCCTG GCTCGACCAC GAATGGAGCG AGGCGCTGAT GCACCCCGAC
GCCGTGGGCT GGGACTGGGT CGGCATGAAC CTCGACGACG GCAGCGCCCT CACCGCCTTC
CGGCTGCGCC GCGCCGACGG CTCGGCGCTG TGGGCCGGCG GCAGCTTCCG GCCGCGCGAC
GGCGCGGTGC GCGCGTTCGG GCCCGACGAG GTGCGCTTCA CGCCACTGCG GCGCTGGCGC
AGCCCGGCCA CGCAGGCCGA GTACGCGGTG GCATGGCAGC TCGACACACC CGCCGGCACG
CACCAGGTGC GCGCCCGCCT GGACGCGCAG GAACTCGACA GCCGCGGCTC GACCGGCTCG
GTCTACTGGG AGGGCCTGAG CGACCTGCTC GACGCGCGCA GCGGCACGCG CATCGGCCGT
GGCTACCTGG AAATGACCGG CTATGCGAGC CCGCTGAAGC TGTAG
 
Protein sequence
MNDKPLPLSR RALLLACAPG FGLGVGTARA APVVRAGTPI VLPRDFGSHP EYRTEWWYVT 
GWLEAAATPE PFGFQITFFR SRTDVAAADH PSRFAARQLL FAHAALTDPL AGRLRHDQRI
ARAGFDIAEA ATTDTDVRLR GWQLVRDGKS SPSAHHYRAV VDAGTQGFSF DLRLDATQPV
LLQGDAGYSR KGPAPEQASH YLSEPQLAVG GTLRVEGRAL AVSGRAWLDH EWSEALMHPD
AVGWDWVGMN LDDGSALTAF RLRRADGSAL WAGGSFRPRD GAVRAFGPDE VRFTPLRRWR
SPATQAEYAV AWQLDTPAGT HQVRARLDAQ ELDSRGSTGS VYWEGLSDLL DARSGTRIGR
GYLEMTGYAS PLKL