Gene Mpe_A2009 details


Gene Information

Locus tag: Mpe_A2009
Symbol:
ID: 4783796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2151800
End bp: 2152897
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 67%
IMG OID: 640090579
Product: hypothetical protein
Protein accession: YP_001021202
Protein GI: 124267198
COG category: [R] General function prediction only
COG ID: [COG1485] Predicted ATPase
TIGRFAM ID:
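
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 2151800
end_bp = 2152897

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1098 bp) and Protein Length (365 aa).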


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.314691
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTCC GCCAACTGTT CGGTGAGACG CTTTCCGAGC GCGGCTACCG GGCCGACGAG 
GCGCAGTTGC GCGCGCTCGA CGCGCTGGAG CGCTGCGAGA ACGAGTGGAT CGACTACAAG
GCACGCCGCA GCAATGCGGT GAGCAAGCTG CTGCGCCGGC CACCGATTCC TCGCGGCGTC
TACATGTACG GCGGTGTCGG GCGCGGCAAG AGCTTCCTGA TGGACTGCTT CTTCCAGTCG
GTGCCGCTGG TGCGCAAGAC GCGGCTGCAC TTCCACGAGT TCATGCGCGA GGTGCATCGC
GAATTGCAGG AGCTCAAGGG CACGGCCGAC CCGCTGGACG AACTGGGCAG CCGCATCGCG
CGGCGCTTCC GGTTGATCTG CTTCGACGAG TTCCACGTCG CCGACGTGAC CGACGCGATG
ATCCTGCACC GCCTGCTGGC GGCACTGTTT GCCAACCGCG TCAGCATCGT CACGACGTCC
AACTTCCACC CCGACGCGCT CTATCCCAAT GGCCTGCATC GCGACCGGAT CCTGCCGGCG
ATCGAACTGC TCAAGGACAG GCTGGAGGTG ATCAATGTCG ACGCCGGGGT CGACTACCGC
CAGCGCACGC TGGAGGACGT GGCGCTCTAC CACACACCGC TCGGACCGGA GGCCGACGGA
GCGCTGACCG AGACCTTCGA GCGCCTCGCC GAGGCCAAGG ACGAGGATCC GGTGCTGAAC
ATCGAGCAGC GGACGATCCG TGCGCGCCGG CGCGCCGGGG GGGTGGTGTG GTTCGACTTC
AAGACCCTGT GCGGCGGCCC GCGCTCGCAG AACGACTACC TCGAACTGGC CTCGCAGTTC
CATACCGTGC TGCTGTCCGA CGTGCCCGAG ATGCCGCCCC GGCTGGCGTC CGAGGCGCGG
CGCTTCACGT GGCTGGTCGA CGTGCTCTAC GATCGGCGCG TGAAACTCGT GATATCCGCC
GCCGTGCCTC CCGAACAGCT CTACACCGAC GGGCCGCTGG CCCATGAATT TCCGCGCACC
GTGTCTCGCT TGACCGAGAT GCAGTCGGCC GAGTTTCTGG CGCTGTCGCG GCGAGATGTC
GATACGAGCT TGACGTGA
 
Protein sequence
MTVRQLFGET LSERGYRADE AQLRALDALE RCENEWIDYK ARRSNAVSKL LRRPPIPRGV 
YMYGGVGRGK SFLMDCFFQS VPLVRKTRLH FHEFMREVHR ELQELKGTAD PLDELGSRIA
RRFRLICFDE FHVADVTDAM ILHRLLAALF ANRVSIVTTS NFHPDALYPN GLHRDRILPA
IELLKDRLEV INVDAGVDYR QRTLEDVALY HTPLGPEADG ALTETFERLA EAKDEDPVLN
IEQRTIRARR RAGGVVWFDF KTLCGGPRSQ NDYLELASQF HTVLLSDVPE MPPRLASEAR
RFTWLVDVLY DRRVKLVISA AVPPEQLYTD GPLAHEFPRT VSRLTEMQSA EFLALSRRDV
DTSLT
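
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a rough Python sketch of how one might do that and then check basic properties such as GC fraction and the presence of a terminal stop codon. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical and not part of any database tool; the stop codon set is the one used by translation table 11.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(seq):
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Example: the final line of the gene sequence as printed in this record.
tail = clean_sequence("GATACGAGCT TGACGTGA")
print(len(tail), ends_with_stop(tail), gc_fraction(tail))
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way would let a reader compare the result against the GC content listed in the record.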