Gene Mpe_A2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2229 
Symbol 
ID4785361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2383606 
End bp2384655 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content66% 
IMG OID640090797 
Producthypothetical protein 
Protein accessionYP_001021420 
Protein GI124267416 
COG category[S] Function unknown 
COG ID[COG3181] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAATC CGGCGTCCGC CGACCACGAA GGAGACAAGA TGCAGACAGT CAAGAAGCTC 
TTCCTGCGCA CCGCGCTCGC GACGGCCGCG CTGGCCGGCC TAGGCCATGC GCCGCTGGCT
GCCGCCTGGG AACCCGTCAA GCCGATCGAG TTCGTCGTGC CGGCGGGCAC CGGTGGCGGT
GCCGACCAGA TGGCGCGCTT CATCCAGGGC GTGGCGGCGA AGAACAACCT GACCAAGCAG
CCGATCGTCG TGGTCAATCG TTCGGGCGGC GCTGGCGCGG AGGGCTTCCT CGCCGTGAAG
GAAGCGAAGG GCGATCCGCA CAAGATCATC ATCACGCTGT CGAACCTGTT CACCACGCCG
CTGGCCACCG GCGTGCCATT CAACTGGCGC GACCTGACGC CGGTGCAGAT GCTGGCGCTC
GATCAGTTCG TGCTGTGGGT CAACGAGGAG TCGCCTTACA AGACGGCCAA GGCCTACTTC
GACGCGGTGA AGGCCGCGCC GCCCGGCAGC GTGAAGATGG CCGGCACCGG CTCCAAGCAG
GAAGACCAGA TCATCACCGT GCTGCTGGAA AAGGCCGCCG GCAAGAAGAT CACCTACATC
CCCTTCAAGG GCGGCGGCGA CGTGGCGGTG CAACTGGTCG GCAAGCACGT CGACTCCACC
GTCAACAACC CGATCGAGGC CGAGTCGCAC TGGCGCGCCG GCAAGCTGCG GGCGCTGTGC
GTGTTCGACA AGCAGCCGAT GCCGTACAAG ACCAAGCTCA CAGCCACCCA GTCGTGGGCC
GATGTGCCGA CCTGCCCGGC GGCGGGCCTG CCGGTCGAGT ACGTGATGCT GCGCGGCATC
TTCATGCCGC CTGGCGTGTC GCAGGAGCAG GTGGCCTACT ACCTCGACCT GTTCAAGAAG
CTGCGCGCGC TGCCCGAGTG GCAGGAGTTC ATGGCCAAGG GCGCCTTCAA CCAGACGGCA
CTCACCGGCT CCGAATTCTT CGACTGGCTC GGCAAGACCG AGCAGATGCA CCGCGTCCTC
ATGAAGGAAG CGGGCTTCAT CGCGCAATAA
 
Protein sequence
MINPASADHE GDKMQTVKKL FLRTALATAA LAGLGHAPLA AAWEPVKPIE FVVPAGTGGG 
ADQMARFIQG VAAKNNLTKQ PIVVVNRSGG AGAEGFLAVK EAKGDPHKII ITLSNLFTTP
LATGVPFNWR DLTPVQMLAL DQFVLWVNEE SPYKTAKAYF DAVKAAPPGS VKMAGTGSKQ
EDQIITVLLE KAAGKKITYI PFKGGGDVAV QLVGKHVDST VNNPIEAESH WRAGKLRALC
VFDKQPMPYK TKLTATQSWA DVPTCPAAGL PVEYVMLRGI FMPPGVSQEQ VAYYLDLFKK
LRALPEWQEF MAKGAFNQTA LTGSEFFDWL GKTEQMHRVL MKEAGFIAQ