Gene Mpe_A0610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0610 
Symbol 
ID4785177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp642664 
End bp643941 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content70% 
IMG OID640089169 
Productputative capsule polysaccharide export protein 
Protein accessionYP_001019807 
Protein GI124265803 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.251819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCG CCTATCTCGA CCCTCCTTAC AGCCGCTACT TCCACGAGCT GGCGGCGCGG 
TTGGCTCGGC CTTCGGGCGG CAGTGTCGTC GCCCTGCTGT CCTGTCCGGC CTATCGCCTG
TACGCGGGGG GCGACCGCGC GCAGGTCTGG GAGCCGGGTG CGCCGGCGCA GGCGCACGAC
GTGCCACCGG CCTTCGAGCG GGCGGGCTGG GCGCAGACCG ACTCGCCGGA ATTCCGGCGC
GCCTTCTCGC ACGCCGTGGA GTGGTTCAAG GAGCGTTTCA CCGCTGAACA CACCGACGTC
TGCCTGGTGT TCTCCGATGC CCGGCCGTTC TCGCAGGCGG CGCACCTGGC GGCGCAGCAG
CTGGGCGTCG TCTGCGTGTT CTTCGAGCGC GGCGCCTTCC GCTACCGCAC CGCGAGCCTG
AGCACGCAGG GGCTCAATGC GCGCTTCTGC CTGCAGCAGG CGCAGCAATC GCCCCTGCTC
GAGGCGCTCC CGCTCTTCGA TCTGCCGCCG CGCCGGGCGA TCGAGCCCTG GTTGAAGCTG
CGTTTCGTGG GCTTCATGGC GCTCAACGGC CTGCTCGGCG CGCTGCAGCC GCAGCGCCGG
CCGATGCAGC ACAAGAGCTA CCACTTCTTC AACTACCTGC GCATCGCCCT CAAGCAGTTC
GGCGCCGAGC ATCCCGAACT GCCGCTCGCG CAGGCCCCGG AACCGCCGGC CACCGACGGG
CCGGTGGTGG TGCTGCCGCT GCAGCTGCCG ACGGATTCGC AGTTCGTCAT GTACTCGCCG
TTCCGGCACA ACCAGGAACT GATCGATTTC GTGGCCCGTC AGATGCGGAA CGCGCTGCCC
GGGACCCCGC TGCTGGTGAA GAAGCACCCG ATGGATGTGC GCAGCTACCG GCTGCCGGCC
GGCGCGCGCT GGATCGACGG CAGCCTGGCG CGCTTCAACG AGCGTCCCGC GGTGTTCGTC
TGCCTCAATT CGAACGTGGG CTTCGAGGCG GCGATCCATG GCAAGCCGGT GCTGTGCTTC
GCCGACAGCT TCTACACCGG CCACCCGAGC GTGACGCGGG TGAGCCGCGA GGACTTCGCA
CCACAGCTCG CGGCCGCGGC GGCGCGGCCC GATGACCTGG CGGCGGGCAG GGCACTGCGC
GCGGCCGTGC TGCGGCATTG CCAGGCGCCG GGCGACGTGT GGGCCTACAG CGCCGAAGAC
TTGGCACTGA CGCGGGACAT CGTGGCGACG CACTACGATG CGGCCCGGCT GTCGTCCGGG
GCTGCGCCGC CTGCCTGA
 
Protein sequence
MNVAYLDPPY SRYFHELAAR LARPSGGSVV ALLSCPAYRL YAGGDRAQVW EPGAPAQAHD 
VPPAFERAGW AQTDSPEFRR AFSHAVEWFK ERFTAEHTDV CLVFSDARPF SQAAHLAAQQ
LGVVCVFFER GAFRYRTASL STQGLNARFC LQQAQQSPLL EALPLFDLPP RRAIEPWLKL
RFVGFMALNG LLGALQPQRR PMQHKSYHFF NYLRIALKQF GAEHPELPLA QAPEPPATDG
PVVVLPLQLP TDSQFVMYSP FRHNQELIDF VARQMRNALP GTPLLVKKHP MDVRSYRLPA
GARWIDGSLA RFNERPAVFV CLNSNVGFEA AIHGKPVLCF ADSFYTGHPS VTRVSREDFA
PQLAAAAARP DDLAAGRALR AAVLRHCQAP GDVWAYSAED LALTRDIVAT HYDAARLSSG
AAPPA