Gene Mpe_A0737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0737 
Symbol 
ID4784983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp765760 
End bp767196 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content66% 
IMG OID640089298 
Productchain length determinant protein 
Protein accessionYP_001019934 
Protein GI124265930 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03017] chain length determinant protein EpsF 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTCA GCCAATTCTT CACCATCGTC CGGGCCCGCT GGGTCTCGGG CTTCGCCGTT 
CTCGCCGTCG TGCTGGCGGT CGTCATCGCG GTCAGCCTGT CGCTGCCGAA GCAGTACACG
TCGACCGCTT CGGTGCTGAT CGACGTGAAG CTGCCCGACC TGGGCGGCAA CACCGCTGCC
GCCGGCGGCA TCGTGCCGCT GGGCTTCATG GCCACCCAGC TCGACGTGGT GCGCAGCGAA
CGGGTGGCGT TGCGTGCACT GCGCAGCCCC GGCCTGAAGC TCAACGAGAG CGTCGAGCTG
CGCGAGCAAT GGCGGGCTTC CACTGGCGGC CAGGGCGACT TCGAATCGTG GCTGGCGGCG
CTGATCCAGA AGAAGCTCGA CATCCTGCCC GCGCGGGAAT CCAACGTCAT CACGCTGGCC
TATTCGTCGC CCGACCCCAA GTTCTCTGCA GCTGTTGCGA ACGCCTTCAT GCAGGCCTAT
ATAGACACCA CGCTCGACCT GAAGGTGGAG CCGGCCAAGC AGTACAACGC CTTCTTCGAC
GACCGCTCCA AGCACACCCG CGAAGCGCTG GAACAGGCGC AGGCCAAGCT GTCCGCCTAC
CAGCAGCAGA AGGGCATCAT CGCCACGGAC GAGCGCCTGG ACGTCGAGAA CGCCCGCCTC
AACGAGCTGA CCACGCAACT GGTCGTGCTG CAGGGTCTGG CCGCCGAATC GCGCGGACGG
CAGGATCAGT CGAGCGGCAA CAACGACCGC ATGCAGGAAG TGCTGAACAA CCCGGTGGTC
AGCGCCCTGA CCGCCGACCT GTCGCGCCAA CAGGCCAAGC TGACCGAACT GAACGAACGC
CTGGGCGAGA ACAACCCGCA GGTCGTGGAG CTGCGCGCCA ACATCGAGGA GCTGCACAAG
CGCATCCAGG CGCAGACGAC ACGCGTGACC GGGAGCCTGA ATGTGAACAA CACCGTGAAC
GAGGGCCGCC TGGCCCAGCT CAATGCCGAG ATCCAGCAAC AGCGCGCCAA GCTGCTGAAG
CTGAAGGACC TGCGCGACGA AGCCGCCGTG CTGCAGCGGG ACGTGGAGAA CGCCCAACGC
ACCTACGACG CCGTGCTGAC CCGCGTGAAC CAGACCAGCA TGGAAAGCCA GAACACCCAG
ACCAACGTGT CGGTGCTCAA GCAGGCCACC GCGCCGGCGT TCCCCTCGTC CCCTCGCCTG
CTGCTGAACA CTGCGGTGGC GCTGGTGCTG GGCTCACTGC TCGGCATCGG CCTGATGCTG
GCACGCGAAC TGCTGGACCG ACGCATGCGC ACCATCGAAG ACGTGGTCAG CGGCCTGCGG
CAGCCGCTGC TGATCGTGCT GCCCAAGACC AGCCGGCAGG ACGCCCATGG CGGCTCGCGG
CTGAAGCTGA CCAAGGCGCG CGTCGTAAGA GGGCTCGCTG GCCCCGCCCG ATCATGA
 
Protein sequence
MTFSQFFTIV RARWVSGFAV LAVVLAVVIA VSLSLPKQYT STASVLIDVK LPDLGGNTAA 
AGGIVPLGFM ATQLDVVRSE RVALRALRSP GLKLNESVEL REQWRASTGG QGDFESWLAA
LIQKKLDILP ARESNVITLA YSSPDPKFSA AVANAFMQAY IDTTLDLKVE PAKQYNAFFD
DRSKHTREAL EQAQAKLSAY QQQKGIIATD ERLDVENARL NELTTQLVVL QGLAAESRGR
QDQSSGNNDR MQEVLNNPVV SALTADLSRQ QAKLTELNER LGENNPQVVE LRANIEELHK
RIQAQTTRVT GSLNVNNTVN EGRLAQLNAE IQQQRAKLLK LKDLRDEAAV LQRDVENAQR
TYDAVLTRVN QTSMESQNTQ TNVSVLKQAT APAFPSSPRL LLNTAVALVL GSLLGIGLML
ARELLDRRMR TIEDVVSGLR QPLLIVLPKT SRQDAHGGSR LKLTKARVVR GLAGPARS