Gene Mpe_A2733 details


Gene Information

Locus tag: Mpe_A2733
Symbol:
ID: 4783750
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 2906730
End bp: 2908157
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 63%
IMG OID: 640091304
Product: chain length determinant protein
Protein accession: YP_001021922
Protein GI: 124267918
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03017] chain length determinant protein EpsF
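
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2906730
end_bp = 2908157

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1428 475"
```

Under these assumptions the result agrees with the listed gene length (1428 bp) and protein length (475 aa).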


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0768542
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCG GCCAGTTCCT TTCCATCCTC GCTGCGCGTT GGCGGCTCGT GCTGTCGATC 
GTCGCGCTCG TGGTGACCGC CGCGATCGGC GTCAGCCTCG CGTTGCCGAA GCAGTACACG
GCGACCGCGG CGATCGTCTT CGACGTCAAT CCGGACCCTG TGTCGACGGT CGGCTATGGC
GAGATGGTGT GGCCCGCGTA TCTCGCGACG CAGGTCGAGA TCATGCAGAG CGTTCGGGTT
GCCAGACGCG TGGTCGAGGC GCTGCAGATG AAGGACGACG AACTCAGCCG TCGGCGCTGG
CAGGAGGCAA CGGGTGGGCA GGGCGATTTC GAGGAATGGA TGATCAATGT CCTGAGTCGA
GGACTGGTGG TCAAGCTGAC GCGTGAGTCC AACGTCGTGA CGCTGTCCTA TCGCGCCCCC
GACCCCCAGG TTGCAGCGCG GGTGTCCAAC GCGTTCGTGA AGGCCTATCT CGATACCGTG
GTCGACCTGA AGGTGGATCC AGCACGCCAG TATTCAACGT TCTTCGAGAG CCGTGCCAAA
GAGCTGCGGG GGCAGTTGGA GCAGGCACAG GCCAAGCTTT CCGCCTTCCA GCGGCAGAAG
GGGTTGATCG GGGCCGATGA ACGGCTCGAC ACTGAGTCCG CTCGGTTGGC GGAGCTCTCT
GCACAGGTGG TGGCCATGCA GGCGCTGTCG GCCGAGTCGG GCAGCCGTCA GACACAGGCT
CTGGCGCGCT CGGCGGAGCA ACTGCCTGAT GTCCAGGCCA ATCCGGTGGT CGCCAGCCTC
AAGGCCGATC TCTCTCGCCA AGAGGTCCGA CTACAGGAAT TGAACGCGCG CCTTGGCGAT
GCCCACCCGC AAGTGATGGA AGCCAAGGCA AATATCGCGG CGTTGCGCGG TCGCATCAGC
GCCGAATCGC GGCAGGTCGC TTCTGGCGTC GGCGTGACGA ACACGATCAA TCGGCAACGT
GAGGCGGAGA TTCGTGCGGC CTATGAAGCC CAGCGCCAGC GTGTGCTGCG CATGAAGGAG
CAGCGTGACG AGGCGTCGAT CTACCAGCGC GAAGTCGAAG CGGCGCAACG CGCGCTTGAC
AGCGTCATGA CGCGCTTTAA CCAGACCGCG CTCGAAAGCC AGGCGACCCG TTCCAATGCA
TCTGTTCTGA CACCGGCCAG CACGCCCTTG CTGCCCTCTT CGCCCAAGAT CTTTCTCAAT
GCGTTCATTG GGCTTTTCCT CGGAACGCTG GGTGCTGTCG CCATCGCGAT CGTTCTGGAG
ATGATCAATC GCAGGGTGCG AAACGTCGAC GACATCACCG AGGCACTCGG TCTTCCAGTG
ATCGGTACCT TGCCAAAGCC GGACCGCATT GGTGTGTTTG GCAAGCCTTC GTCTCAGCCC
ATTTTGGCGC GCCGTGTGCT GGGGCAGTTG CCGATGTCCC GGCCGTGA
 
Protein sequence
MTFGQFLSIL AARWRLVLSI VALVVTAAIG VSLALPKQYT ATAAIVFDVN PDPVSTVGYG 
EMVWPAYLAT QVEIMQSVRV ARRVVEALQM KDDELSRRRW QEATGGQGDF EEWMINVLSR
GLVVKLTRES NVVTLSYRAP DPQVAARVSN AFVKAYLDTV VDLKVDPARQ YSTFFESRAK
ELRGQLEQAQ AKLSAFQRQK GLIGADERLD TESARLAELS AQVVAMQALS AESGSRQTQA
LARSAEQLPD VQANPVVASL KADLSRQEVR LQELNARLGD AHPQVMEAKA NIAALRGRIS
AESRQVASGV GVTNTINRQR EAEIRAAYEA QRQRVLRMKE QRDEASIYQR EVEAAQRALD
SVMTRFNQTA LESQATRSNA SVLTPASTPL LPSSPKIFLN AFIGLFLGTL GAVAIAIVLE
MINRRVRNVD DITEALGLPV IGTLPKPDRI GVFGKPSSQP ILARRVLGQL PMSRP
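
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the database's own method. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon table is the standard one, which translation table 11 shares for elongation codons. Alternative start codons allowed by table 11 are not handled here, which is acceptable for this gene because it begins with ATG.

```python
from itertools import product

# Standard codon table built in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments and differs only in permitted starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the record's block layout."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Return the fraction of G and C bases in a DNA sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna):
    """Translate complete codons from position 0 up to the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
first_blocks = "ATGACCTTCG GCCAGTTCCT TTCCATCCTC"
print(translate(first_blocks))  # prints "MTFGQFLSIL"
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_content` and `translate` would allow the same comparison against the listed GC content and the complete protein.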