Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2733 |
Symbol | |
ID | 4783750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2906730 |
End bp | 2908157 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640091304 |
Product | chain length determinant protein |
Protein accession | YP_001021922 |
Protein GI | 124267918 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03017] chain length determinant protein EpsF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0768542 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTCG GCCAGTTCCT TTCCATCCTC GCTGCGCGTT GGCGGCTCGT GCTGTCGATC GTCGCGCTCG TGGTGACCGC CGCGATCGGC GTCAGCCTCG CGTTGCCGAA GCAGTACACG GCGACCGCGG CGATCGTCTT CGACGTCAAT CCGGACCCTG TGTCGACGGT CGGCTATGGC GAGATGGTGT GGCCCGCGTA TCTCGCGACG CAGGTCGAGA TCATGCAGAG CGTTCGGGTT GCCAGACGCG TGGTCGAGGC GCTGCAGATG AAGGACGACG AACTCAGCCG TCGGCGCTGG CAGGAGGCAA CGGGTGGGCA GGGCGATTTC GAGGAATGGA TGATCAATGT CCTGAGTCGA GGACTGGTGG TCAAGCTGAC GCGTGAGTCC AACGTCGTGA CGCTGTCCTA TCGCGCCCCC GACCCCCAGG TTGCAGCGCG GGTGTCCAAC GCGTTCGTGA AGGCCTATCT CGATACCGTG GTCGACCTGA AGGTGGATCC AGCACGCCAG TATTCAACGT TCTTCGAGAG CCGTGCCAAA GAGCTGCGGG GGCAGTTGGA GCAGGCACAG GCCAAGCTTT CCGCCTTCCA GCGGCAGAAG GGGTTGATCG GGGCCGATGA ACGGCTCGAC ACTGAGTCCG CTCGGTTGGC GGAGCTCTCT GCACAGGTGG TGGCCATGCA GGCGCTGTCG GCCGAGTCGG GCAGCCGTCA GACACAGGCT CTGGCGCGCT CGGCGGAGCA ACTGCCTGAT GTCCAGGCCA ATCCGGTGGT CGCCAGCCTC AAGGCCGATC TCTCTCGCCA AGAGGTCCGA CTACAGGAAT TGAACGCGCG CCTTGGCGAT GCCCACCCGC AAGTGATGGA AGCCAAGGCA AATATCGCGG CGTTGCGCGG TCGCATCAGC GCCGAATCGC GGCAGGTCGC TTCTGGCGTC GGCGTGACGA ACACGATCAA TCGGCAACGT GAGGCGGAGA TTCGTGCGGC CTATGAAGCC CAGCGCCAGC GTGTGCTGCG CATGAAGGAG CAGCGTGACG AGGCGTCGAT CTACCAGCGC GAAGTCGAAG CGGCGCAACG CGCGCTTGAC AGCGTCATGA CGCGCTTTAA CCAGACCGCG CTCGAAAGCC AGGCGACCCG TTCCAATGCA TCTGTTCTGA CACCGGCCAG CACGCCCTTG CTGCCCTCTT CGCCCAAGAT CTTTCTCAAT GCGTTCATTG GGCTTTTCCT CGGAACGCTG GGTGCTGTCG CCATCGCGAT CGTTCTGGAG ATGATCAATC GCAGGGTGCG AAACGTCGAC GACATCACCG AGGCACTCGG TCTTCCAGTG ATCGGTACCT TGCCAAAGCC GGACCGCATT GGTGTGTTTG GCAAGCCTTC GTCTCAGCCC ATTTTGGCGC GCCGTGTGCT GGGGCAGTTG CCGATGTCCC GGCCGTGA
|
Protein sequence | MTFGQFLSIL AARWRLVLSI VALVVTAAIG VSLALPKQYT ATAAIVFDVN PDPVSTVGYG EMVWPAYLAT QVEIMQSVRV ARRVVEALQM KDDELSRRRW QEATGGQGDF EEWMINVLSR GLVVKLTRES NVVTLSYRAP DPQVAARVSN AFVKAYLDTV VDLKVDPARQ YSTFFESRAK ELRGQLEQAQ AKLSAFQRQK GLIGADERLD TESARLAELS AQVVAMQALS AESGSRQTQA LARSAEQLPD VQANPVVASL KADLSRQEVR LQELNARLGD AHPQVMEAKA NIAALRGRIS AESRQVASGV GVTNTINRQR EAEIRAAYEA QRQRVLRMKE QRDEASIYQR EVEAAQRALD SVMTRFNQTA LESQATRSNA SVLTPASTPL LPSSPKIFLN AFIGLFLGTL GAVAIAIVLE MINRRVRNVD DITEALGLPV IGTLPKPDRI GVFGKPSSQP ILARRVLGQL PMSRP
|
| |