Gene Information |
Locus tag | Mpe_A0722 |
Symbol | |
ID | 4783908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 748517 |
End bp | 749578 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640089283 |
Product | succinoglycan biosynthesis protein ExoA |
Protein accession | YP_001019919 |
Protein GI | 124265915 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
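The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 748517
END_BP = 749578


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1062 353"
```

Under these assumptions the result agrees with the Gene Length (1062 bp) and Protein Length (353 aa) rows.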
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGTCG AACCTTCTGC CTCGCCGGCC GCCGCAACCC GCGTCCTGCT CGTGATCCCC ACGCTCAACG AGGCGGCCCA CATCGACGGC CTGCTCGCAG GGCTGCTGGC GGATCCGCCG AGCTCTGCGA GCTGGCGCAT CGTCGTCGTC GACGGCGGCA GCACCGATGG CACCGTCGAC AAGGTCAAGT CCGTGGCAGC GCAGCAACCC GCGGTGCAGT GGCTGCACAA CCCGGCGCGC ATCCAGAGCG CGGCGCTCAA CCTTGCGGCG CGGCAGTTCG GCCGCGAGGC CGACGTGCTG ATCCGCTGCG ACGCCCACGC CGTCTACCCC GCCGGCTTCT GCCAGCGCCT GCTCGATACC CTGGCCCGTT CTGGCGCCGA TGCGGTGGTG GTGCCCATGG ATTCCATCGG CCACGGCTGC CTTGAGCGCG CCGTGGCCTG GGTGTCCAAT TCGCCGGTCG GCACCGGCGG CTCGGCGCAT CGCGGCGGCC ACCGCAGCGG CTTTGTCGAC CATGGCCATC ACGCCGCATT CCGCATGGAC ATGTTCCGCC GTGCCGGGGG CTACGACGAG AGCTTCACGC ACAACGAGGA CGCCGAGCTC GACTGCCGGC AGCGCGCCCT CGGTGCCCGC ATCTACCTGG ACGCCGACAT CCGCCTCGGC TACCACCCGC GCAGCAGCCT GCCGGCGCTG GCGCGTCAGT ACTTCCGCTA CGGCGCCGGA CGCTCGCGCA CTGCGCGCCG CCACCCGGGC TCGCTGCGCC TGCGGCAGCT TGCCGTGCCC GCGCATCTGC TGCTGTCGGT GCTGGCTGTC GCGATTAGTC CCTGGTTCGC CGGGCTGCTG GCGTGGCCGG CGTTCTACGT CGGCGTGCTG CTGCTCACGT CGCTGCTGAT GGCGGTGCGG CACCGCTCGG CCTGCGGCCT GCTGGCCGGC GTGGCTGCCG CCACCATGCA CACGGCCTGG GCGCTGGGCT TCCTGTCGGG TCTGGTGGGC CGGCGCGAGC AGGCCTGGCA GCCGGTGCTG GCCGCCCCCC TGTGGCCCGC TGAAACGAGA GCCCCCGCCT GA
|
Protein sequence | MPVEPSASPA AATRVLLVIP TLNEAAHIDG LLAGLLADPP SSASWRIVVV DGGSTDGTVD KVKSVAAQQP AVQWLHNPAR IQSAALNLAA RQFGREADVL IRCDAHAVYP AGFCQRLLDT LARSGADAVV VPMDSIGHGC LERAVAWVSN SPVGTGGSAH RGGHRSGFVD HGHHAAFRMD MFRRAGGYDE SFTHNEDAEL DCRQRALGAR IYLDADIRLG YHPRSSLPAL ARQYFRYGAG RSRTARRHPG SLRLRQLAVP AHLLLSVLAV AISPWFAGLL AWPAFYVGVL LLTSLLMAVR HRSACGLLAG VAAATMHTAW ALGFLSGLVG RREQAWQPVL AAPLWPAETR APA
|
| |
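The sequences above are shown in space-separated blocks of 10 characters. The sketch below shows one way such a string could be cleaned, its GC fraction computed, and its reading frame translated. To stay short it uses only the first 30 nucleotides of the gene sequence, so it does not reproduce the 73% GC figure for the full gene. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard code, whose codon-to-amino-acid assignments translation table 11 shares; alternative start codons are not handled here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGCCTGTCG AACCTTCTGC CTCGCCGGCC"
print(translate(prefix))  # prints "MPVEPSASPA"
```

The translated prefix matches the first block of the protein sequence listed in the record. Passing the full gene sequence to the same functions would be the natural next step.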