Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3532 |
Symbol | |
ID | 4786235 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3743655 |
End bp | 3744533 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640092113 |
Product | hypothetical protein |
Protein accession | YP_001022720 |
Protein GI | 124268716 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1462] Uncharacterized protein involved in formation of curli polymers |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.348031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.264846 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACAGT CCTTGCGCGC GCTCGCCGTG GTGGCGCTCG CGGCCCTCGG GATCGCGGGT TGCAGCACCT CCAAAACAGA GATCGGCGGC CCCTCCGACA TGGCCATCGC CGACCAGGCG CCGCCGCAAG AGGGCGGTGT GGGACGCTGC GAGAAGCGGC TCGGCACCGT GGCGATCACC GAATCAGAAG TCAACAGCCA GGCGCTGATG TCGGCCGGCC TGCCGCGTTC GATGGCGCCG CTGGTGCGCC ACCTGCTGAT CCGCAGCGGC TGCTTCAACG TGGTCGACCG CGGCGCGGCC TACTCGCTGC TCGAGGCCGA GCGCAGGCTG CGCGAGCAGC TCGGCACCGA CGCCAACGCG ACGGTCGCCC GGCACCTGCA GCCGCTGGAC TACATCCTGC GCGCAGAGAT CGTGTTCGCC GAACAGATCG GCCAGAGCAA GGGTGTGCTC GGCGGCGTGT TCGGTGACGT GATCGGCGGC ATCGGCGGCC AGTACAACAA GAAGGAAGCG GTGGTGCTGC TGAGCGTGGT GGACGCGCGC ACCAGCGAGA TCACGAGCTC CGTGTTCGGC CGTGGCACCA GCGATTCGGC CGGCCTCGGC AGCCTGGTGC TCAGCAGCGG CGTGTTCGCG ATCGATGGCG GCTGGGCCGA CACGCCGCAG GCGAAGACGG TGGCCGCTGC GCTGGTCGAC GCCTGGAACC GCACGCTGCC CAAGCTGCCG GCGGCCGACA TCGCGCCGCC CCCGAAGGCC AAGCCCGTCG TGGCGCCGGT GGCACCGCCC GCGCCCGTGC CCTTGCCGCT CCCGCTCGAG CCGTCTGCCG CACCGCCGCC ACCCGTGCCC GCCTCCGCGC CCGAGCGCGC GGCCAGCGCG CCGGCCTGA
|
Protein sequence | MKQSLRALAV VALAALGIAG CSTSKTEIGG PSDMAIADQA PPQEGGVGRC EKRLGTVAIT ESEVNSQALM SAGLPRSMAP LVRHLLIRSG CFNVVDRGAA YSLLEAERRL REQLGTDANA TVARHLQPLD YILRAEIVFA EQIGQSKGVL GGVFGDVIGG IGGQYNKKEA VVLLSVVDAR TSEITSSVFG RGTSDSAGLG SLVLSSGVFA IDGGWADTPQ AKTVAAALVD AWNRTLPKLP AADIAPPPKA KPVVAPVAPP APVPLPLPLE PSAAPPPPVP ASAPERAASA PA
|
| |