Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1745 |
Symbol | |
ID | 4784203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1871443 |
End bp | 1872783 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640090315 |
Product | putative membrane transport protein |
Protein accession | YP_001020939 |
Protein GI | 124266935 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.27935 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCT TCAGCCACGC CCAGCCCCAC CTGCCGTCGT CCAGCGCCGC CGAGCGCGAC GCGCGGCAGC GCTCCGCCGA TCACCAGAAG GTCGCTCCCG GCGAGATCGC GGTGGGCGTG GTCATCGGCC GCGCGTCCGA GTACTTCGAC TTCTTCGTCT ACGGCATCGC CTCGGTGCTG GTGTTCCCGG CGTTGTTCTT CCCCTTCGTC GACCGCCTGC AGGGCACGCT GTACGCCTTC GTGGTGTTCT CGTTCGCCTT CATCGCGCGG CCGCTCGGCA CCGTGCTGTC GATGGCCATC CAGCGGCGCT GGAGCCGCGG CACCAAGCTC ACGGTGGCGC TGTTCCTGCT GGGCACCTCG ACCGCCGGCA TCGCCTTCCT GCCCGGCTAC CAGACGCTGG GCGCGGCCTC GATCGTGCTG CTGGCGCTGT TCCGCATCGG CCAGGGCGTG GCGCTGGGCG GGTCCTGGGA TGGGCTGCCC TCGCTGCTGG CGCTCAATGC CCCGCCCGAG CGTCGCGGCT GGTACGCCAT GCTCGGACAG CTCGGCGCGC CACTGGGCTT CATGCTCGCC AGCGGCCTGT TCGCCTACCT GGTGGCGAGC CTGTCGACCG CCGACTTCCT GGCCTGGGGC TGGCGCTACC CGTTCTTCGT GGCCTTCGCG ATCAACGTGG TGGCGCTGTT CGCGCGGCTG CGGCTGGTCG TCACCCACGA ATACGAACGC CTGCTCGACG AGCGCGAACT CGAGCCCATC GGCGTGCTCG AACTGGTCCG CAGCGAGGGC CACAACCTGG TGATCGGGGC CTTCGCGGCG CTCGCCAGCT ACGCGCTGTT CCATCTGGTG ACCGTGTTTC CGCTGTCGTG GGTCACGCTC TACTCGCAGC AGTCGGTGGC CGGCTTCCTG ACCATCCAGA TCTTCGGCGC CGCACTGGCG GCCGGTGGCA TCGTCGCCTC GGGCTGGATC GCCGACCGCA TCGGTCGGCG CAGCACGCTG GGCGCTTCGG CGGTGCTGAT CGGCATGTTC AGCGGCTTCG CGCCCACGCT GCTGGGCGGT GGCCCGCTGG GCCAGGACGT GTTCATGCTG ATCGGCTTCG CGCTACTGGG GCTGTCCTAC GGCCAGGCGG CCGGCGCGGT GACCTCGAAC TTCTCGTCGA AGTACCGCTA CACCGGCGCC GCGCTGACCG CCGACCTGGC CTGGCTGATC GGCGCGGCCT TCGCGCCGCT GGTCGCGCTG GGGCTGTCGG CCCACTTCGG ACTGGCCTAC GTCAGCATCT ACCTGCTGTC GGGCGCGGCC TGCACGCTGG CCGCATTGAG CCTCAACCGC GCGCTGGGCC CTCGCGACTG A
|
Protein sequence | MSSFSHAQPH LPSSSAAERD ARQRSADHQK VAPGEIAVGV VIGRASEYFD FFVYGIASVL VFPALFFPFV DRLQGTLYAF VVFSFAFIAR PLGTVLSMAI QRRWSRGTKL TVALFLLGTS TAGIAFLPGY QTLGAASIVL LALFRIGQGV ALGGSWDGLP SLLALNAPPE RRGWYAMLGQ LGAPLGFMLA SGLFAYLVAS LSTADFLAWG WRYPFFVAFA INVVALFARL RLVVTHEYER LLDERELEPI GVLELVRSEG HNLVIGAFAA LASYALFHLV TVFPLSWVTL YSQQSVAGFL TIQIFGAALA AGGIVASGWI ADRIGRRSTL GASAVLIGMF SGFAPTLLGG GPLGQDVFML IGFALLGLSY GQAAGAVTSN FSSKYRYTGA ALTADLAWLI GAAFAPLVAL GLSAHFGLAY VSIYLLSGAA CTLAALSLNR ALGPRD
|
| |