Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3400 |
Symbol | |
ID | 4786330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 3614285 |
End bp | 3615466 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640091976 |
Product | hypothetical protein |
Protein accession | YP_001022588 |
Protein GI | 124268584 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0535164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACCG CTCCCCGCCG CATTCTGCCG ATCATCGTCG CCGCCCAGTT CGCGAGCACT TCGCTATGGT TCGCCGTCAA TGCGGTGATG CCCGATCTTC AGCACGCGCT CGGTCTGCCG CCTACCGCTG TCGGCTCGTT GACTTCGGCG GTGCAACTGG GCTTCATCGC CGGCACGCTG GTGTTCGCGC TGGGGTCGGT CGCCGACCGC CATTCGCCGC GCCTCGTGTT CCTGCTGTGC GCGATCTCCG GTGCACTGTT CAACGCCGCG CCCGTGCTCT GGCCGGCAGG CGCGCTGGAC CTGCCGACGC TGTGGCTGCT GCGGGCCTTG ACCGGCTTTG CGCTGGCCGG CATCTACCCG GTCGGCATGA AGATCGCCGC TGGCTGGTAT CCGCAGGGGC TGGGCCGTGC GCTGGGCTGG CTGATCGGCG CGCTGGTTCT GGGCACCGCG ACGCCGCATG CGCTGCGGGC CTTGGGCGCG CACTGGCCCT GGCAGCAGGT GATGCTGGTG GTGTCCGGGC TTGCGGTGCT GGGGGGCTGG ATGGTGTGGC AGTGGGTGCC GGAGAGCACC CATCTGCCCC GTGCCACGCG CCTGCAGTGG CGTGCCTTCG GCCGCGTGCT GACCGACCGC CGCCTGCGTG CACCCGTGTT CGGCTACTTC GGCCACATGT GGGAGCTCTA CACCTTGTGG GTGTTGCTGC CGGCGCTGCT GGCCACGCGG CTCCAGGGGG CAGCCATCTC GGTGGGTGCG TTCGGCGTCA TCGCCGCGGG GGCGGTCGGG TGCGTCGCGG GGGGCTGGCT GGCGCAGCGC CACGGCAGTG CGCGCGTGGC GGCGGGGCAG CTGGCGCTGA GCGGCCTGTG CTGCGTGCTC ACGCCGCTCG CGATGGCAGC GCCCGCCGCG GTGTTCGCGG CGTGGCTGCT GGTGTGGGGC ATCAGCGTGG CCGGCGACTC GCCGCAATTC TCGGCGCTAA CGGCCACGCA TGCCCCGCGC GAGGCGGTCG GCAGCGTGCT GACGCTGGTC AACTGCATCG GCTTCTCGAT CTCCATCCTG AGCATCCAGG GCTTCGTGGC GCTGGCACAG CACCACGATC TGTCGCTGTT GCTGCCGTGG CTCGGTCTCG GGCCGCTGCT CGGCCTGTGG GGCCTGCGGC CCTTGCTGGT CCGCGAGGGC TCGTCGCCCT GA
|
Protein sequence | MPTAPRRILP IIVAAQFAST SLWFAVNAVM PDLQHALGLP PTAVGSLTSA VQLGFIAGTL VFALGSVADR HSPRLVFLLC AISGALFNAA PVLWPAGALD LPTLWLLRAL TGFALAGIYP VGMKIAAGWY PQGLGRALGW LIGALVLGTA TPHALRALGA HWPWQQVMLV VSGLAVLGGW MVWQWVPEST HLPRATRLQW RAFGRVLTDR RLRAPVFGYF GHMWELYTLW VLLPALLATR LQGAAISVGA FGVIAAGAVG CVAGGWLAQR HGSARVAAGQ LALSGLCCVL TPLAMAAPAA VFAAWLLVWG ISVAGDSPQF SALTATHAPR EAVGSVLTLV NCIGFSISIL SIQGFVALAQ HHDLSLLLPW LGLGPLLGLW GLRPLLVREG SSP
|
| |