Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_1447 |
Symbol | |
ID | 7270052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 1494725 |
End bp | 1495924 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643570071 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002466493 |
Protein GI | 219852061 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.320461 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTCA AAAATATTCA CATTCTCATA CTCTGTCTCG TTGCATTCTT TGCAATGGCC GGCGGGGCTC TCCTGGCACC CGTATTGCCG GAGATGATGG GACCCCTGGG GACCACCGCA CAGAATGTAG CGCTGCTCAT GTCAGTATTT ACGATCTCAA CAGCGGTCTT CACCCTCATC ATAGGACATT TCATTGACCG CGTAAATCGT AAGAGGATAC TGGTACCGAG CCTTGTGCTC TATGGCCTGA CGGGCCTGGT CAGTTTTTTT GTTGCTGATT TCTCATTACT CCTTGTCCTG AGGTTTTTAC AGGGTGTTGG CGTGGCCGGG ATGACATCCA TGGCAATGCT GGTCATCGGA GATGTTTACA CCGGCTTTGA TCGGGTATCG GCCATGAGTA AGATCAGCAT ATCATTTGCC CTTGGATCAA TATTTGCCCC GGTGATCGGA GGCAGCCTTG CTCTGCTGGG ATGGAATTAT GCGTTCCTGT TTTATGCACT TTCCCTGCCA TTTGCGATAA TCGTGATACT ATCGCTTCCT GAGACGAAGG TCCAGACTGA TACGAGAGAT CATAAGGGCA TGATTGAAGC GCTCAAATGT CTCAGGGTAT TACCAATCAT CTATACGATA TTCATGGGGT TTTCGATCTT CTTCATCCTG TTTGCCATGA TGATCTACGT GCCTTTCATG CTCAAGAGTG TGTTTGGATA TGGGTCAGGG GAATCGGGAC TTATGCTCGC AATTCCGGGG ATCTCCTGCA TACTGTTTGC ATCCCGTGTG GGACCCCTTG CTGGTAAGCA TTCATTACTC ATGGTGATTG CTGCGGGTTT TGCCTGTGTC GGTCTGTCGA TGTTATTCAT GCCGGTTTTG CATTCCCTTG CCGCAGTGTT TCTTTTATTA CTGCTGTTCG GAGCAGGCCT TGTCCTTGCC CAGACTGCCA TTGACGTGCA GATCATTCAG GTCTCCCCTC CTGCATCAAG AGGAGGAGTG ATCTCGATTC ACAACTGCAT GAAATACGTT GGCCAGAGTG CCTCTCCCAT TGTACTTGGT ATCATCCTCG CGTATTATGG CCTGAACGCA GTTTTTATAG CAGCAGGTGT CTTTGGATTG CTTATCGCCC TGACAACGTA CCTGATGAAA AAACGGTTTG GTGGTTCAGG CACTCACCCG GTTAAAGATA CTAAAGCATC AGTTCTTTAG
|
Protein sequence | MKFKNIHILI LCLVAFFAMA GGALLAPVLP EMMGPLGTTA QNVALLMSVF TISTAVFTLI IGHFIDRVNR KRILVPSLVL YGLTGLVSFF VADFSLLLVL RFLQGVGVAG MTSMAMLVIG DVYTGFDRVS AMSKISISFA LGSIFAPVIG GSLALLGWNY AFLFYALSLP FAIIVILSLP ETKVQTDTRD HKGMIEALKC LRVLPIIYTI FMGFSIFFIL FAMMIYVPFM LKSVFGYGSG ESGLMLAIPG ISCILFASRV GPLAGKHSLL MVIAAGFACV GLSMLFMPVL HSLAAVFLLL LLFGAGLVLA QTAIDVQIIQ VSPPASRGGV ISIHNCMKYV GQSASPIVLG IILAYYGLNA VFIAAGVFGL LIALTTYLMK KRFGGSGTHP VKDTKASVL
|
| |