Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1095 |
Symbol | |
ID | 5103569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1021515 |
End bp | 1022948 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506990 |
Product | major facilitator transporter |
Protein accession | YP_001191183 |
Protein GI | 146303867 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACTC TTCTCTCCCT AACCCTTATG TTAATGCTTG TGAACTACGT GGAGACCATG GTGATTCCAG CACTGCCTAA GATAGAGGAC CAATTCTCCA CAACCGCGAC CACCGTGGCG TGGGTAACCT CGGCATACCT CATTGTGGGG GCGGTCGCGT CTCCGATTTT CGGCAAACTG GGCGACAGAT ACGGGAAAAA GAAGGTTTAC CTGATCTCAA TCGGGTTCTA CTCGCTTGCG GTTTTGATGG CAGGTTTCTC TCCGAACATT TACTTCCTGA TCTTCTCTAG GGGAGTTCAG GGAATAGGAT ATTCCACATT TCCCTTGGCT ATTGCCATCA TCACTGACCT GTTCCCCAAG GAAAGGGTGG CATGGGCACA GGGCATACTT AGCGCAACCT TGGCTGCTGG TCCTGCACTA GGTCTCCTTG TGGGATCCTA TATAGTCCAG GACTTGGGGT GGCCGTACGC CTTTCACACG GCTTTCATCC TCTCCTTGAT TTTGCTGGGC ATCTCGGCCA AGTACATCGT GGAAATCCCT GAGAAGACTA GGGAAAAGAT TGACTACCTT GGTGCTACCT TCCTCATGTT AACAGTGGTG CCACTCCTAG TTTATCTTTC CAATGGGCCC AACGTGGGGT GGACCACCTT GAGCCAGATA GCCCTCATCG TGGTGTCAGT GGTAGCGTTC CCTATCTTCT TGATCGTGGA GAGGAGAACC TCGGAGCCCT TGATGAGGCT TGACCTCTTT AGGGTAAGGA ACCTCATGGT GGCAAACGTG GCTGGTCTCA TTTCCGGTAC GGGTATGTTC CTGATGTTCA CTGGATTGGT TTACTACCTT CAGCTACCCA GACCCTACGG ACTAGGTTTA ACTATTATCG AGTCAGGTCT CCTCATGGCA CCCGTTGCCC TGGTCATGAT GACCTTGGGT CCTATTGTGG GTAGAGCTAT AAATGTGATT GGCCCGAAAC CTCTGCTCGT CGTAGGATCA TCGGTCAGCA TGTTGGGCTA CTTCCTACTG GACACCTTTA GGTATAGCGA GTACGAGGTT CTATTTGACG TGATAGTGAC AGCTGCAGGA TTGGTTAACC TGATTATCCC CCTAGTTAAC ATGGTCGCCT TGGCTTTACC TGAGGAGCAA AGGGGAATAG GGATCGGAAT GAACACCTTG ATAAGGACCA TAGGAAGCGC AATCGGTCCA GTGATATCAA CAGTGTTCAT GGACACTTAT GTCACGTGGT TGCTATATGA CGTTAATGGA CAGTTCATTC CCGTGGCACA GGTACCTGAC TACTCGGCCT TCCACTACAT GTACATGGTA GCAATTGCCC TTATGTTCCT GAGTTTGATA GCCTCATTGT TCACGAAAAA CTATGTTATA AAGGCCAGAC AGGAGGTGAA AAGGGAAGTA GTTGAGGCCA AACATCCTGG ATGA
|
Protein sequence | MRTLLSLTLM LMLVNYVETM VIPALPKIED QFSTTATTVA WVTSAYLIVG AVASPIFGKL GDRYGKKKVY LISIGFYSLA VLMAGFSPNI YFLIFSRGVQ GIGYSTFPLA IAIITDLFPK ERVAWAQGIL SATLAAGPAL GLLVGSYIVQ DLGWPYAFHT AFILSLILLG ISAKYIVEIP EKTREKIDYL GATFLMLTVV PLLVYLSNGP NVGWTTLSQI ALIVVSVVAF PIFLIVERRT SEPLMRLDLF RVRNLMVANV AGLISGTGMF LMFTGLVYYL QLPRPYGLGL TIIESGLLMA PVALVMMTLG PIVGRAINVI GPKPLLVVGS SVSMLGYFLL DTFRYSEYEV LFDVIVTAAG LVNLIIPLVN MVALALPEEQ RGIGIGMNTL IRTIGSAIGP VISTVFMDTY VTWLLYDVNG QFIPVAQVPD YSAFHYMYMV AIALMFLSLI ASLFTKNYVI KARQEVKREV VEAKHPG
|
| |