Gene Information |
Locus tag | Msed_0467 |
Symbol | |
ID | 5105463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 421881 |
End bp | 422960 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640506373 |
Product | major facilitator transporter |
Protein accession | YP_001190568 |
Protein GI | 146303252 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
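The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 421881
end_bp = 422960

gene_length = end_bp - start_bp + 1  # inclusive span on the replicon
codons = gene_length // 3            # complete codons in the CDS
protein_length = codons - 1          # assumption: last codon is the stop codon

print(gene_length, protein_length)   # prints: 1080 359
```

Under these assumptions the result agrees with the listed gene length of 1080 bp and protein length of 359 aa.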
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0594986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAGGA CACTCCTGGC ACTGGCAATG GGTGGTTACA CAGACGGATT TGATCTCCTG ATTATTGCAG GAGTACTGGG AGAAGTACTT AAGGTCTTTA AACCTACACG GCTCGAGACT GGACTCTTAG TTTCAACTTC TTTTCTAGGA TCAATTGTAG GGGCAATTAT CCTTGGGTTA GTGTGTGACA TCATGGGTAG GAGAAAGAGC TACCTCATCT CGTTAATCCT TTTCATTATA GGGGCTCTAA TTAGTGCCAC TGCACAGGAT TACGGCTCAC TTGTTCTTGG AAGGTTGTTG GTAGGTCTAG GAATTGGAGG TGAAATTCCG TCCTCAACCA CCCTTCTAAC GGAGATTTCG AAAAAATGGG TAGGTCTTAT CTTCGCCTCT TGGGCCATAG GAGCACTCAG TGCGACAATA ATACCCTTTT TCGTATATCC CTGGAGGATT GCGTTGTTGC TAGGAGCGGT TCCACCGCTT ATTGCCGTGG CCCTCCACAG GTGGATTAGG GAAAGCGAAG TTTGGCTCAA TTCGTCCAGG ATTAATAACG CCTCGAAGTT TACAGTTAAT AAAGCCTCAA GTATAGCAAC CTTGGTCACC GGGTTAAGTC AACTAGTCCT CACCATGGTC TTAGCATCGT TTGCTGTTTA TGTACCTGGA TACTCCATGG GAACTCAGTT AATCAATTGG ACACTTTTCG CGATAGGTTC AGTCCTGACT ATCTCGGCAC TCAGGAGAAA GCACATTCTT TTCCTTTCCT ACTTACTAAT AGGGGTGTTC CTTGTTACAT ATTCCGTCAC CGGCTTGTTC AGTGCAATTG CCTTAGTCTG GTTCTTCTCG TGTTTAGCCT TCGGATTCTC GTTCTTGTAC GTGGGAGAAC TTAGAAGCGC CACCGATAGA GGAACCTTCA ATGGATTCCT CTTTTTTATG GGAAGGTTAG GAGGAGCCAT AGGTACGTTC TCTTATCCAC TTCTGAGATA CGAGTTGAAA GAGGTGCTCC TAATGATTTC CCTTTCCCTT ATGGTCCTAA GTCTTATCAT ACCATTTTTA GAAGAAACTA GGATAGAAAA GGTAGAGTGA
|
Protein sequence | MKRTLLALAM GGYTDGFDLL IIAGVLGEVL KVFKPTRLET GLLVSTSFLG SIVGAIILGL VCDIMGRRKS YLISLILFII GALISATAQD YGSLVLGRLL VGLGIGGEIP SSTTLLTEIS KKWVGLIFAS WAIGALSATI IPFFVYPWRI ALLLGAVPPL IAVALHRWIR ESEVWLNSSR INNASKFTVN KASSIATLVT GLSQLVLTMV LASFAVYVPG YSMGTQLINW TLFAIGSVLT ISALRRKHIL FLSYLLIGVF LVTYSVTGLF SAIALVWFFS CLAFGFSFLY VGELRSATDR GTFNGFLFFM GRLGGAIGTF SYPLLRYELK EVLLMISLSL MVLSLIIPFL EETRIEKVE
|
| |
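The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the tool used to build the record: the helper names `clean`, `translate`, and `gc_content` are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code (alternative start codons are not handled, which does not matter here because the gene starts with ATG). The sequences in the record are printed in space-separated blocks of ten, so whitespace is stripped first.

```python
from itertools import product

# Standard amino acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAAAGGA CACTCCTGGC ACTGGCAATG"
print(translate(prefix))  # prints: MKRTLLALAM
```

Passing the full gene sequence to `translate` should give a string comparable with the protein sequence above once its block spacing is removed, and `gc_content` on the full sequence can be compared with the rounded GC content field.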