Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_2070 |
Symbol | |
ID | 5105050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1987398 |
End bp | 1988906 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507960 |
Product | major facilitator transporter |
Protein accession | YP_001192134 |
Protein GI | 146304818 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATTC CCAAGGACAA GTCCAGTGAA GGCAAGGATT ATGATCTGAA ATACGCATAT AGGGCTTTGG GCATTCTGGC TCCCTTAGCC ATAGTTGTGA TGTATACAGA AGGAATGTTG ATTCCCTCCC TTGTGAAAAT CGAAGATGAC TTTGGGGTCA ACGCTGCCCA AGTTAGCTGG GTTCTAACAG TGTATCTCCT TACAGGATCG GTCATGAATC CCATAGCAGG GAAACTGGGA GATATGTTTG GGAAGAAAAG AGTCCTTACC ATGATCATCT GGATTTACGC TGTGGGTGTT ACCTTGACAG GATTTGCACC GAGTTTCGGC TTCCTGATAT TTGCTAGGGC TATTCAGGGA CTAGGTCTTG CCATGTTTCC GTTAGCGTTC AGCCTAATAA GGGAAGAGTT CCCTCCAAAA CTAGTTCCTA CTGCTCAAGG TATAGTTAGC GCCATGTTTG GGGCAGGATC TGCAATAGCC CTTCCCATAG GTGCATACAT TTCACAAAAT TTTGGATGGC AGTATACATA TCACACTGTT ATACCATTCG TGGCCCTGAT GGCAATACTG ACCTCCACAC AGATAAGGGA GTCTAAGTTC AAGAACCCGA ATACCAAGAT CGACTTCGTT GGAGCAGGAG TACTCTCGAT CTCGTTGGCC TCCTTAATCC TAGGTTTCAG CGAGGCTCCT AGCTGGGGTT GGTCATCACC CCTTACCATA GGCACCCTTC TCCTCTCCAT GATCACTTTT GCCACGTTCA TCTATCTTCA GACTATTACG CCATTTCCAT TAATATCCGT GAAGCTATTG AAGAGAAGAA ACGTTCTTGT CGCTAACGTG GCCGCAGTGG TTGCAGGATT TGCAATATTT ATGGGGTCCC AAACCTTAAC TTACTTGTTT GAGGAGCCTA ATCCAGTTGG GTTTGGGCTA GATATCCAGG CAACGGGGCT TGCCCTACTA CCCACAGCTC TTATCCAGTT AGTGGGAGGC CCCCTCGCAG GAAAGGCAAT ATCTAGAAGC GGACCAAGAA AAGTGATGAT AGTTGGTTCC ACTGCACTAA TTCCGGTGTA TCTAGTTCTT TCAGTGTTAA CGTCCGCTGG GGGATCACAG TCAATTAACC TTGTGATTAC CTTCGCCACT TTGGCAATGT TAAGTGCCAC GCTTCTGAAC GTGAGCCTGG TAAACCTACT CACCTTTTCC GTGGAGAGGC AGGTCATGGG GACAGTCACT TCAATTAATA CAGTCTTCAG GTTGGTAGGA GGAACCATAG GTCCATCTGT AGCCGGAGCT ATTATGGGGA CTTATCAAAG CAGTATCGTG GAGATCATAC CGGTAGGAGG AACTACGGTC TACTACCCGG TCATCATTCC TTCGGACCAG GCCTTTAGTC TTATTTATCT GATTGCAACT CTTCTGGCTG TAGTTATGAC TGGGATTTCC TTCATGACTA AGGACATAAA AATTGGGAAT GTAATGAAGG AAAGAAATGA GTTCGTTGCA GGACATTAG
|
Protein sequence | MEIPKDKSSE GKDYDLKYAY RALGILAPLA IVVMYTEGML IPSLVKIEDD FGVNAAQVSW VLTVYLLTGS VMNPIAGKLG DMFGKKRVLT MIIWIYAVGV TLTGFAPSFG FLIFARAIQG LGLAMFPLAF SLIREEFPPK LVPTAQGIVS AMFGAGSAIA LPIGAYISQN FGWQYTYHTV IPFVALMAIL TSTQIRESKF KNPNTKIDFV GAGVLSISLA SLILGFSEAP SWGWSSPLTI GTLLLSMITF ATFIYLQTIT PFPLISVKLL KRRNVLVANV AAVVAGFAIF MGSQTLTYLF EEPNPVGFGL DIQATGLALL PTALIQLVGG PLAGKAISRS GPRKVMIVGS TALIPVYLVL SVLTSAGGSQ SINLVITFAT LAMLSATLLN VSLVNLLTFS VERQVMGTVT SINTVFRLVG GTIGPSVAGA IMGTYQSSIV EIIPVGGTTV YYPVIIPSDQ AFSLIYLIAT LLAVVMTGIS FMTKDIKIGN VMKERNEFVA GH
|
| |