Gene Information |
Locus tag | Msed_0907 |
Symbol | |
ID | 5103553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 838802 |
End bp | 840235 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506810 |
Product | major facilitator transporter |
Protein accession | YP_001191003 |
Protein GI | 146303687 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
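The coordinate and length fields above are mutually consistent, which a short check can show. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 838802
end_bp = 840235

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS, one codon per residue plus one stop codon.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1434 0 477
```

Under these assumptions the result matches the listed Gene Length (1434 bp) and Protein Length (477 aa).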
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0239598 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAGGA AAGCTCGGGA GACCTTAATT ATTCTAATAT CCGTGTCGCT TCTCATTAAC TACGTTGAGA CCATGGTAGT CCCGGCCGTC CCCAAAATTC AACAGGACTT TGCGACCACA GAGACCCTGG TAGCCTGGGT AACCTCGGCA TTTACCATCG TGGGGGCAGT GGTCTCGCCG GTTTTCGGTA AGCTCGGAGA TCTTTACGGG AAGAGGAGAG TTTATCTCCT TTCCATGGTT TTCTATACCT TCGCGGTGGT AATGGCTGGT TTCTCCCCAA ACATTTACTT TCTGATTGCG GCTAGGGCTA TCCAGGGACT TGGTTTCGCC ATGTTTCCGC TGAGCTTGGC AATAATCACG GACATTCTCC CGCCTGAGAT GATAGCAACG GCACAGGGTA TCATAAGTGG AACCATGGGG ATAGGGACTG CGCTCGGACT AGTCGTGGGT GCCTTCATTG ATCAGGATCT AGGCTGGCAG TACGCCTTTC ACATCGCGTT CGTGATTTCC CTCATCCTCC TATTCGCTGC GGTGAAGTTA GTTCCAGAGA CGGGGGTTAG AAAGAAGGCG ATGATAGATT ACGTGGGGTT TGGGACCCTC ACTGCCGGTG TCACGCTGAT CCTGATTTAC CTAACCCAGG AGTCAAGTTG GGGATGGTTT TCCCCGCAGT CATTGTCGCT CTTGATCCCT GGTATGGTTT TCCTTGGCTT CTTTGGATGG TATGAGCAGA GAGTTAAGAA TCCCGTAATA GAGCTTAGGC TCCTTAAGAT CAGGAACGTT ATGGTGGCCA ACATCGCGGG CCTTATCTCC GGGATTATGA TCCTGGCCCT GTTTTACGGG ATAATATACT ACACCCAACT GCCTCACCCC TTTGGCCTCG GCCTCGACAT AATCTCAGCC GGGTTGACCC TGGCTCCCTC TACCCTTGTG ATGTTCGTGG TGGGTCCAAT TCTAGGAAGA ATGATAAATA GGGTTGGGCC GAAACCCATC ATTGCCGTGG GTTCCCTGGT CATGATGCTA GGTTTCTATC TCTTAATAGT AAATAGGGCA ACTCCCCTAG ACGTTACCAT GGACACTGTG GTGGGCATGC TAGGCCTTCT CTGCCTCATG ATTCCCATAG TGAACATGAT ATCCGTGTCT CTACCTCCCG AGGATAGGGG AGTGGGCATA GGGATGAACA CGTTGATTAG GAACATAGGT AGCGCTGTTG GGCCAGTGAT CACGACCTCC ATAATGTCGA GCTATCAGGG GGCCTTTGTT CTTCCCTTCG ACGGGACTTA CATGGTTGAG ATACTCCCAA GTTCAACGGC TTTCGACCTC ATATTTACCG TGGGAATAGG GATGGCAATC CTGAACCTGT TAATATCTTT GACCGTTAAG AACTATAGGT TCCAGGCTAA GCCAACAAAG GAGGTGGTAG TTGAGGCCAA GTGA
|
Protein sequence | MDRKARETLI ILISVSLLIN YVETMVVPAV PKIQQDFATT ETLVAWVTSA FTIVGAVVSP VFGKLGDLYG KRRVYLLSMV FYTFAVVMAG FSPNIYFLIA ARAIQGLGFA MFPLSLAIIT DILPPEMIAT AQGIISGTMG IGTALGLVVG AFIDQDLGWQ YAFHIAFVIS LILLFAAVKL VPETGVRKKA MIDYVGFGTL TAGVTLILIY LTQESSWGWF SPQSLSLLIP GMVFLGFFGW YEQRVKNPVI ELRLLKIRNV MVANIAGLIS GIMILALFYG IIYYTQLPHP FGLGLDIISA GLTLAPSTLV MFVVGPILGR MINRVGPKPI IAVGSLVMML GFYLLIVNRA TPLDVTMDTV VGMLGLLCLM IPIVNMISVS LPPEDRGVGI GMNTLIRNIG SAVGPVITTS IMSSYQGAFV LPFDGTYMVE ILPSSTAFDL IFTVGIGMAI LNLLISLTVK NYRFQAKPTK EVVVEAK
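The gene and protein sequences above can be related to each other with a small translation routine. The following is a hedged sketch rather than the database's own method: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares for codons after the start codon. Only the first three 10-base blocks of the gene sequence are used as input to keep the example short.

```python
from itertools import product

# Standard amino acid assignments in TCAG order; NCBI translation table 11
# uses the same assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGACAGGA AAGCTCGGGA GACCTTAATT"
print(translate(first_blocks))    # prints: MDRKARETLI
print(gc_fraction(first_blocks))
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field; the fraction for this 30-base fragment alone is not expected to equal the whole-gene figure.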