Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1772 |
Symbol | |
ID | 5104772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1713915 |
End bp | 1715384 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507670 |
Product | major facilitator transporter |
Protein accession | YP_001191851 |
Protein GI | 146304535 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0605301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGAGA AGAACGCATC TGAGATTATA GCCAGGCTGG ACAGGTTACC AGTCTGGTCT CTTCCCGCCA CCTTCATTGC TGTTTTAGGT ACAGGTTTCC TTTTCACGTT TTTTGATATC TTTGATATTA ACGTCTCATT TATCCAGACA GCTCTTACAA CCTTTGGCGT GAGTTCGCCG TCATCTCCCG AGATCCCCCA ACTACTTGGA CCCGTAGTCC TCTGGAATCT GGTGGGGTAT GTAATTGGGG CCCTTGCCTT AACCCCTCTT GCGGACAGAT ATGGGAGGAA GAGAATGCTC ATGATTACCA TGGCCATCAC AGGGCTGGGG TCACTGTATA ACGCGCTCTC CCCAGATTAC CTGAACTATT TGCTCGCTAG GACCATTACA GGGATAGGTG TGGGGGCAGA TCTGGCCATA GTTAACACGT ATATAAACGA GGTATCTCCC GTAAACGGAA GGGCGAAGTT TACATCCTTG GTTTTCCTTT TCGCTACCCT AGGAGCGTTT CTAGGCCTAT GGCTTGGTCT CCTGATCACT ACTCCGCCAG CACCTTTCCC CCTAGGGCTT CCATTTGCGT TGGGGACAAC CGGGATTTTC GCAACCGCTG GTTGGAGAAT CATGTACGGA ATAGGTTCCC TCCTCGCCCT GATAGGCCTC CTACTCAGGG TGGAGCTACC CGAGTCTCCG AGATGGCTTG CGTCTAAGGG GAGGATTATG GAGGCCTCAA AAATCGTGGA GAGGATGGAG GAGGCAGCTA GAAAGAAGAT AGGAGAACTG CCTCCAGTAC CGCAACACAT TGAGGTCAAG ACTATCACAG AGGTTTCATA CAAGGAAGCC TTGAGGACAA TTCTAGGGAA TAGGGTTTAC CTGAAGAGAT GGATCATTGT AATGTCCATG TGGTTCTTTG GGTATATCAC GGTGTATACA AACGCTGCAG GGCTAACCAC TATTCTCTCG TCCTTAGGGT ATCCATCATC CGAGGCAGGC ATGATAGCAT CGCTTGGGAT ACTTGGTTTC GTGGCTGTAC CGGTTCTTCT AATCCTGTTT GGGGACAGGC TAGAGAGAAA GGTGTGGGTT CCAATCTCCG CGGTAATCAT GTTGTTGGGC GGGGCAATCA TGGCTGAGGC TGGGCATAAC TTCTTCCTTG AGGTTTTGGG GGCATTTGTT CTCTTCTTCG GGAATAACCT GTGGATTCCC ATCTCCTACG CCTGGACTAC GGAAAACTTT CCCACAAGAG CTAGGGTTAC TGGCTTCGGA TTGGCCGATG GAATAGGGCA TATTGGAGGA GGAATAGGTG CCTTCCTCGT GGCATTACAG ATTGGTAACA TAGTCTCTCA TGGTGTAACA AGTAACACAC CCCTGGAGGT GTTCATGTTA ATGATATCGT TCCAGCTCTT ATCTGCGTTG ATATCCCTGG CAGGAATCAG GACAGCAAAG AGGCGTCTCG ACGAGATATC CCCTCACTGA
|
Protein sequence | MEEKNASEII ARLDRLPVWS LPATFIAVLG TGFLFTFFDI FDINVSFIQT ALTTFGVSSP SSPEIPQLLG PVVLWNLVGY VIGALALTPL ADRYGRKRML MITMAITGLG SLYNALSPDY LNYLLARTIT GIGVGADLAI VNTYINEVSP VNGRAKFTSL VFLFATLGAF LGLWLGLLIT TPPAPFPLGL PFALGTTGIF ATAGWRIMYG IGSLLALIGL LLRVELPESP RWLASKGRIM EASKIVERME EAARKKIGEL PPVPQHIEVK TITEVSYKEA LRTILGNRVY LKRWIIVMSM WFFGYITVYT NAAGLTTILS SLGYPSSEAG MIASLGILGF VAVPVLLILF GDRLERKVWV PISAVIMLLG GAIMAEAGHN FFLEVLGAFV LFFGNNLWIP ISYAWTTENF PTRARVTGFG LADGIGHIGG GIGAFLVALQ IGNIVSHGVT SNTPLEVFML MISFQLLSAL ISLAGIRTAK RRLDEISPH
|
| |