Gene Information |
Locus tag | Msed_1067 |
Symbol | |
ID | 5104448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 996981 |
End bp | 998171 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506962 |
Product | major facilitator transporter |
Protein accession | YP_001191155 |
Protein GI | 146303839 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
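The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(996981, 998171)
print(length_bp, protein_length(length_bp))  # prints: 1191 396
```

Under these assumptions the computed values agree with the Gene Length (1191 bp) and Protein Length (396 aa) fields.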
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGGG ACGTCTTTTT ATTAATGATC TCGAGGGTTG CAAGGAGCTT CGCAGCGGGT CTTCTTGCTG TCATTGTGGG GTTGTATTAC GTTCATGTAC TTCACTTGTC GCTTCTCCAG GTAGGAATAC TGTTTGGCGT TGGGGCGTTC GTCACCCCCC TCCTCACGTT GATCCTGGGA TTCTACTCCG ATAGGTATGG AAGAAAGAAA ATATTGCTCA TTACCCTCTC TTTCCTTCCC TTGTCCGTGT TGATTCTGCT CCTCACATCC AACTTCTTTC TACTCATGTT GTCCTCAGCC CTTGGCGGTT TCGGGATAGC GGGAGGTTTA GTCGGAGGAG GTGTAGGTGC AAGCGTCGCC CCCATGCAAA CTGCCCTCCT CACCGAGAAG GTGAAGCCTG AGGAGAGAAC GAAGATTTTT TCGTGGTTCA CAATAATCTC CAGTTACGCA GGTTCCGCTG GAGCACTCCT GGCGAACGGT TCGAGTTACG AGGAGTTATT TATCATTTCT CTTCTAGTGT CGGGTCTCTC AGCCCTCGCG GTTATCCCTG TGAAGGAGAA CTTCAAGCCG AGAAGGAAGG AGGAACCTAA GGCCGCAAGT AAAGATAACG ATGTTATTAA GAAGTTTACC CTCACTGGGA TTCTGAACGG GGTATCACAG GGGCTCATAG TCCCATTCAT ACCCATCATC TTTAGTGAGG TATACGGTCT ATCACAGGGC TTCATTGGAG ATCTCGTATC CCTGGGAGGG GTTATATCTG CGAGCGCCAT GCTTGCCACT CCCGCGTTAA CGGAGAGGCT GGGATTCGTG AGATTAATCA TAATCACAAG GACAATATCT GCGGTCATGG TACTTCTTTT CCCCTTCCTT GGAGTGTGGT ACCTTGCCTC CATAGACTAT GTTCTGTTTA CCCCACTTAG GGTTATAGCA CTACCAGCCC AGCAGGCATT AATGATGAAC CTGGTGGGAG AGGGTAGGAG GGCGACTGCC ACAGGAGCTA ATCAAGCTGG TAGACTCATA CCCGCAGCTG CGTCAACTTC TCTCTCAGGC TACATAATGC ATGCTCTCTC GTTTGCCATA CCCTTCGAGG CGGCCTTTGT TGCCACAATT GTGAACTCGT TTCTCTATTT TAAGTTCTTC AGGGGTGTAG ATAAGGCGGT AGCCGGGAAA GTCGTACTGT CGGAGGGATA G
|
Protein sequence | MQRDVFLLMI SRVARSFAAG LLAVIVGLYY VHVLHLSLLQ VGILFGVGAF VTPLLTLILG FYSDRYGRKK ILLITLSFLP LSVLILLLTS NFFLLMLSSA LGGFGIAGGL VGGGVGASVA PMQTALLTEK VKPEERTKIF SWFTIISSYA GSAGALLANG SSYEELFIIS LLVSGLSALA VIPVKENFKP RRKEEPKAAS KDNDVIKKFT LTGILNGVSQ GLIVPFIPII FSEVYGLSQG FIGDLVSLGG VISASAMLAT PALTERLGFV RLIIITRTIS AVMVLLFPFL GVWYLASIDY VLFTPLRVIA LPAQQALMMN LVGEGRRATA TGANQAGRLI PAAASTSLSG YIMHALSFAI PFEAAFVATI VNSFLYFKFF RGVDKAVAGK VVLSEG
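The gene and protein sequences above can be related by translating the nucleotide sequence. The following is a minimal Python sketch, assuming the space-separated 10-base blocks are purely a display convention and that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGCAAAGGG ACGTCTTTTT ATTAATGATC"
print(translate(fragment))  # prints: MQRDVFLLMI
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare with the GC content field.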
|
| |