Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0993 |
Symbol | |
ID | 5104542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 916253 |
End bp | 917746 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506892 |
Product | major facilitator transporter |
Protein accession | YP_001191085 |
Protein GI | 146303769 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000427964 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0333525 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAATT CTCCAATTAT GGCAGAAAAG TCTGTGAAAG CCGGGGAAAT AATCGCCCGG ATGGACAGGT TACCCATCTG GTCACTCTCG TACATTTTCA TTGGAATACT GGGGATGGGA TTTCTCTTCA CATTTTTTGA TATTTTTGAC ATTAACGTCT CATTTATCCA GACATCTCTC ACCATATTTC ACGTTAGTAG TCCATCCTCG CCAGAGATTG GGGTTCTACT GGGACCGGCA GTTCTCCTTA ACCTGGTTGG ATATATTGTG GGCTCCCTAC TCCTTTCCCC TCTCTCGGAT CGGATAGGCA GGAGGAACAT GTTAATGATA ACCATGGCCA TCACGGGGCT CGGGAGTCTG TACAACGCAC TTGTTAACGA TTATTCTAAC TTCCTCCTGG CTAGGACTAT CACAGGAATA GGAGTCGGAG CAGACCTAGC AGTGGTCAAC ACTTACATAG GCGAAGTTGC TCCACTAAAT GGGAGGGCAA AGTACACCAG TTTCGTGTTC CTGTTCTCAA CTCTTGGGGC TGGGTTGGGT CTCTGGTTAG GACTCCTGTT AACAACTCCA CCTGCTCCAT TTCCCCTAGG TTTACCCTTC GCTCTAGGAG GATCTGGCTT CCTTGCCGTA AACGGGTGGA GGGTGATGTA CGGAATTGGC GCTCTCCTAG CCTTGATAGG CTTGCTCCTC AGGTTCAATC TTCCTGAATC TCCTAGATGG TTGATATCCC GCGGTAGGAT AGCTGACGCC GAGGCCGTGG TAAAACAAAT GGAAGAGAGA GCGTCGAGGA AGCTAAGGTC TCTTCCTCCA CTTCCCGCGG TAATACCCCC TTACGTTGTG GAGAGATTGT CCTATCTCGA CTCGTTAAAG GCAGTGATAC TGGATAGGAG GTATGCGAGA AGGCTCGCGG TCCTAATCCC GATGTGGTTC TTTGGCTACA TGACAGTTTA CGTGTTAGCA GCAGGATTAA CCACTATCCT GGCGTCCCTA GGATATCCTC CGCCCGAGGC CGGTATCATT GCCTCCTTTG GGGATATAGG GTTCATCCTA TGCGCAGTAA CCATCATGTT GGTTGGGGAT AAGATGGAGA GGAGCAGGTG GACTGCAATC TCAGTTCTCT TTACCATAGT GGGGGGCGTG GTGATAGCGT TAGCGAAGAC CAACTTACCG TTATCTTTCC TGGGATCGTC AATACTGTTC TACGGCTTCA ATCTATGGGT TCCAGTGTCA TATGCGTGGA GCGCTGAGAG TTTTCCAACA AGGGCTAGGG CGACAGGTTT CGCCCTCACC GACGGGCTGG GGCATATAGG AGGAGGAGTA GGGACAGTTG TTGTAGCGTC TTTTGTGGCG TCCCTAGTGT CCAGTGGTGT CACCACGGGA TTGGCAATTG AGGTTTTCCT GCTCATAGCG TCCTTCCAGA TAATCTCAAC AGTGATTGCA GTATCCCTAG GACATAAAAC AGCTAATAAA AGGTTGGACG AAATATCTCC GTGA
|
Protein sequence | MNNSPIMAEK SVKAGEIIAR MDRLPIWSLS YIFIGILGMG FLFTFFDIFD INVSFIQTSL TIFHVSSPSS PEIGVLLGPA VLLNLVGYIV GSLLLSPLSD RIGRRNMLMI TMAITGLGSL YNALVNDYSN FLLARTITGI GVGADLAVVN TYIGEVAPLN GRAKYTSFVF LFSTLGAGLG LWLGLLLTTP PAPFPLGLPF ALGGSGFLAV NGWRVMYGIG ALLALIGLLL RFNLPESPRW LISRGRIADA EAVVKQMEER ASRKLRSLPP LPAVIPPYVV ERLSYLDSLK AVILDRRYAR RLAVLIPMWF FGYMTVYVLA AGLTTILASL GYPPPEAGII ASFGDIGFIL CAVTIMLVGD KMERSRWTAI SVLFTIVGGV VIALAKTNLP LSFLGSSILF YGFNLWVPVS YAWSAESFPT RARATGFALT DGLGHIGGGV GTVVVASFVA SLVSSGVTTG LAIEVFLLIA SFQIISTVIA VSLGHKTANK RLDEISP
|
| |