Gene Information |
Locus tag | Msed_1280 |
Symbol | |
ID | 5104692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1256285 |
End bp | 1257973 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507170 |
Product | major facilitator transporter |
Protein accession | YP_001191363 |
Protein GI | 146304047 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
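
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span includes the terminal stop codon; both assumptions agree with the reported 1689 bp and 562 aa but are not stated in the record. The function names `span_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for an unspliced CDS whose span includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # subtract the stop codon


# Values taken from the record for Msed_1280
gene_len = span_length(1256285, 1257973)
print(gene_len)                  # 1689
print(protein_length(gene_len))  # 562
```

Because the gene is reported as unspliced, the full span can be treated as one open reading frame; a spliced gene would need per-exon arithmetic instead.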
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTACA AATGGATAGC CTTGAGCAAT ACGACTTTGG GAGTCCTAAT GGCGACGATA AACGGCACGA TCACGATCAT CTCGCTTCCC GCCATATTCA GGGGGATAGG GATAAACCCA CTGGCCCCCT CATCTTTCCA GTACCTTCTC TGGATCCTCA TGGGCTACAA CGTTGTAACC GCGACGCTTC TCCTTTCCTT TGGAAGGTTG TCTGACATGT ACGGAAGGGT CAGGCTGTAC AACCTCGGCT TCCTGGTTTT CACCGTGGGC TCTATCCTTC TCTCCTTAAC CTTTGGAAGT GGGGATATGG CTGGCCTCGA GCTCGTGATA TTCAGGATAA TCCAGGGAAT TGGTGGAGCG TTCTTGATGG CGAACAGCGC GGCAATCCTC ACAGACGTTT TCCCCGTCAA TGAGAGGGGT AGGGCACTTG GAATAAACCA GGTAGCTGCG CTGGCAGGAT CGTTAATTGG GCTTATCCTG GGAGGAATTC TATCGGTCAT AAACTGGAGG TATGTCTTCC TCGTGAGCGT TCCAGTAGGG GTATTTGGAA CGGCGTGGAG TTACCTGAAG CTCAAGGAGA CCAGTGCAAG GAACAGGGAG GGGATAGATT GGGTCGGTAA TGCGGTGTTC GGCCTCGGGT TAATCCTTGT ACTAATTGCA ATGACCTACG CCCTTATGCC CTATGGCTCA GCGCAGACCG GGTGGGGTAA TCCCTTCGTC ATACTCTCCA TGGTTGCAGG GCTGGGACTT CTGGCCTCTT TTCCCTTTAT CGAGACTAGG GTCAAGTACC CCATGTTCAG AATGGAACTC TTCAGGAATA GGATGTTCGC TGCGGCCAAC TTTGCTGGAT TTCTGAGGTC CATAGGCTAC GGAGGTCTCA TGATCATGAT AGTGATCTTC CTCCAGGGGA TATGGCTCCC GCTTCACGGA TACTCCTACT CTGAAACTCC TTTCTGGGCA GGGATATACA CGATCCCCCT AATGGTTGGA TTCGTGAGTG CGGGACCAGT GAGCGGTTGG CTCTCAGACA GGTACGGATC GAGGGGGCTA GCTACTGCGG GAATGGTTCT GGTCGGCATA GGTTTCTTAG CCCTGACCGC GCTACCCTAC AACTTCAGCT ATCCAGTGTT TGGAGCAATC ATCTTCATGA TGGGAGTCGG AAATGGGATG TTCGCTTCTC CAAACACTTC CTCGATCATG AGTAGCGTCC CCGCAAAGCA CAGGGGAGCT GCGTCGGGGA TGAGGTCAAC GCTTCAGAAC ACTGGACAAA CGGTGAGCAT TGCCATATTC TTCACGATTG TGATCCTTTC CCTGAGTTCG TCGTTGGGGC CGTCATTGGC TCACGCCCTA GCTCAGGCAG GCGCTCCTCA GCTTTCGCCC TATGTACAAA AGGTTCCAGT GACCGGGGCC TTGTTCGCGG CCTTCCTTGG ATACGACCCC GTGAAGGCCC TACTCGGAAC CTTACCCGCT TCAGTTGCCT CCCAGATTCC CAGTAGTGCG ATCTCGATCA TGGAGCAGAG GACCTGGTTC CCAAGCGCAA TTGCCCCCAG CTTCATGTTA GCCTTGAGGG AGACGTTCTA CTTTGGGGCT GCGCTCTCCT TCATAGCGGC AGTAGCCTCA GCCCTGAGGG GAAAAGCTAA AATCCCGGAG GAGGTTGTCC AATATGATGC AGAGAAAACT AGACGTTGA
Protein sequence | MQYKWIALSN TTLGVLMATI NGTITIISLP AIFRGIGINP LAPSSFQYLL WILMGYNVVT ATLLLSFGRL SDMYGRVRLY NLGFLVFTVG SILLSLTFGS GDMAGLELVI FRIIQGIGGA FLMANSAAIL TDVFPVNERG RALGINQVAA LAGSLIGLIL GGILSVINWR YVFLVSVPVG VFGTAWSYLK LKETSARNRE GIDWVGNAVF GLGLILVLIA MTYALMPYGS AQTGWGNPFV ILSMVAGLGL LASFPFIETR VKYPMFRMEL FRNRMFAAAN FAGFLRSIGY GGLMIMIVIF LQGIWLPLHG YSYSETPFWA GIYTIPLMVG FVSAGPVSGW LSDRYGSRGL ATAGMVLVGI GFLALTALPY NFSYPVFGAI IFMMGVGNGM FASPNTSSIM SSVPAKHRGA ASGMRSTLQN TGQTVSIAIF FTIVILSLSS SLGPSLAHAL AQAGAPQLSP YVQKVPVTGA LFAAFLGYDP VKALLGTLPA SVASQIPSSA ISIMEQRTWF PSAIAPSFML ALRETFYFGA ALSFIAAVAS ALRGKAKIPE EVVQYDAEKT RR
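
The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The sketch below strips that display whitespace and runs two simple checks: GC percentage and whether a string looks like a complete coding sequence. The helper names are hypothetical. The stop codons listed are those of translation table 11; requiring an ATG start is a simplifying assumption, since table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_text):
    """Remove display whitespace (ten-character blocks) and upper-case the sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq):
    """True if length is a multiple of 3, starts with ATG, and ends with a stop codon.

    Simplification: alternative start codons allowed by table 11 are rejected here.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two displayed blocks of the gene sequence from the record
head = normalize("ATGCAGTACA AATGGATAGC")
print(head)       # ATGCAGTACAAATGGATAGC
print(len(head))  # 20
```

Passing the full Gene sequence field through `normalize` and then `gc_percent` gives a value that can be compared against the GC content field of the record.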