Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1222 |
Symbol | |
ID | 5103836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1196263 |
End bp | 1197663 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507114 |
Product | major facilitator transporter |
Protein accession | YP_001191307 |
Protein GI | 146303991 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGATA GCTCTAATTT AAGGCAGTCA CCGGAATGGT ACGTGGCCAG AGTGGACAGG CTCCCTACAT GGGGCCTATC ATATGCCCTA ATATGGGCCA TGGGATTTTC CTTTTTTATA ACCCTATATG ACGTCATTAA CGTGGGTTTC GCTCTTCCCT ACGTACCCTT CGTGGTAAGC GCAGCTCAGG CGTCATTAAT AGCGTCGTTG GGTCTATGGG GTTATGTTGT GGGCGCTCCA ATCTTTTCAT ACATTGCGGA TGTGGTGGGA AGGAGACCCA CCCTAGTCTT CACTGCCCTG TTGACGGCGT TAGGAAGTTT CGGGGATGCC CTATCCGTGA ATTATCCCAT GCTGGCCGTG TTCAGGTTCA TTACGGGGAT GGCCATAGGG GCTGACTTGG TATTGGTCAT GACCTACATG GCAGAGATGT CCCCTGCCGC TAAAAGAGGT CAATACACGA ACCTTGCCTT TATAGGGGGT TGGGCTGGAA TAGGAATTGG TCCCTTCATC GCTGCCCTCA TTGTTACCTC TATACCTTCC ATAGGATGGA GGATAGTTTT CGTTGTGGGA GGGATTCTCG CTGCCCTTGC TCTAGCCATA AGGGCATATG CTCCTGAGAC GGTGAGGTTC TTGGCCATGA AGGGAAAGTT CAATGAAGCT GATAGTTTAG TTGGGCACAT GGAGACAACG TCCATGAAGA GAGCTGGCGT AAATCAATTA CCTGAGCCCA ACATGAAGGT GTACAATGTA CCCAAGGAGA ACCCGTTCAA GGTTCTCGCT AAGCCGAAGT ATCTTAAGAG GCTCATAATC CTGTTCCTCC TGATGTTTAC CATATACTTT ATGGATTACC CATTCCTTGT GTTACCAGAA ACATGGGTGA AGGATGTGCT GGGATATAGC GGGTCCCTGT TCTCCTCCGC TGTCTTCTAT TTTGGGTTAG CCGGGATTGG GGCCTTCCTA GGAGCTATAC TTCTAAGGTT CATTATTGAC AGATTTGATA GGAGATACAT GACAGTGTTT GGAGTTGTTG TGTTCACAAT TGGTACTGCC ATAATGGCAA TTGGAGGAAT TGCAAGAAGC ATTCCGACAT TCTTCATTGG ATCGTTCATT GCCGAGCTCG TGGGAGTTGG ATGGTTCAAC GTTTATTATC TGCTATGCAG TGAGAACTTT CCAACAAGTG CAAGGGCAAC TGGTTACGCC ATTACAGACG GTATTGGACA CGCAGGAGGA GCAATTGGAT TGCTCACGGT TTTCCCGCTA ATCCCGATTC TAGGTAATAT AGGGGCTTGG ACGGTACCGT GGATACCTGC AATAGTGATG GCGATAGTTA CAGTGTTTAC TCTGCCAAAG ACCGTGAAGG TTAGACTAGA GGAGGTAAAT GAAGCTACGG ATCGGGTGTG A
|
Protein sequence | MADSSNLRQS PEWYVARVDR LPTWGLSYAL IWAMGFSFFI TLYDVINVGF ALPYVPFVVS AAQASLIASL GLWGYVVGAP IFSYIADVVG RRPTLVFTAL LTALGSFGDA LSVNYPMLAV FRFITGMAIG ADLVLVMTYM AEMSPAAKRG QYTNLAFIGG WAGIGIGPFI AALIVTSIPS IGWRIVFVVG GILAALALAI RAYAPETVRF LAMKGKFNEA DSLVGHMETT SMKRAGVNQL PEPNMKVYNV PKENPFKVLA KPKYLKRLII LFLLMFTIYF MDYPFLVLPE TWVKDVLGYS GSLFSSAVFY FGLAGIGAFL GAILLRFIID RFDRRYMTVF GVVVFTIGTA IMAIGGIARS IPTFFIGSFI AELVGVGWFN VYYLLCSENF PTSARATGYA ITDGIGHAGG AIGLLTVFPL IPILGNIGAW TVPWIPAIVM AIVTVFTLPK TVKVRLEEVN EATDRV
|
| |