Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0407 |
Symbol | |
ID | 5105524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 359062 |
End bp | 360345 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640506313 |
Product | major facilitator transporter |
Protein accession | YP_001190508 |
Protein GI | 146303192 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAATC AACCCTTGAG GAGCATATCC TCGTCCAAGA GGATAGTTAG GTTGTTGCCC ATTCTTTTTT ACCTCTATCT AGTAAATTTT CTAGATAGAG TTAACATATC CTATGCAATT TCAGCGGGGA TGTTCAAGGA TTTGGGAGTT CCCAAGAGTA GCGCGGATCT TATAGCCTCC ATTGCCTCTA GTCTATTCTT CGTAGCTTAC GCTATCCCTC AGGTATTCTC CAACCTAGGC ATAAGCAGAA TTGGAGTTAG GAAGGTATTT GCGTTAGCCT TCACCGCATG GGGGATAATC ACAATTCTCA CAGGGTTTGT TCAGAACGTT CCTGAAGTCT ACCTGCTTAG GTTCCTCCTT GGACTCGCTG AAGCTCCTTT CTACGCGGGC GTAATCTTTT ACCTCAGCGT GTGGTTCCTG AGGGACGAAA GGGGATTCGC AAATAGCCTG TTCAATGCAG CCATCCCTGT CTCAGGGATA ATAGGAGGAC TCATAGCTGG TTCATTCTTC TCTGTGTTTG GAGATGATCC CGGATGGAGA TACCTATTCG TGGCTGAGGG TGTACTGGCT CTCGTGTCGG TGGCTGTTAT CTGGCTTTTA CTCACCGACT TTCCCAAGGA TGCAAAGTGG TTAAGTGAGG GGGAGAAAGA GGAACTTCTA AGCAAGATAA AGGTTGAAAA GGAGGAGAAG CAGAAGCTAG TTTCCCACGC CTCGTGGAGG AGGGCGCTAG GTGATAGGGA TGTACTTCTC CTGGTGCTGA TATATTTCCT TGGCGTAACG TCACTGTACG GTTACACCAT CTGGTTGCCG TCAATCATTA AGAGCTTCGG CGTCTCCGCC TCAACTGCAA GTTACCTCAC TGTTATACCA TATCTCGTTG CCTCAATCTC GCTCATCTTC ATCTCCAGGT ATTCAGACAG GGCCGGAGTT AGGAAATCTC TGGCCTTGGC AATATTTCTC GTTGCAGGGA TTGGGCTATC CTTAAGTGCA TTTACACTCA AGACGCCAGT GATTTCGTTC CTATTCTTCG TAATCTCTGC TATTGGAATT TACAGTTTCA TTCCAGTATT CTGGACTATA CCCACTGAAT TCCTAAGCGA GGAGTCAGCT GCAGCGTCCA TAGGACTAAT AAACGCACTG GGCAACTTGG GTGGGATCGC TGGTCCCATC ATAGTAGGCT TCCTAGAGAG CTTAACGGGG GTTTTCACGG CAGGTGTTTA CTCCCTCGCC CTCTTCGACA TCCTAGCAGG GCTTGTGGTA TTACTAGTCA GAAAGAGCAG ATGA
|
Protein sequence | MSNQPLRSIS SSKRIVRLLP ILFYLYLVNF LDRVNISYAI SAGMFKDLGV PKSSADLIAS IASSLFFVAY AIPQVFSNLG ISRIGVRKVF ALAFTAWGII TILTGFVQNV PEVYLLRFLL GLAEAPFYAG VIFYLSVWFL RDERGFANSL FNAAIPVSGI IGGLIAGSFF SVFGDDPGWR YLFVAEGVLA LVSVAVIWLL LTDFPKDAKW LSEGEKEELL SKIKVEKEEK QKLVSHASWR RALGDRDVLL LVLIYFLGVT SLYGYTIWLP SIIKSFGVSA STASYLTVIP YLVASISLIF ISRYSDRAGV RKSLALAIFL VAGIGLSLSA FTLKTPVISF LFFVISAIGI YSFIPVFWTI PTEFLSEESA AASIGLINAL GNLGGIAGPI IVGFLESLTG VFTAGVYSLA LFDILAGLVV LLVRKSR
|
| |