Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1094 |
Symbol | |
ID | 5103568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1020145 |
End bp | 1021428 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506989 |
Product | major facilitator transporter |
Protein accession | YP_001191182 |
Protein GI | 146303866 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCAGTC CTCCTGGAAA ACTAAAATCC TTCTTCATCT CCTCCGCAGG GTTCCTCCTT GACGGATATG ATCTCTCGGT GATATCCTTC GCGCTTCTGT TCCTTCCGAA GGAACTTCAT CTTACCCCAT TACAGGAGGG ACTCGTTAGC TCTGCCTCGC TCATGGGAAT GATACTCGGT TCAGTCCTAC TCGGGTTACT CTCCGACAAG ATGGGGAGGA AAAGGCTCAT GGGCTTGGAT CTAGTAATCT TCACGGTCTT CGCCATAACC TCGGCCCTGT CCCAGAACTT CCTGGAAATG TTCCTATCTA GGCTACTCCT GGGGGTTGGC ATAGGTGGAG ATTATCCCCT AAGTAGTTCC CTCATGGCTG AGTACTCCCC CTCAAGGTCG AGGGGAAGGT ACCTCGTGGG GGCAGTGTCC ATGTATTGGG TAGGAACACT GCTCTCTGCT GTCGTGAACC TAGTCTTCCT TCCCACGGGT GACTATTTCT GGAGGTATTC CTTCGCGTTT GGAGCCCTTC TATCCATCCC AGTCATAGTA GCCAGGTTCT CTCTCCCCGA GTCACCCAGA TGGTTAATAA GCAAGGGTAA ACTTAAGGGA GATGGAATCC CAACCCAAGA GGAGGAAAAC AAGGGAGTTA CAGGTTTCCT TGACCTGTTC AGGATGAGAT TACTTCCATA CCTCCTCCTA GTCTCAGCAA TCTGGTTCTT GTTTGACGTT GCGTCATACG GTATAGGACT TTACTACCCA GCAATATTTA GGGAGTTCTC TTTACCCTCC AACTACGAGG TGATTTACGC CACCATGATA ATCGCGGTGG GAGCAATCCT CGGCTATATC CTGGCGGAGG TCGCCATAGA TTCGCTGGGA AGGAGAGCTG TTCTTCTATC CGGGCTTGGC GTAATGGCAC TTCTCCTGGC TGTGGGAGGT GTCCTGAGGC TTACCGGGGT TGTTTTGGTG CCATACTTTG CGGTCTTCGT GGCAATGGAG CAGTGGGCTG GCGCGGTCAC ACTCTTTTAC CCCGCTGAGC TCTTCCCTAC CCCAGTTAGG TCATCCGCTC AAGGATTTGC GACAGCAGTG AGCAGGATAG GAGCTGTCCT GGGAGTCGTG TTTTTCCCTA GCATGGTGAA GGTCCTTGGT CTCTCTAACT CCCTGATTCT GTTCTCTGTA ACGTCGGCTA TCGCATTCAT ATTGGCACTC CTGCTGAGGG AAACTAAGAG AAAGGAACTA GAGGAGATCT CCCTTGGGCT AAAGGAGGTG AAAGGGAGAA ATCCGAGTAC ATGA
|
Protein sequence | MSSPPGKLKS FFISSAGFLL DGYDLSVISF ALLFLPKELH LTPLQEGLVS SASLMGMILG SVLLGLLSDK MGRKRLMGLD LVIFTVFAIT SALSQNFLEM FLSRLLLGVG IGGDYPLSSS LMAEYSPSRS RGRYLVGAVS MYWVGTLLSA VVNLVFLPTG DYFWRYSFAF GALLSIPVIV ARFSLPESPR WLISKGKLKG DGIPTQEEEN KGVTGFLDLF RMRLLPYLLL VSAIWFLFDV ASYGIGLYYP AIFREFSLPS NYEVIYATMI IAVGAILGYI LAEVAIDSLG RRAVLLSGLG VMALLLAVGG VLRLTGVVLV PYFAVFVAME QWAGAVTLFY PAELFPTPVR SSAQGFATAV SRIGAVLGVV FFPSMVKVLG LSNSLILFSV TSAIAFILAL LLRETKRKEL EEISLGLKEV KGRNPST
|
| |