Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1289 |
Symbol | |
ID | 5104701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1264979 |
End bp | 1266418 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640507179 |
Product | extracellular solute-binding protein |
Protein accession | YP_001191372 |
Protein GI | 146304056 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCCA ATATGAGGAA AGTGATACCC TTGTTACCGA GAGTTAATTA TAAAAAGGCA GTTAGGGGGA TAGCCAAATC TGTTGTAATA GGGATAGTAA TTGTAATAAT AGTTATAGGT GCAGTGGCAG CTATAGAGTT AACAAGTCAT AGGACAACGC CCCCAAGTGT AACTAACACC TCAACTACCA CAACGACTCC TCCCCCTGTT ACTGGGAATG TGACCATAAC CTATTTCGAC GACCTCTCCC AGTCAGAGGC TTCAGTAATG CAGAACGTAA TCATTCCGCA ATTTGAAAAG GAATATCCCA ACATTCATAT CAACTATGTG GATGAAGGTG CAACGGACAT CGTGAAGAGT GTTGAGGAGC TTGAGCTAAG TGGTAACGTT GGACCTGTAA TAATTGGAGA GGATAACCTG GTTATTGGGG AGCTTCTGAA CGGGAACTAT TTGATGAACC TCACGCCATA CACGAGCGAA ATACTTCAGA ACGTCTCCCT CATACCCTCA ATGGTGAGCC TCGTAAAGTA TGAGCAAAGT GTTTACCACG GGGAGTTCTT CATACCATTG AGGGGTAACA TACCCCTAGT GTGGTATAAT GCAACGCTGT TCCAGGAAAT GGGAATAACT CCGCCTCAGA ACTGGTCTCA GCTAATGCAG GTAGCCTCTG AGATAAAGGC TAAGACTGGT GTGGCCCCAA TCATGTTCCA GGGTCACGGC GGAGCCAGCA CCTACACGGA GCTTTACCAA TGGATGGTAC AGGCTGGAGG AAATCCATTC CTCTTCAACG ACTCCGGTGA TGTGTTAGCC TTCGAATATC TCTATAACCT CTCCAACTAC TTCACTCCGG GTTACGTCCA TGGGTACTGG GGTAGCTATA AGGGACTGTT AAGTGGAGAG TATTACATGA TTGACTATCA ATGGCCCTAC ATCTATAGCA CCATGGCTAG TGAAGGCGTA AACATGAGTC ACATAGGCTT CTATCCGGGC CCTGTGGGAC CTGCTAACGG AGACCATCTG GTGGGCGGAG ATGTCCTGGC CATACCTAAG GGAGCAACCG ACATTCCTGC ACTAATAGAT TTCGCGAGGT TCCTCCTATC GACGCAGGTT CAAAGGGACT TTATCATATA CTTGTCCTGG CCAGCAGTAA ATCAGCAGGC CTACAACAAC TTGCCAAGCA ATATCAGCGC ATTGTACAAG GCAGAGGAGG AGGCCATGAG CAACGCGTTC TTCAGGGAAC CCGTTCCATG GATAACTGTG TGGGGACAGA TCGCTGACAA GGTATTTGAC ACGATTATTG TAGATCATGC ACCCTACTCC CAGATACCCA GCATCCTAGG CCAGGCGAAT CAGGAGATGT ATAACTACCT AGTCCAGAAC TATAACACCA CTGTGGCTCA GCAATACGAG CAGGGAGTCT ACGGTCCATT GTACGGGTGA
|
Protein sequence | MKSNMRKVIP LLPRVNYKKA VRGIAKSVVI GIVIVIIVIG AVAAIELTSH RTTPPSVTNT STTTTTPPPV TGNVTITYFD DLSQSEASVM QNVIIPQFEK EYPNIHINYV DEGATDIVKS VEELELSGNV GPVIIGEDNL VIGELLNGNY LMNLTPYTSE ILQNVSLIPS MVSLVKYEQS VYHGEFFIPL RGNIPLVWYN ATLFQEMGIT PPQNWSQLMQ VASEIKAKTG VAPIMFQGHG GASTYTELYQ WMVQAGGNPF LFNDSGDVLA FEYLYNLSNY FTPGYVHGYW GSYKGLLSGE YYMIDYQWPY IYSTMASEGV NMSHIGFYPG PVGPANGDHL VGGDVLAIPK GATDIPALID FARFLLSTQV QRDFIIYLSW PAVNQQAYNN LPSNISALYK AEEEAMSNAF FREPVPWITV WGQIADKVFD TIIVDHAPYS QIPSILGQAN QEMYNYLVQN YNTTVAQQYE QGVYGPLYG
|
| |