Gene Msed_1289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1289 
Symbol 
ID5104701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1264979 
End bp1266418 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content48% 
IMG OID640507179 
Productextracellular solute-binding protein 
Protein accessionYP_001191372 
Protein GI146304056 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCCA ATATGAGGAA AGTGATACCC TTGTTACCGA GAGTTAATTA TAAAAAGGCA 
GTTAGGGGGA TAGCCAAATC TGTTGTAATA GGGATAGTAA TTGTAATAAT AGTTATAGGT
GCAGTGGCAG CTATAGAGTT AACAAGTCAT AGGACAACGC CCCCAAGTGT AACTAACACC
TCAACTACCA CAACGACTCC TCCCCCTGTT ACTGGGAATG TGACCATAAC CTATTTCGAC
GACCTCTCCC AGTCAGAGGC TTCAGTAATG CAGAACGTAA TCATTCCGCA ATTTGAAAAG
GAATATCCCA ACATTCATAT CAACTATGTG GATGAAGGTG CAACGGACAT CGTGAAGAGT
GTTGAGGAGC TTGAGCTAAG TGGTAACGTT GGACCTGTAA TAATTGGAGA GGATAACCTG
GTTATTGGGG AGCTTCTGAA CGGGAACTAT TTGATGAACC TCACGCCATA CACGAGCGAA
ATACTTCAGA ACGTCTCCCT CATACCCTCA ATGGTGAGCC TCGTAAAGTA TGAGCAAAGT
GTTTACCACG GGGAGTTCTT CATACCATTG AGGGGTAACA TACCCCTAGT GTGGTATAAT
GCAACGCTGT TCCAGGAAAT GGGAATAACT CCGCCTCAGA ACTGGTCTCA GCTAATGCAG
GTAGCCTCTG AGATAAAGGC TAAGACTGGT GTGGCCCCAA TCATGTTCCA GGGTCACGGC
GGAGCCAGCA CCTACACGGA GCTTTACCAA TGGATGGTAC AGGCTGGAGG AAATCCATTC
CTCTTCAACG ACTCCGGTGA TGTGTTAGCC TTCGAATATC TCTATAACCT CTCCAACTAC
TTCACTCCGG GTTACGTCCA TGGGTACTGG GGTAGCTATA AGGGACTGTT AAGTGGAGAG
TATTACATGA TTGACTATCA ATGGCCCTAC ATCTATAGCA CCATGGCTAG TGAAGGCGTA
AACATGAGTC ACATAGGCTT CTATCCGGGC CCTGTGGGAC CTGCTAACGG AGACCATCTG
GTGGGCGGAG ATGTCCTGGC CATACCTAAG GGAGCAACCG ACATTCCTGC ACTAATAGAT
TTCGCGAGGT TCCTCCTATC GACGCAGGTT CAAAGGGACT TTATCATATA CTTGTCCTGG
CCAGCAGTAA ATCAGCAGGC CTACAACAAC TTGCCAAGCA ATATCAGCGC ATTGTACAAG
GCAGAGGAGG AGGCCATGAG CAACGCGTTC TTCAGGGAAC CCGTTCCATG GATAACTGTG
TGGGGACAGA TCGCTGACAA GGTATTTGAC ACGATTATTG TAGATCATGC ACCCTACTCC
CAGATACCCA GCATCCTAGG CCAGGCGAAT CAGGAGATGT ATAACTACCT AGTCCAGAAC
TATAACACCA CTGTGGCTCA GCAATACGAG CAGGGAGTCT ACGGTCCATT GTACGGGTGA
 
Protein sequence
MKSNMRKVIP LLPRVNYKKA VRGIAKSVVI GIVIVIIVIG AVAAIELTSH RTTPPSVTNT 
STTTTTPPPV TGNVTITYFD DLSQSEASVM QNVIIPQFEK EYPNIHINYV DEGATDIVKS
VEELELSGNV GPVIIGEDNL VIGELLNGNY LMNLTPYTSE ILQNVSLIPS MVSLVKYEQS
VYHGEFFIPL RGNIPLVWYN ATLFQEMGIT PPQNWSQLMQ VASEIKAKTG VAPIMFQGHG
GASTYTELYQ WMVQAGGNPF LFNDSGDVLA FEYLYNLSNY FTPGYVHGYW GSYKGLLSGE
YYMIDYQWPY IYSTMASEGV NMSHIGFYPG PVGPANGDHL VGGDVLAIPK GATDIPALID
FARFLLSTQV QRDFIIYLSW PAVNQQAYNN LPSNISALYK AEEEAMSNAF FREPVPWITV
WGQIADKVFD TIIVDHAPYS QIPSILGQAN QEMYNYLVQN YNTTVAQQYE QGVYGPLYG