Gene Msed_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0852 
Symbol 
ID5105212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp786784 
End bp787926 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content51% 
IMG OID640506757 
Productmajor facilitator transporter 
Protein accessionYP_001190950 
Protein GI146303634 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.498161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCGAT TTCAGGGCAT GGACAGGAGA TTTTTCCTGA CCACTGGCCT AGTGGCAATG 
CTATTCAATT CCGTGTATCA GTACTCATGG AACGCCTTCT TTCCGTTGCT CGTGAAGGGT
TTCCATGCAT CTGCAAGCTC CGTAGAGGTC GGCTTTGCGC TGTTCGTGAT TTCCTCGACG
TCATTTCAAG TGTTGAGTGG AAGGATTTCA GACCTCAGGG GACCTAGAGT GATGGGTTCC
CTTGGTGTTC TTGCCTTCTC CTTCGGCCTA ATCCTGAGTT CGTTGATACC AAGTCTCCCC
CTCTTTTACG CCACCTGGAC CCTAGGTAGT ATCGGTGAAG GAGTTCTTTA CGGGATTTCC
CTGAACCTAG CCATCAAGTG GTACGCGGAA AGAAGGGGGT TGGCCTCAGG TCTAGTTTCC
ATGGGCTTCG GCTTAGGGGG AGCCCTCGTC AATCCCCTTA TTGAGCTCTC CAACAATTTC
AGAAGTTCCA TGTTAGCAAT TGGCGTCGCC TCTTTGATCC TTCTCCCGCT CTTTCTTCTC
TCAAGATACC CAAGTGACGT GAGGGGTTCA TCTCCAGGGG AAACTTTGAG GGAGACCAGG
TTTTGGCTCA TCTACGTATC CTTCGTTCTA GCCTCTCTTC CCTTACTTGT TGCGTCTTCC
TCTCTAGGAG AACTAGGTCA GTACCTCAAC AGCGTGGAAT ACACGATTGC CACCATATAC
TTTCCAATAG CCAGCGGAGT GGGAAGGCCT ATCATGGGGT ACCTCACCGA CCGTCTCGGG
AGATTAAGGG GAATAGACTA CATGACCGCG GGTATCCTGC TAGGAACATC CCTCGTGGTG
ATTGGGTTCC TGGGAAGAAA CCTGCTCCTA CTGGCGGGGA TAGCCCTGGT GGGAATAATG
GGAGGAACTA CTTACCCCCT TTACTCAGCG CTGGTGGGAG ACTTGTACGG GCCTAGATAC
TCCACCGCAA ACACTTCCCT CCTCTACACT GGTAAGATAG TCTCAGGGGT TCTAGGAAGC
CTCATCTTTT CCTCGCTGTT TCAGTACAGT AATGTCCTTG GATTGGGTTT TATCATGGGG
GCGACAGCCC TGTCTACGGT TTCGCTCGCT CTACTTCATA GAATCACAAG AGGAGCAAGC
TAA
 
Protein sequence
MSRFQGMDRR FFLTTGLVAM LFNSVYQYSW NAFFPLLVKG FHASASSVEV GFALFVISST 
SFQVLSGRIS DLRGPRVMGS LGVLAFSFGL ILSSLIPSLP LFYATWTLGS IGEGVLYGIS
LNLAIKWYAE RRGLASGLVS MGFGLGGALV NPLIELSNNF RSSMLAIGVA SLILLPLFLL
SRYPSDVRGS SPGETLRETR FWLIYVSFVL ASLPLLVASS SLGELGQYLN SVEYTIATIY
FPIASGVGRP IMGYLTDRLG RLRGIDYMTA GILLGTSLVV IGFLGRNLLL LAGIALVGIM
GGTTYPLYSA LVGDLYGPRY STANTSLLYT GKIVSGVLGS LIFSSLFQYS NVLGLGFIMG
ATALSTVSLA LLHRITRGAS