Gene Msed_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1081 
Symbol 
ID5104462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1008839 
End bp1009999 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content50% 
IMG OID640506976 
Productsodium/hydrogen exchanger 
Protein accessionYP_001191169 
Protein GI146303853 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0475] Kef-type K+ transport systems, membrane components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATAA CCCTAGTTCT CCTGGAGATC TCAATACTCA TCTTCTTTGC TGAGCTCATG 
AGAACATCTC TTCGAAAGTT CGTTCCCTCC ATTGTTGGCG AGATCATAGC GGGAATGGTG
TTAAGTCCCT TCGCTGTGGG AGGTCTACTG GATCACATCC TCAACCTAGA TCTGTTCTCC
CTGAACCAAT ATCTCCTTTT CCTATCCGAG TTTTCCATGA TCCTCCTGAT CTTCTCCTCC
GGACTGGAGC ATGGAGTCTC AGCGATCAGG TCAGCGGGAA CGTTTGGATT CCTGGGGGCA
ACGGCAGGTG CACTTTTTCC CGCCCTAGTG GGGATACTGG TCTTTCAGGG AATAGGGTTT
GACACGTCGC TCATCCTGGG AACTGCCATA GGTGCCACAA GCTTAGCCTC TGCTGGTTCT
ATCATTTCTG AACTTAGGTT GAAAGGCAAA GGGGTTGATC TACTCATGTC CATGGCGTCA
TCTGATGACG TTGTGGACCT AATCTTGCTC TCAGTGGTGC TGGGAACCCT GGCTGGGGCA
ACATCTGTCA AGTCCATAGC GACGCTGGTG ATCTATTATA TAGTCGCCTG GATTGTGATA
TTCGTGGTTG CCGTGAGGGT TATTCCCATG ATCGCTAACA GGTTGGACGA GGTATACATT
GAGGAGTTCT CCATGTTAGT TATATTTGGG TTAACGGCCA TCATGACTGC CCTGAACTTC
TCCCCCGTAA TTTCAGCATT CATTGCAGGA GTGGCCATGG CTGAGAGCGT GAAAAAGGAG
AGGGTTAGGC AAATAATCGA CGTTCTTCTG GCGGTGTTCG GAAGTGCCTT CTTCGTAGTA
GTGGGACTCC AGGTTAATCT GTCAGGTCTC ACCAATTTCT GGTTAATGGC AGTGGAGCTC
ACTGTGATTG CTGTGATTTT CAAGATATTG GGAGTTTTAC CCTTTGCCTA CCTGGGATTG
AGGAAGTGGA GAAGCGCGTT AGCCGTCTCC CTTGCCATGG TTCCGAGGGG TGAGACTGGA
CTGGTTGTGG GATCCATAGG ACTAAGCTAT AACGCGCTCA ATCAGAACGA GTTCGGTGCC
CTAGTTTTCA TGGCAATCCT AACCACTGTA ATTGGCGCCT CATTTTTCAA GGGTATGGCC
CATTGGTTGA GGGAGGAATA G
 
Protein sequence
MDITLVLLEI SILIFFAELM RTSLRKFVPS IVGEIIAGMV LSPFAVGGLL DHILNLDLFS 
LNQYLLFLSE FSMILLIFSS GLEHGVSAIR SAGTFGFLGA TAGALFPALV GILVFQGIGF
DTSLILGTAI GATSLASAGS IISELRLKGK GVDLLMSMAS SDDVVDLILL SVVLGTLAGA
TSVKSIATLV IYYIVAWIVI FVVAVRVIPM IANRLDEVYI EEFSMLVIFG LTAIMTALNF
SPVISAFIAG VAMAESVKKE RVRQIIDVLL AVFGSAFFVV VGLQVNLSGL TNFWLMAVEL
TVIAVIFKIL GVLPFAYLGL RKWRSALAVS LAMVPRGETG LVVGSIGLSY NALNQNEFGA
LVFMAILTTV IGASFFKGMA HWLREE