Gene Msed_1496 details


Gene Information

Locus tag: Msed_1496
Symbol:
ID: 5104743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1461183
End bp: 1462775
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 47%
IMG OID: 640507384
Product: Na+/solute symporter
Protein accession: YP_001191577
Protein GI: 146304261
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:
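
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1461183, 1462775)
print(length_bp, protein_length(length_bp))  # 1593 530
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.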


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAGAGT TGGCATTCGG GAATATAGAC GGAGTTACCC TTGGTGTTTT CGTGGTTTTA 
TTTGCAATTT TTGCCTTTTT GGGTTTTTGG GCATCTAGAT GGAGGAGAGG AGATCTCTCC
AAAATTGACG AATGGGGGCT TGGAGGAAGA AGACTAGGCT GGCTGTTAGT GTGGTTCTTG
ATGGGGGCAG ATCTGTTCAC TGCCTACACC TTCATAGCCG TACCCTCAAG TATGCTGGCT
GTAGGTTCGT TATACTTCTT CGCTGTGCCT TACGTGGCAT GGGGGTTCGG AGTTGCCCTT
TTAACCATGC CTAGATTATG GACAGTTTCT AGAAACAAGG GATATGTGAC GGCCTCCGAC
TTCGTTAAGG ACAGATTCGA CAGCAGATGG CTCGCCATTG CGGTTGCACT AACTGGAATA
GTTGCTGAGC TGCCATACAT TGCACTACAG ATAGTGGGTA TGCAGGCAGT GCTTGCAGCT
ATGCTTGCTG GCTTAACTGG AGTAGTTTCT AAGACAGTGT CTGACATTGC CCTGATAATT
GCGTTCGCCA TCTTGGCCTC GTTTACCTTT ACCAGCGGTC TAAGAGGGGC AGCAATTACT
GGAGTCTTTA AGGATATACT TATCTGGATT ACAGTGCTTG CTGTGATCAT TATAGTCCCG
CTAAGTTACG GAGGATTCGC AAGTGCATTT CATAATGCAG CTATCCAATC TGCAACGGTA
AATCAAGCTC TGAATCACGC TAAGGGTCCC ATAAATTACG GCGCACTATC TCCTAAACTC
ATCCCTGCGT ACTTCTCACT ATCCCTTGGA TCAGCACTTG CACTTTACCT ATACCCGCAT
GCCATAAATG GTTCACTTAG CTCAGAGGAC AAGGGGAAAC TGAAGCTAGG TACTTCACTA
TTACCAATTT ATGGTATTGG TTTGGCTTTA CTAGCACTCT TCGGGATCTT GGTATATGCA
GTTCCAAATG CCCTTAGTGC AGTAATTAAG TTAGGTGCTG GCACTTTCGT GGTTCCTTCA
CTCATTGCAT ATACGATGCC CGACTGGTTT GTGGGGCTGG CTTACCTAGC AATTTTCATA
GGAGGACTCG TCCCAGCAGC AATCATGGCC ATAGGAGTAG CTAACCTTCT TGTAAGGAAC
GTGATCAAGG AGTTCAAGTC CCTAGAGCCT AAGACTGAGG CTACACTAGC TAAGGTTATC
TCCACAGTCT TCAAGTTCGT GGCCTTGGGG TTCGTGTTTG CGGTCCCCGC TACCTACGCA
ATCCAGCTCC AGTTACTAGG CGGAATCCTG ATAACGCAAA CTTTACCCTC GGTGTTCCTA
GGACTCTACA CCAGGAATCT TAATGGAAAG GCTACCCTTG TAGGGTGGGC AGCTGGAATT
CTGTCAGCCT TAGCCCTCGT TATTGAGGCT AACGCAAAGT TCGGAGTAAT AAAGACTAGC
CTTTACACCA CGCCCCTAGG TCCACTCTAT ATAGCGATCC TTGCACTACT GATCAACCTA
GCGGTGACGT TGATAGGATC AGGAATAGCA TATGGAATGG GATGGAGACC TTCACAGAAG
ATAAAGGAAG AGGAGATCAC TAAGGAGATG TAA
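
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before the sequence is used. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field; `clean_sequence` and `gc_percent` are hypothetical helper names, and the exact method the database used for its reported value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence: six blocks of 10 bases.
first_row = "GTGAGAGAGT TGGCATTCGG GAATATAGAC GGAGTTACCC TTGGTGTTTT CGTGGTTTTA"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full multi-line sequence to `gc_percent` works the same way, since line breaks are stripped along with the spaces.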
 
Protein sequence
MRELAFGNID GVTLGVFVVL FAIFAFLGFW ASRWRRGDLS KIDEWGLGGR RLGWLLVWFL 
MGADLFTAYT FIAVPSSMLA VGSLYFFAVP YVAWGFGVAL LTMPRLWTVS RNKGYVTASD
FVKDRFDSRW LAIAVALTGI VAELPYIALQ IVGMQAVLAA MLAGLTGVVS KTVSDIALII
AFAILASFTF TSGLRGAAIT GVFKDILIWI TVLAVIIIVP LSYGGFASAF HNAAIQSATV
NQALNHAKGP INYGALSPKL IPAYFSLSLG SALALYLYPH AINGSLSSED KGKLKLGTSL
LPIYGIGLAL LALFGILVYA VPNALSAVIK LGAGTFVVPS LIAYTMPDWF VGLAYLAIFI
GGLVPAAIMA IGVANLLVRN VIKEFKSLEP KTEATLAKVI STVFKFVALG FVFAVPATYA
IQLQLLGGIL ITQTLPSVFL GLYTRNLNGK ATLVGWAAGI LSALALVIEA NAKFGVIKTS
LYTTPLGPLY IAILALLINL AVTLIGSGIA YGMGWRPSQK IKEEEITKEM
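
The protein sequence begins with M although the gene sequence begins with GTG, which is consistent with GTG serving as an initiation codon under translation table 11. The following is a rough sketch of that translation, assuming the standard codon assignments (which table 11 shares) and treating only ATG, GTG and TTG as initiators, a subset of the start codons listed for table 11. `translate_cds` is an illustrative name, not a library function.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; table 11 uses the same ones.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: only a subset of the table 11 initiation codons is handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading an initiator as M and stopping at the first stop."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codons are read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First seven codons of the gene sequence shown above.
print(translate_cds("GTGAGAGAGT TGGCATTCGG G"))  # MRELAFG
```

The output matches the first seven residues of the protein sequence. The sketch does not handle ambiguous bases such as N.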