Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1496 |
Symbol | |
ID | 5104743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1461183 |
End bp | 1462775 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507384 |
Product | Na+/solute symporter |
Protein accession | YP_001191577 |
Protein GI | 146304261 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAGAGT TGGCATTCGG GAATATAGAC GGAGTTACCC TTGGTGTTTT CGTGGTTTTA TTTGCAATTT TTGCCTTTTT GGGTTTTTGG GCATCTAGAT GGAGGAGAGG AGATCTCTCC AAAATTGACG AATGGGGGCT TGGAGGAAGA AGACTAGGCT GGCTGTTAGT GTGGTTCTTG ATGGGGGCAG ATCTGTTCAC TGCCTACACC TTCATAGCCG TACCCTCAAG TATGCTGGCT GTAGGTTCGT TATACTTCTT CGCTGTGCCT TACGTGGCAT GGGGGTTCGG AGTTGCCCTT TTAACCATGC CTAGATTATG GACAGTTTCT AGAAACAAGG GATATGTGAC GGCCTCCGAC TTCGTTAAGG ACAGATTCGA CAGCAGATGG CTCGCCATTG CGGTTGCACT AACTGGAATA GTTGCTGAGC TGCCATACAT TGCACTACAG ATAGTGGGTA TGCAGGCAGT GCTTGCAGCT ATGCTTGCTG GCTTAACTGG AGTAGTTTCT AAGACAGTGT CTGACATTGC CCTGATAATT GCGTTCGCCA TCTTGGCCTC GTTTACCTTT ACCAGCGGTC TAAGAGGGGC AGCAATTACT GGAGTCTTTA AGGATATACT TATCTGGATT ACAGTGCTTG CTGTGATCAT TATAGTCCCG CTAAGTTACG GAGGATTCGC AAGTGCATTT CATAATGCAG CTATCCAATC TGCAACGGTA AATCAAGCTC TGAATCACGC TAAGGGTCCC ATAAATTACG GCGCACTATC TCCTAAACTC ATCCCTGCGT ACTTCTCACT ATCCCTTGGA TCAGCACTTG CACTTTACCT ATACCCGCAT GCCATAAATG GTTCACTTAG CTCAGAGGAC AAGGGGAAAC TGAAGCTAGG TACTTCACTA TTACCAATTT ATGGTATTGG TTTGGCTTTA CTAGCACTCT TCGGGATCTT GGTATATGCA GTTCCAAATG CCCTTAGTGC AGTAATTAAG TTAGGTGCTG GCACTTTCGT GGTTCCTTCA CTCATTGCAT ATACGATGCC CGACTGGTTT GTGGGGCTGG CTTACCTAGC AATTTTCATA GGAGGACTCG TCCCAGCAGC AATCATGGCC ATAGGAGTAG CTAACCTTCT TGTAAGGAAC GTGATCAAGG AGTTCAAGTC CCTAGAGCCT AAGACTGAGG CTACACTAGC TAAGGTTATC TCCACAGTCT TCAAGTTCGT GGCCTTGGGG TTCGTGTTTG CGGTCCCCGC TACCTACGCA ATCCAGCTCC AGTTACTAGG CGGAATCCTG ATAACGCAAA CTTTACCCTC GGTGTTCCTA GGACTCTACA CCAGGAATCT TAATGGAAAG GCTACCCTTG TAGGGTGGGC AGCTGGAATT CTGTCAGCCT TAGCCCTCGT TATTGAGGCT AACGCAAAGT TCGGAGTAAT AAAGACTAGC CTTTACACCA CGCCCCTAGG TCCACTCTAT ATAGCGATCC TTGCACTACT GATCAACCTA GCGGTGACGT TGATAGGATC AGGAATAGCA TATGGAATGG GATGGAGACC TTCACAGAAG ATAAAGGAAG AGGAGATCAC TAAGGAGATG TAA
|
Protein sequence | MRELAFGNID GVTLGVFVVL FAIFAFLGFW ASRWRRGDLS KIDEWGLGGR RLGWLLVWFL MGADLFTAYT FIAVPSSMLA VGSLYFFAVP YVAWGFGVAL LTMPRLWTVS RNKGYVTASD FVKDRFDSRW LAIAVALTGI VAELPYIALQ IVGMQAVLAA MLAGLTGVVS KTVSDIALII AFAILASFTF TSGLRGAAIT GVFKDILIWI TVLAVIIIVP LSYGGFASAF HNAAIQSATV NQALNHAKGP INYGALSPKL IPAYFSLSLG SALALYLYPH AINGSLSSED KGKLKLGTSL LPIYGIGLAL LALFGILVYA VPNALSAVIK LGAGTFVVPS LIAYTMPDWF VGLAYLAIFI GGLVPAAIMA IGVANLLVRN VIKEFKSLEP KTEATLAKVI STVFKFVALG FVFAVPATYA IQLQLLGGIL ITQTLPSVFL GLYTRNLNGK ATLVGWAAGI LSALALVIEA NAKFGVIKTS LYTTPLGPLY IAILALLINL AVTLIGSGIA YGMGWRPSQK IKEEEITKEM
|
| |