Gene Information |
Locus tag | Msed_1683 |
Symbol | valS |
ID | 5105329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1620653 |
End bp | 1623079 |
Gene Length | 2427 bp |
Protein Length | 808 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507577 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001191762 |
Protein GI | 146304446 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
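The length fields above follow from the coordinates. Assuming Start bp and End bp are 1-based and inclusive (which is consistent with the values in this record), the gene length is End bp minus Start bp plus one, and the protein length is the number of codons minus the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates of Msed_1683 as listed in this record.
length_bp = gene_length(1620653, 1623079)
print(length_bp, protein_length(length_bp))  # 2427 808
```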
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0962202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0485293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTGTCCC AGGATGAGAT TTTAAAGAGA ATGGAGGAAT GGCCCAAGCA TTATAATCCC AAGGATATTG AACCAAAGTG GCAGAAGCTC TGGTTGACCC AGGAGTACTG GGAAAAGATC TTCAAGTTCG ATGAGAACTC TGAAAAACCA GTCTTCTTCA TTGATACCCC TCCGCCATTT ACCAGCGGAG AACTGCACAT GGGCCACGCC TATTGGGTTA CCATAGCTGA TGCAATGGCG AGGTTCAAGA AGCTTCAGGG TTATAACGTT CTATTCCCTC AGGGGTGGGA TACCCAAGGA TTACCAACCG AGCTCAAGGT TCAGTACAAG CTCGGGATAC CCAAGGAGAA CAGGGACCTC TTCCTAAAGA AGTGCGTGGA ATGGACCGAG GACATGATAG GAAAAATGAA GTCTGCCATG ATTAGACTGG GATATAGGCC CAACTGGGAG CAGTTTGAGT ATAGAACATA TCACCCCAAT TACAGGAGGG TCATACAGAG AAGCCTCATC GAGATGCATC AAAAGGGCAT GATAAGGATG AAACAGGGAC CAGTGTATTG GTGTCCAAAG TGTGAGACCG CTGTGGCGCA GAGCGAGGTT GGATACCTAG AGAAGCAGGG CATCCTTGTT TACATTGGCT TCCCTCTGAA AGAAGGTGGG GAGATAGTTA TTGCCACTAC ACGTCCAGAA CTTCTAGGGG CCACTCAGGC TGTTGCAGTA AATCCTCAGG ATGAACGTTA CAAGCACCTC GTGGGAAAGA AGGTAATTCT TCCGATCTTT GGAAAGGAAG TTCAGATAAT AGCTGATCCA GACGTGGAAA AGGACTTCGG AACCGGGGCA GTGATGATAA GCACCTACGG TGACCCCCAG GACATAAGGT GGCAACTAAA GTATAACCTT CCCGTCACGG AACTCGTGGA TGAGAGAGGT AGAATAAAGG GGACTGGTTT CCTAGACGGA TTGAAAGTAG AAGAGGCTAA GAAGAAGATT GTCGAACTAT TAAAGGAGAG GGGATACGTT AGGAAAATTG AAAGCATAAA ACACAACGTG CTCTCTCACA CCGAGAGAAG CGACTGCCTC TCGCCAATAG AGTTCCTGGT GAAGGAACAG GTATACATAG ATGTTGTTCC CTACAAAGGA AAACTCCTTG AGGAGTATAA GAAAATGAAC TTTAAGCCAC AGAGAATGGC AAGCAAACTT GAGGAGTGGA TAAATAACAT TGAGTGGGAC TGGAACATAA GCAGACAAAG GCTTTACGGG ACCCCCTTAC CCTTCTGGTA TTGCGATAAC CTTCACCTAG TTCCGGCCGA CATATCGTCT TTACCTGTGG ATCCCACCAA GAGTAACCCG CCTGTGGAAA AGTGCCCGCA TTGCGGACTT CCCCTAAAAC CACTTACTCA TGTGGCTGAT GTGTGGGTAG ACTCAAGTGT GACTGTACTA TACCTCTCTG GGTTCTATGA GAATAAGTTG AGGTTTAGCA AGACCTTTCC GGCCGATGTG AGACTTCAGG GTACTGATAT AATTAGAACC TGGCTATTCT ATACCTTCTT TAGAACCCTG ATGCTGGCAG GGAATGTACC ATTCAAGAGA GTAATAGTTC ACGGACAAGT TCTAGGTCCA GATGGGACGA GGATGAGCAA GAGTAAGGGT AACAACGTAT CTCCAATGGA CAGAATAGAT GAGTTTGGAG CTGATGCCAT TAGGATGACC CTGCTGGACG CCTCAATAGG CGACGATTTC CCCTTCAAGT GGGATACGGT AAAGGGGAAG AAACTACTCC TTCAGAAGTT ATGGAACGCG 
AGCAGGCTAG CATATCCGTT CATACACGGT AGGAAATTGG AGAAGCCTGA GGTGTTGCAT CCAATTGACC AATGGATACT GGCTAAACAC AAGGAATTTG TGGAGAAGGC TATCCAAGCC TATGAGAACC AGGACTTCTT CGTGATCCTT TCCATGCTTT ATGAGTACTT CTGGGAGACC ATAGCAGATG AATACCTTGA GCTTATCAAG CATAGATTGT TCAGCGATGA TAGATCTGCC ACCTATACCT TGGGGAGAAT ACTCAAGGAT CTAATTATCT TACTACACCC AATTGCCCCC CACATAACAG AGGAAATCTA TTCCCTTATC TTCGGTGAGA AGCAGAGCGT CCTTCTGGAG ACCTTACCGT CAACTGAAGA TATCAAGATA GCGGAGGAAG TTGTGAGAAT GGGGGACTCT GTCAAGAAAT TCAACTCGCT GGTGAGAACC AACAAGGTAA AAATGAGAAT GTCCATGAAC TCCCAAATTA GGGTGGAACT ACAGGGACCT AAGCAGTTCG TAGAGGATGT AATGAAGGTC GAAGACGACC TGAAGAAAAC GCTGAAAATA ACTGAGATCG CCTACCGTGA GGCGGACCAA CTCGACGTGA AGATATCTCC CACCTAA
|
Protein sequence | MLSQDEILKR MEEWPKHYNP KDIEPKWQKL WLTQEYWEKI FKFDENSEKP VFFIDTPPPF TSGELHMGHA YWVTIADAMA RFKKLQGYNV LFPQGWDTQG LPTELKVQYK LGIPKENRDL FLKKCVEWTE DMIGKMKSAM IRLGYRPNWE QFEYRTYHPN YRRVIQRSLI EMHQKGMIRM KQGPVYWCPK CETAVAQSEV GYLEKQGILV YIGFPLKEGG EIVIATTRPE LLGATQAVAV NPQDERYKHL VGKKVILPIF GKEVQIIADP DVEKDFGTGA VMISTYGDPQ DIRWQLKYNL PVTELVDERG RIKGTGFLDG LKVEEAKKKI VELLKERGYV RKIESIKHNV LSHTERSDCL SPIEFLVKEQ VYIDVVPYKG KLLEEYKKMN FKPQRMASKL EEWINNIEWD WNISRQRLYG TPLPFWYCDN LHLVPADISS LPVDPTKSNP PVEKCPHCGL PLKPLTHVAD VWVDSSVTVL YLSGFYENKL RFSKTFPADV RLQGTDIIRT WLFYTFFRTL MLAGNVPFKR VIVHGQVLGP DGTRMSKSKG NNVSPMDRID EFGADAIRMT LLDASIGDDF PFKWDTVKGK KLLLQKLWNA SRLAYPFIHG RKLEKPEVLH PIDQWILAKH KEFVEKAIQA YENQDFFVIL SMLYEYFWET IADEYLELIK HRLFSDDRSA TYTLGRILKD LIILLHPIAP HITEEIYSLI FGEKQSVLLE TLPSTEDIKI AEEVVRMGDS VKKFNSLVRT NKVKMRMSMN SQIRVELQGP KQFVEDVMKV EDDLKKTLKI TEIAYREADQ LDVKISPT
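The gene sequence begins with TTG while the protein begins with M: under translation table 11, TTG is an accepted start codon and is read as methionine at the initiation position. The Python sketch below shows how the 10-character blocks can be joined, how a GC fraction can be computed, and how the nucleotide sequence maps to the protein. It assumes the standard codon assignments, which table 11 shares, and it simplifies initiation by always reporting the first codon as M. The helper names are hypothetical and not taken from any database tool.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first, second, third base each cycle T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon.

    Simplification: the first codon is always reported as M, which matches
    how an initiation codon such as TTG is read under table 11.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("TTGTTGTCCC AGGATGAGAT TTTAAAGAGA"))  # MLSQDEILKR
```

Pasting the full gene sequence into `translate` and comparing the result with the protein sequence (after passing it through `clean`) is a quick way to check that the two fields of the record agree.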
|
| |