Gene Msed_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1683 
SymbolvalS 
ID5105329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1620653 
End bp1623079 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content47% 
IMG OID640507577 
Productvalyl-tRNA synthetase 
Protein accessionYP_001191762 
Protein GI146304446 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0962202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0485293 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGTCCC AGGATGAGAT TTTAAAGAGA ATGGAGGAAT GGCCCAAGCA TTATAATCCC 
AAGGATATTG AACCAAAGTG GCAGAAGCTC TGGTTGACCC AGGAGTACTG GGAAAAGATC
TTCAAGTTCG ATGAGAACTC TGAAAAACCA GTCTTCTTCA TTGATACCCC TCCGCCATTT
ACCAGCGGAG AACTGCACAT GGGCCACGCC TATTGGGTTA CCATAGCTGA TGCAATGGCG
AGGTTCAAGA AGCTTCAGGG TTATAACGTT CTATTCCCTC AGGGGTGGGA TACCCAAGGA
TTACCAACCG AGCTCAAGGT TCAGTACAAG CTCGGGATAC CCAAGGAGAA CAGGGACCTC
TTCCTAAAGA AGTGCGTGGA ATGGACCGAG GACATGATAG GAAAAATGAA GTCTGCCATG
ATTAGACTGG GATATAGGCC CAACTGGGAG CAGTTTGAGT ATAGAACATA TCACCCCAAT
TACAGGAGGG TCATACAGAG AAGCCTCATC GAGATGCATC AAAAGGGCAT GATAAGGATG
AAACAGGGAC CAGTGTATTG GTGTCCAAAG TGTGAGACCG CTGTGGCGCA GAGCGAGGTT
GGATACCTAG AGAAGCAGGG CATCCTTGTT TACATTGGCT TCCCTCTGAA AGAAGGTGGG
GAGATAGTTA TTGCCACTAC ACGTCCAGAA CTTCTAGGGG CCACTCAGGC TGTTGCAGTA
AATCCTCAGG ATGAACGTTA CAAGCACCTC GTGGGAAAGA AGGTAATTCT TCCGATCTTT
GGAAAGGAAG TTCAGATAAT AGCTGATCCA GACGTGGAAA AGGACTTCGG AACCGGGGCA
GTGATGATAA GCACCTACGG TGACCCCCAG GACATAAGGT GGCAACTAAA GTATAACCTT
CCCGTCACGG AACTCGTGGA TGAGAGAGGT AGAATAAAGG GGACTGGTTT CCTAGACGGA
TTGAAAGTAG AAGAGGCTAA GAAGAAGATT GTCGAACTAT TAAAGGAGAG GGGATACGTT
AGGAAAATTG AAAGCATAAA ACACAACGTG CTCTCTCACA CCGAGAGAAG CGACTGCCTC
TCGCCAATAG AGTTCCTGGT GAAGGAACAG GTATACATAG ATGTTGTTCC CTACAAAGGA
AAACTCCTTG AGGAGTATAA GAAAATGAAC TTTAAGCCAC AGAGAATGGC AAGCAAACTT
GAGGAGTGGA TAAATAACAT TGAGTGGGAC TGGAACATAA GCAGACAAAG GCTTTACGGG
ACCCCCTTAC CCTTCTGGTA TTGCGATAAC CTTCACCTAG TTCCGGCCGA CATATCGTCT
TTACCTGTGG ATCCCACCAA GAGTAACCCG CCTGTGGAAA AGTGCCCGCA TTGCGGACTT
CCCCTAAAAC CACTTACTCA TGTGGCTGAT GTGTGGGTAG ACTCAAGTGT GACTGTACTA
TACCTCTCTG GGTTCTATGA GAATAAGTTG AGGTTTAGCA AGACCTTTCC GGCCGATGTG
AGACTTCAGG GTACTGATAT AATTAGAACC TGGCTATTCT ATACCTTCTT TAGAACCCTG
ATGCTGGCAG GGAATGTACC ATTCAAGAGA GTAATAGTTC ACGGACAAGT TCTAGGTCCA
GATGGGACGA GGATGAGCAA GAGTAAGGGT AACAACGTAT CTCCAATGGA CAGAATAGAT
GAGTTTGGAG CTGATGCCAT TAGGATGACC CTGCTGGACG CCTCAATAGG CGACGATTTC
CCCTTCAAGT GGGATACGGT AAAGGGGAAG AAACTACTCC TTCAGAAGTT ATGGAACGCG
AGCAGGCTAG CATATCCGTT CATACACGGT AGGAAATTGG AGAAGCCTGA GGTGTTGCAT
CCAATTGACC AATGGATACT GGCTAAACAC AAGGAATTTG TGGAGAAGGC TATCCAAGCC
TATGAGAACC AGGACTTCTT CGTGATCCTT TCCATGCTTT ATGAGTACTT CTGGGAGACC
ATAGCAGATG AATACCTTGA GCTTATCAAG CATAGATTGT TCAGCGATGA TAGATCTGCC
ACCTATACCT TGGGGAGAAT ACTCAAGGAT CTAATTATCT TACTACACCC AATTGCCCCC
CACATAACAG AGGAAATCTA TTCCCTTATC TTCGGTGAGA AGCAGAGCGT CCTTCTGGAG
ACCTTACCGT CAACTGAAGA TATCAAGATA GCGGAGGAAG TTGTGAGAAT GGGGGACTCT
GTCAAGAAAT TCAACTCGCT GGTGAGAACC AACAAGGTAA AAATGAGAAT GTCCATGAAC
TCCCAAATTA GGGTGGAACT ACAGGGACCT AAGCAGTTCG TAGAGGATGT AATGAAGGTC
GAAGACGACC TGAAGAAAAC GCTGAAAATA ACTGAGATCG CCTACCGTGA GGCGGACCAA
CTCGACGTGA AGATATCTCC CACCTAA
 
Protein sequence
MLSQDEILKR MEEWPKHYNP KDIEPKWQKL WLTQEYWEKI FKFDENSEKP VFFIDTPPPF 
TSGELHMGHA YWVTIADAMA RFKKLQGYNV LFPQGWDTQG LPTELKVQYK LGIPKENRDL
FLKKCVEWTE DMIGKMKSAM IRLGYRPNWE QFEYRTYHPN YRRVIQRSLI EMHQKGMIRM
KQGPVYWCPK CETAVAQSEV GYLEKQGILV YIGFPLKEGG EIVIATTRPE LLGATQAVAV
NPQDERYKHL VGKKVILPIF GKEVQIIADP DVEKDFGTGA VMISTYGDPQ DIRWQLKYNL
PVTELVDERG RIKGTGFLDG LKVEEAKKKI VELLKERGYV RKIESIKHNV LSHTERSDCL
SPIEFLVKEQ VYIDVVPYKG KLLEEYKKMN FKPQRMASKL EEWINNIEWD WNISRQRLYG
TPLPFWYCDN LHLVPADISS LPVDPTKSNP PVEKCPHCGL PLKPLTHVAD VWVDSSVTVL
YLSGFYENKL RFSKTFPADV RLQGTDIIRT WLFYTFFRTL MLAGNVPFKR VIVHGQVLGP
DGTRMSKSKG NNVSPMDRID EFGADAIRMT LLDASIGDDF PFKWDTVKGK KLLLQKLWNA
SRLAYPFIHG RKLEKPEVLH PIDQWILAKH KEFVEKAIQA YENQDFFVIL SMLYEYFWET
IADEYLELIK HRLFSDDRSA TYTLGRILKD LIILLHPIAP HITEEIYSLI FGEKQSVLLE
TLPSTEDIKI AEEVVRMGDS VKKFNSLVRT NKVKMRMSMN SQIRVELQGP KQFVEDVMKV
EDDLKKTLKI TEIAYREADQ LDVKISPT