Gene Msed_2161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2161 
SymbollysS 
ID5104900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp2075586 
End bp2077076 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content47% 
IMG OID640508052 
Productlysyl-tRNA synthetase 
Protein accessionYP_001192224 
Protein GI146304908 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000571826 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTGGCTA TGAAATGGGA CGAGAGGAGA ATCCAAGTTT TAGAGGAACT TAGAGCTCGC 
GGAGTAAACC CTTACCCACA GAAATATCCA ATTACTCACA CGGTAGTTCA GTTGAGACAG
ATAGGGAACC AGAGACAGGA CAAGCCCAAG GATCCCTTCC TTACAGGAGT TAGGACAGCT
GGAAGGGTGG CCAATCTCAG AAGGCACGGG AAAGCCTCGT TCGTCGATAT ATTTGATGAT
GGGGAGAAAC TTCAGCTATA CCTGAGGGTA AATGAGCTTG GTGACAAGTA TGAGTATTTC
TTCAAGGTCG TGGATAGGGG AGACATAATA GGCATTCAGG GGGATCTGTT TTTCACGGCA
AAGGGAGAGC TAACACTGCT GGTTAAGGAT TTTGCCATGC TTGCCAAATC CTTGATAGAG
CCCCCTGATT GGTCTACGAT GAGCCCGGAG TTTAGGTATT CCCACAGATA CGTTGATTTC
CTATATAACG ACAATGCGAG AAGGTTTATG GAAACCAGGT TCAAGATCAT CAGGGAAATA
AGGAACATAC TTGGAGAGGA GGGTTTTATT GAGGTGGAGA CTCCAATTCT TCAACCTGTG
TATGGAGGTG CCCTGGCTAG ACCTTTCAAA TCTCACGTTA ACTACCTTAA TGAGGACTGG
TACCTCAGAA TCTCCTTGGA ACTTTATCTG AAGAGGTTCA TTGTGGGAGG CTTCAATAAG
GTCTTTGAGA TCGGAAAGGT ATTCAGGAAC GAGGATATAG ATGTCACTCA CAACCCAGAG
TTCACGCTCA TGGAGCTATA CTGGGCCTAC GCTGACTATA ACGATATCAT GGATCTCACA
GAGAGGTTAT TCAAACAGCT AGCTGATAGG GTTCTTCACT CCACCAAGAT ACCCTACAAG
GTTGGGGAGA AAACTGTGGA GATCAACCTG GAAAGCTTCA GGCGCGTCTC CATGTACGAT
TCCCTGTCAG AGGCCCTTGG GAAGAACGTG GAGGAGATGA CCGATGACCA GCTTAAGGAA
TTGATGCAGA GGAACGGGCT GGTTCCAAGG GGTTCACTGT ATATCCGAGG TCTTATGATC
GAGAAGTTAT TTGATAAACT CGTCACACCT AGCCTAGTTC AACCTACTTT CGTGACTGAC
TATCCCATAG AGACTACCCC ACTTTGCAAG CCTCACAGAA ATAAGCCAGG ACTTGTGGAG
AGATTCGAGC TCTACATTGC AGGAGTGGAA TTCGCCAATG CCTACACGGA GCTCAACGAT
CCCATTTTAC AGGACAAGCT GTTCAGGGAA GAGCAGGACA TGATGAAACG TGGCGACCAA
GAGGCACATC CCTACGATAA GGATTTCATA AACGCCTTAA GTTACGGTAT GCCACCCACG
GGAGGCTTGG GGATTGGTAT TGACAGGCTC GTGATGCTTT TCACCAATAA CCAGAGCATA
AAGGAAGTTA TTCCATTCCC CATGGTCAGC TCGAAGCTTA TTGAGGAATA A
 
Protein sequence
MLAMKWDERR IQVLEELRAR GVNPYPQKYP ITHTVVQLRQ IGNQRQDKPK DPFLTGVRTA 
GRVANLRRHG KASFVDIFDD GEKLQLYLRV NELGDKYEYF FKVVDRGDII GIQGDLFFTA
KGELTLLVKD FAMLAKSLIE PPDWSTMSPE FRYSHRYVDF LYNDNARRFM ETRFKIIREI
RNILGEEGFI EVETPILQPV YGGALARPFK SHVNYLNEDW YLRISLELYL KRFIVGGFNK
VFEIGKVFRN EDIDVTHNPE FTLMELYWAY ADYNDIMDLT ERLFKQLADR VLHSTKIPYK
VGEKTVEINL ESFRRVSMYD SLSEALGKNV EEMTDDQLKE LMQRNGLVPR GSLYIRGLMI
EKLFDKLVTP SLVQPTFVTD YPIETTPLCK PHRNKPGLVE RFELYIAGVE FANAYTELND
PILQDKLFRE EQDMMKRGDQ EAHPYDKDFI NALSYGMPPT GGLGIGIDRL VMLFTNNQSI
KEVIPFPMVS SKLIEE