Gene Msed_1940 details

Gene Information

Locus tag: Msed_1940
Symbol: leuS
ID: 5103327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1883466
End bp: 1886282
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 49%
IMG OID: 640507828
Product: leucyl-tRNA synthetase
Protein accession: YP_001192004
Protein GI: 162149625
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0495] Leucyl-tRNA synthetase
TIGRFAM ID: [TIGR00395] leucyl-tRNA synthetase, archaeal and cytosolic family
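
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS length that includes one stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1883466, 1886282)
print(length, protein_length(length))  # prints "2817 938"
```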


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAATGAAA TCTCAAAGAA ATGGCAAGAG GAGTGGAGCA AGAACAGGAT ATTTGAGGCT 
GATCCCAAGG ATCAGAAGAA GTTCTTCACC ACTGTCGCGT TTCCCTATCC TAACTCTCCA
TTTCACCTGG GACACGGTAG AACATACGTA ACGGGTGACG TTTACGCTAG ATTCATGAGG
ATGAAGGGAT ATAACGTCCT CTTCCCAATG GGCTTTCACT TTACTGGTAC CCCGATCATA
ACCATGGCAG ATGACGTGGC GAAGGGTGAT AAGGACCTAC TCGACATATT TCAGAACATT
TACGAGATAC CAGCTGACGT TATACCCAAG TTGTCCGACC CGCTCTTCAT GGCGAATTAC
TTTAAGGAAG ATATCAAGGC AGCCATGAGG GAAATAGGTT TATCCATAGA TTGGAGGAGG
GAGTTCACCA CAATAGACCC ACAATTCTCA GCTTTCATAG TGTGGCAATT CTCCAAGCTA
CAGAAGAAAG GTTACGTGGT AAAGGATACC CATCCCGTGG GTTGGTGTCC CGTTCACAAC
CTCCCCGTTG GAATGCATGA TACCAAGGGA GACATGGAAC CTGAGATTGG AGAGTACGTT
GTGATATTCT TCGAGAGTAA GATGGGAGCA CTTGCCGCTG CGACCCTAAG GCCTGAAACC
ATTTTTGGGG CAGTAGCAGT TTGGGTAAAC CCTAAGGCAA CGTACACGGT TGCGGAGATT
TGGGGTAAGA AGGTAATAGT CTCGGAGAAG GCCGCCGAGA AGTTGAAGTT CCAGACTGAC
GTGAAGGTGC TCGAAAAGGT CAGCGGATCG GACCTTCTGA AGATCGTGGC GATAAACCCC
ATTACGGGGA AGGAGATTCC AATCCTTCCT GCAGATTTCG TGGACCCCAC AACTGCCACG
GGCGTCGTTA TGAGTGTTCC AGCACATGCT CCCTTTGACT ACTTCTACCT GAAGAAGGCC
AAGGTTGGCA TAGAACCCAT ACCCGTGGTC GCAGTTGAGG GACAGGGAGA TGCTCCAGCA
AAGGATCTAG TTGAGTCGTC CCATCCCAAG AACGACGCTG ATCTCAAGAA GCTCACTGAA
CAGCTGTACA GGTTGGAGTT TAACAAGGGG CTTATGAGGA GCGACATACT CAGGTTAGTG
AAAGACGAGC TCAGGGCTGA GCTTTCAGTG GTAGCGGGAA AACAGGTACC TGAGGCCAGG
AAGATGGTCA CGGATATCTT GATTCAGAGG AAGGCTGGCA CGAAGATGCT GGAGATAATG
AACAAGCCCG TGTACTGCAG GTGCGGTAAC GAAGTGGTTG TTAAGATTCT TCAGGACCAG
TGGTTCTTGG ATTACGGCAA CCCCGAGTGG AAGGCTAAGG CTAAAAAGCT CTTGGACAGC
ATGAGGGTAA TCCCCGAGGA GACCAGGAAG GACTTCGAAT ACGCCCTCGA TTGGTTGCAA
AAGAGAGCCT GTGCAAGGAC AAGGGGACTA GGTACCCCTC TCCCCTGGGA CAAGAAATGG
ATCATCGAGA GCCTATCCGA TTCCACTATC TACATGGCTT ACTATACCCT CTCCCACAAG
ATAAAGGAGT TCGGACTTCA TCCCTCTCAA CTCACTGAGG AGACCTGGGA TTACATAATG
CTGGGAGAAG GCGACGTCAA GGCCATATCG GAGAGAAACA AGATAGGAGT AGATGCCTTA
CAAGAGTTGA GGAGACACTT CACCTACTGG TATCCGCTTG ATCTAAGGCA TAGCGGTCCT
GATCTGATCC CCAACCACCT GAGCTTCTTC ATATTCAATC ACGCTGGGAT ATTTCCTGAA
AACCTCTGGC CAAGGGGCGT TGCCGTAAAC GGCTTCATCC TCTACGAGGG GAAGAAGATG
AGCAAGTCCC TGAGGAACAT AGTTCCGCTG AGGAAGGCCA TAAGAACGTA TGGAGCAGAC
GTGATAAGGA TAGCCCTTTC CTCTCTGGTT GATATGAGCT CTGACGCTAA CTTCACCGAG
GCAGGGGCCA GGGCAATCGC AGACAACCTG AAGAGGTTTT ACGAACTGAT GCAGATGCAG
GATGGCTCCA CGATTGATGG AACTCCTGAG AAGTGGCTGA GATCGAAGTT ACACAGGCTA
GTAAGAGACG TCACTCCGCT CATGGAGTCC ATGAGGTTCA GGGAGGTGAT AAACGAGCTG
CTCTTCAACC TATCATCTTA CATCAACGAA TACCTAGAAA TGGTGAGGTC GGAGTCCAGG
GAGTACAATA GGGATGTGAT CAGGGAGGTT GTGGAGACCT GGACAAAGTT AATGGCCCCC
TTCGCCCCTC ACCTTACCGA GGAGATGTGG CATCAACTGG GGCATAACAC CTTCCTGTCC
TTAGAGAGTT GGCCAACCCC AGACAATAGC AAGATCAATG ACCAGATAGA GTTGGAACAT
GAGTATCATA AGCTGTTAAT AGAGGACATA AGGGCAATAC TCAACGTGTA CAAGGGTAAG
CCTTCCTCCG TCCTATTATA CGTTCACGAC GGGAGTCTGA ATCAGGTCGT GAAGAGCGCT
CTGGACGTGC TGAACAGTGG AGGGACAATG AAGGACTTCA TGCAGAAGAA CACGCCAAAG
AGCAAGGAGG AGGCAAGGGT ACTTCAGAGG ATCATGCAAT ACGTGACGGA GATGCCGGAG
ACCGTGAAGA AACTAATCTA CTCTAACGTC AATGAAATGG AAGTAACGAG AAAGGGAGTG
CCACTTCTGA GGTATAAGCT GAACCTAGAG ATTGAGGTTT TGGCCTACAC TCAGGAAGTG
AAGCAGAAAT TAAACAAAGA CGCTTTGCCC TACAGGCCAG CAATACTCGT GAAGTAA
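
A GC content figure such as the one reported for this gene could be recomputed from the space-separated sequence blocks roughly as shown below. This is a sketch only: the helper name `gc_content` is hypothetical, and how the source database rounds or computes its own value is not stated here.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Short demo input: the first two 10-base blocks of the gene sequence above.
demo = "TTGAATGAAA TCTCAAAGAA"
print(f"{gc_content(demo):.0%}")
```

Passing the full gene sequence (all block rows joined) instead of `demo` would give the gene-wide value.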
 
Protein sequence
MNEISKKWQE EWSKNRIFEA DPKDQKKFFT TVAFPYPNSP FHLGHGRTYV TGDVYARFMR 
MKGYNVLFPM GFHFTGTPII TMADDVAKGD KDLLDIFQNI YEIPADVIPK LSDPLFMANY
FKEDIKAAMR EIGLSIDWRR EFTTIDPQFS AFIVWQFSKL QKKGYVVKDT HPVGWCPVHN
LPVGMHDTKG DMEPEIGEYV VIFFESKMGA LAAATLRPET IFGAVAVWVN PKATYTVAEI
WGKKVIVSEK AAEKLKFQTD VKVLEKVSGS DLLKIVAINP ITGKEIPILP ADFVDPTTAT
GVVMSVPAHA PFDYFYLKKA KVGIEPIPVV AVEGQGDAPA KDLVESSHPK NDADLKKLTE
QLYRLEFNKG LMRSDILRLV KDELRAELSV VAGKQVPEAR KMVTDILIQR KAGTKMLEIM
NKPVYCRCGN EVVVKILQDQ WFLDYGNPEW KAKAKKLLDS MRVIPEETRK DFEYALDWLQ
KRACARTRGL GTPLPWDKKW IIESLSDSTI YMAYYTLSHK IKEFGLHPSQ LTEETWDYIM
LGEGDVKAIS ERNKIGVDAL QELRRHFTYW YPLDLRHSGP DLIPNHLSFF IFNHAGIFPE
NLWPRGVAVN GFILYEGKKM SKSLRNIVPL RKAIRTYGAD VIRIALSSLV DMSSDANFTE
AGARAIADNL KRFYELMQMQ DGSTIDGTPE KWLRSKLHRL VRDVTPLMES MRFREVINEL
LFNLSSYINE YLEMVRSESR EYNRDVIREV VETWTKLMAP FAPHLTEEMW HQLGHNTFLS
LESWPTPDNS KINDQIELEH EYHKLLIEDI RAILNVYKGK PSSVLLYVHD GSLNQVVKSA
LDVLNSGGTM KDFMQKNTPK SKEEARVLQR IMQYVTEMPE TVKKLIYSNV NEMEVTRKGV
PLLRYKLNLE IEVLAYTQEV KQKLNKDALP YRPAILVK