Gene Moth_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0152 
Symbol 
ID3832382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp146086 
End bp147555 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content57% 
IMG OID637828085 
Productlysyl-tRNA synthetase 
Protein accessionYP_429033 
Protein GI83589024 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1190] Lysyl-tRNA synthetase (class II) 
TIGRFAM ID[TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000016249 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTGG AAGCTGAAAA TGATTTAATG GCCGTAAGGC TGGAAAAGCT GCACCAGCTG 
CAGGAAGCCG GCATCGAACC CTACGGCGGC CCCTTCGAAG TCACCCACAG CACTACCGCC
ATCCGGGAGC GCTTCGATGA ACTCGAAGGT CAGGAGGTAG CCCTGGCCGG TCGCCTCCTG
GCTATCAGGA GCCACGGCAA GGCTTCCTTT GCTGATCTGC AGGACCGGGA GGGCCGCTTG
CAACTTTACA TTCGCCTCGA TAATGTAGGT CCCGGGATTT ATGAACTGTT CCAAAAACTC
GATATCGGCG ATATCGTCGG CGTCCGCGGC AAGGTTTTTC GTACCCATCG CGGCGAAATA
TCAGTAGAAG TCCGTCAACT GACACTCCTG TGCAAGAGTT TACGCCCCCT GCCGGAGAAA
TGGCATGGCT TGAAGGATGT CGACCTGCGC TACCGGCAGC GTTACCTGGA TCTGATTGTC
AACCCGGAGG TCAAACAGGT ATTTATCACC CGGGCCCGGA TCATCCGCGC CATCCGGTCC
TTCCTGGACA ACCGAGGCTT TTTGGAAGTA GAGACACCGA CCATGCATCC CATTGCCGGC
GGCGCCGCTG CCAGGCCCTT TATCACCCAT CACAACGCCC TGGATATTGA CCTCTACCTG
CGCATTGCCC TGGAACTGCA TTTAAAACGG CTGCTGGTGG GCGGCCTGGA AAAGGTCTAC
GAAATGGGCC GCATTTTCCG CAATGAAGGC ATCTCCACCA AACACAACCC CGAGTTTACC
ATGCTGGAGC TCTACCAGGC CTATGCCGAT TATTATGTCA TGATGGATCT GCTGGAGGAA
ATGGTAGCCT ATGTCGCCCG GGAGGCTCTG GGTACCACTG TTGTTACCTA CCAGGGGGAC
AGGCTGGATC TCACCCCTCC CTGGCCGCGG TTAACCATGC TGGAAGCTAT TAAGAAATAC
TACGGCGTGG ACTTTGATCA GTTGCCCACG GCCGAGGACG CCCGGCGGGC AGCCATCAGC
CTGGGCCTGG AGATAGAGCC GGGCATGGAG CGGGGGAAAA TAATCAACGA GGTCTTTGAA
GCCACAGTCG AACCCCATCT TATTCAGCCG ACCTTCATCC TGGATTACCC GGTGGCCATA
TCCCCGCTGG CCAAGCGAAA AAAAGAAAAC CCGGACTTTA CCTACCGCTT TGAAGCCTTT
ATAGCCGGCA GGGAATTGGC CAACGCTTTC TCCGAGCTCA ATGACCCCAT CGACCAGCGA
CGGCGCTTTG AAGCCCAGAT GGCTGAAAGG GCGGCCGGCG ACGAAGAGGC CCACATGATG
GACGAAGACT TCTTGCAGGC CCTGGAGTAC GGCATGCCGC CTGCAGGGGG GATGGGCATC
GGCATCGACC GCCTGGTCAT GGTTCTTACG GATTCGCCCT CCATCAGGGA CGTTATCCTC
TTCCCCACTA TGCGGCCGAA GGAAGAATGA
 
Protein sequence
MKLEAENDLM AVRLEKLHQL QEAGIEPYGG PFEVTHSTTA IRERFDELEG QEVALAGRLL 
AIRSHGKASF ADLQDREGRL QLYIRLDNVG PGIYELFQKL DIGDIVGVRG KVFRTHRGEI
SVEVRQLTLL CKSLRPLPEK WHGLKDVDLR YRQRYLDLIV NPEVKQVFIT RARIIRAIRS
FLDNRGFLEV ETPTMHPIAG GAAARPFITH HNALDIDLYL RIALELHLKR LLVGGLEKVY
EMGRIFRNEG ISTKHNPEFT MLELYQAYAD YYVMMDLLEE MVAYVAREAL GTTVVTYQGD
RLDLTPPWPR LTMLEAIKKY YGVDFDQLPT AEDARRAAIS LGLEIEPGME RGKIINEVFE
ATVEPHLIQP TFILDYPVAI SPLAKRKKEN PDFTYRFEAF IAGRELANAF SELNDPIDQR
RRFEAQMAER AAGDEEAHMM DEDFLQALEY GMPPAGGMGI GIDRLVMVLT DSPSIRDVIL
FPTMRPKEE