Gene Athe_0659 details

Gene Information

Locus tag: Athe_0659
Symbol:
ID: 7407083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 743148
End bp: 744641
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table: 11
GC content: 38%
IMG OID: 643715040
Product: lysyl-tRNA synthetase
Protein accession: YP_002572556
Protein GI: 222528674
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1190] Lysyl-tRNA synthetase (class II)
TIGRFAM ID: [TIGR00499] lysyl-tRNA synthetase, eukaryotic and non-spirochete bacterial
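
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes one terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 743148
END_BP = 744641
GENE_LENGTH_BP = 1494
PROTEIN_LENGTH_AA = 497


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```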


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000863751
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATGC AGGATTTTGA GTTTACTCAG GAAGAGCTAA ACGAGCAGAT ACAAAACAGG 
ATAAAAAAGC TAAAAGAACT TCAGAAGAAT AAATACAATC CATATGAAAA GGTAAAATAT
GACCCGACAC ATTATTCTAC TGATATTAAA GAGAACTTTG AAGTATTTGA AGGCAAGTTT
GTCTGTGTTG CAGGAAGAAT GCTTTCAAAA AGAGGTCATG GCAAGGCTTC GTTTGTGGAT
ATTTTAGATA CAAAAGGCAA AATCCAGATA TATATAAAAA TTGATGAAGT TGGAGAAGAA
AAGTACGAGG AGTTCAAAGA ATATTATGAT ATTGGAGATA TAATAGGGGT AAAGGGAGAG
GTTTTCAAGA CTCACAAGGG CGAAATTTCG GTAAAGGCGA AAGAGATTGA GATGCTTACC
AAGTGTTTGC GACCACTTCC TGAAAAGTGG CATGGGCTCA AGGATGTTGA TACAAGATAT
AGAAAAAGGT ATCTTGACCT GATTGTAAAT CCTCAGGTTC GGGATACTTT TATCAAAAGA
AGTTTAATAA TCCGTTCAAT ACGCAAGTTT TTAGATGACA GGGGATTTTT GGAGGTTGAA
ACTCCTGTTT TGAGTCCTGT TGCAGGTGGT GCTGCTGCAA GACCTTTTAT TACCCATCAC
AATGCTTTGG ACATTGACCT TTATTTGAGA ATTGCAACAG AGCTCCACTT AAAAAGACTT
ATAGTTGGTG GGTTTGATAA GGTATATGAG CTTGGTCGAG TGTTTAGAAA TGAAGGTATT
TCAATAAAAC ACAACCCTGA ATTTACAACC ATTGAGATTT ACCAGGCGTA TGCTGACTAT
AAGGACATGA TGGATTTAAC AGAAAAGCTT ATCACAACGG TTGCACAAGA GGTTTTAGGT
ACATTAAAAA TAACATATCA AGGGCAGGAA ATTGACCTGA CAGCGCCATG GCAAAGGCTT
ACCATGGTTG AAGCAATCAA AAAGTATGTT GGCGTAGATT TTGAAAATGT TACTTCTTTG
GATGAAGCAA GAAAAATTGC AAAGGACCTT GGGATTGAGG TTGAGGAAAA CTGGCAGATA
GGTCATATCA TAAATGAAAT ATTTGAAAAG AAAGTTGAGG ATTTTCTGGT TCAGCCCACG
TTCATAATGG ACTATCCAGT TGAAGTTTCA CCACTTGCAA AGCGTAAGAA AGACAATCCT
CAGTTTACTG AGAGGTTTGA GCTTTTTATT ACTTGTAGGG AAATTGCGAA TGCATTTTCA
GAGCTCAACG ATCCGTTTGA TCAGAAAGAG AGATTTTTAG AGCAGCTCAA GGAAAGGCAG
AGAGGCAATC AAGAAGCCCA CATGATGGAT GAGGATTTTA TAGAAGCGCT TGAGTACGGT
ATGCCACCGA CAGGCGGGCT TGGAATAGGA ATTGACAGGC TGGTTATGCT TTTAACAGAT
TCGTATTCAA TCAGAGATGT GTTACTTTTT CCAACAATGC GACCGAAGGA TTAG
 
Protein sequence
MGMQDFEFTQ EELNEQIQNR IKKLKELQKN KYNPYEKVKY DPTHYSTDIK ENFEVFEGKF 
VCVAGRMLSK RGHGKASFVD ILDTKGKIQI YIKIDEVGEE KYEEFKEYYD IGDIIGVKGE
VFKTHKGEIS VKAKEIEMLT KCLRPLPEKW HGLKDVDTRY RKRYLDLIVN PQVRDTFIKR
SLIIRSIRKF LDDRGFLEVE TPVLSPVAGG AAARPFITHH NALDIDLYLR IATELHLKRL
IVGGFDKVYE LGRVFRNEGI SIKHNPEFTT IEIYQAYADY KDMMDLTEKL ITTVAQEVLG
TLKITYQGQE IDLTAPWQRL TMVEAIKKYV GVDFENVTSL DEARKIAKDL GIEVEENWQI
GHIINEIFEK KVEDFLVQPT FIMDYPVEVS PLAKRKKDNP QFTERFELFI TCREIANAFS
ELNDPFDQKE RFLEQLKERQ RGNQEAHMMD EDFIEALEYG MPPTGGLGIG IDRLVMLLTD
SYSIRDVLLF PTMRPKD
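
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample string is only the first line of the gene sequence, so its result is not the whole-gene GC content reported in this record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (illustration only).
sample = "ATGGGGATGC AGGATTTTGA GTTTACTCAG GAAGAGCTAA ACGAGCAGAT ACAAAACAGG"
cleaned = clean_sequence(sample)
print(len(cleaned), round(gc_percent(cleaned), 1))
```

To apply this to the full gene, paste all lines of the gene sequence into one string before calling `clean_sequence`.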