Gene Moth_2484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2484 
Symbol 
ID3831586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2589818 
End bp2591248 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content59% 
IMG OID637830406 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_431309 
Protein GI83591300 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000672423 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0231509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCTCT ACAATACGCT GACGGGACGC AAGGAAGAGT TCACCCCAGT TGAACCGGGC 
CGGGTACGAA TGTATGTCTG TGGCCCGACA ACCTACAACT ATATCCATCT GGGTAACGCC
CGGCCCATGG TGGTCTTCGA TACCCTGCGA CGTTACCTGG AGTACCGGAA CTATGATGTT
TTATACGTAC AGAACTTTAC TGACATTGAC GATAAGGTGA TTAACCGGGC CCGGGAGGAG
CACCAGGCTC CCCTGGTCAT TGCCGAACGG TATATTGAGG AATTCTTCAA GGACGCCGAT
GCCCTGAACG TCAAGCGGGC TACCCTTTAT CCCCGCGTTA GCCAGCATAT CGACGCCATT
ATCGCGGCCA TAGCCACCCT TGTTGAGCGT GGTTTCGCCT ATGTCGCCGA TGGGGATGTT
TATTTCGAAG TCGAAAAGTT TCCTGCCTAC GGCCGCCTGT CAAAGCGCAC CCCGGGGGAG
ATGCGGGCGG GGGCACGGGT GGAGGTCAAT ACCAGCAAAC GCAATCCCCT GGATTTCGCC
CTGTGGAAGG CGGCCTGCCC CGGCGAACCA TCATGGGAAA GCCCATGGGG ACCGGGGCGA
CCGGGATGGC ATATTGAGTG CTCGACCATG GCCCTCAAAT ACCTGGGCCC GGGCTTCGAT
ATCCATGGAG GCGGCGCCGA CCTCATTTTC CCTCATCACG AGAATGAAAT TGCCCAGGCT
GAGGCCCAGA CAGGGTGCAC CTTTGCCCGC TTCTGGCTCC ACAACGGCTT TATAACTGTA
AACCAGGAAA AAATGTCCAA GTCCAAGGGT AACTTCTTCC TGGTGCGGGA CATCCTCAAA
CGTTTCCGGC CCCTGGCGGT GCGCCTCTAC CTGCTGGCGA CCCATTACCG CAGTCCCATT
GACTTCGATG ATGCGGGCCT GCTGGCGGCG GAGAGGGGCC TGGAGCGTCT GGAAAATACC
CGCCGTCTCC TGGGCGAAGC CCGCTGCCAG CTAACTGGCA CCGGGGCGGA GACCACGGTG
CCAGCAAGAA CGTCGGCCCT GGCCGGAAGG GCGGAAGAAT TACGCCAGGA GTTCATCTCC
GCCATGGACG ACGACTTTAA TACCGCCCGG GCCCTGGCAG CCCTTTATGA CCTGGCCCGG
GAGATCAACT CCTACCTCAA CGGGACAACA ACCATCGACC CAGCGGCCCT GAGAACGGCG
GCTATAACCT TTGAGCAACT GGGGGGAGAA GTACTGGGCC TCTTTGGTCA GGCCCGGCAG
CAGGTAGATG ACGAACTCCT AAGCGGGCTT ATGGACCTCA TCCTACAGGT TCGCCAGGAG
GCCCGCCAGC GGCGCGACTG GGCCACGGCC GATACCATCC GGGACCGGTT GAAGGAGCTG
GGGATCGTCC TGGAGGATAC CCCCCGCGGC CCGCGTTGGA AAAGGAGTTA A
 
Protein sequence
MYLYNTLTGR KEEFTPVEPG RVRMYVCGPT TYNYIHLGNA RPMVVFDTLR RYLEYRNYDV 
LYVQNFTDID DKVINRAREE HQAPLVIAER YIEEFFKDAD ALNVKRATLY PRVSQHIDAI
IAAIATLVER GFAYVADGDV YFEVEKFPAY GRLSKRTPGE MRAGARVEVN TSKRNPLDFA
LWKAACPGEP SWESPWGPGR PGWHIECSTM ALKYLGPGFD IHGGGADLIF PHHENEIAQA
EAQTGCTFAR FWLHNGFITV NQEKMSKSKG NFFLVRDILK RFRPLAVRLY LLATHYRSPI
DFDDAGLLAA ERGLERLENT RRLLGEARCQ LTGTGAETTV PARTSALAGR AEELRQEFIS
AMDDDFNTAR ALAALYDLAR EINSYLNGTT TIDPAALRTA AITFEQLGGE VLGLFGQARQ
QVDDELLSGL MDLILQVRQE ARQRRDWATA DTIRDRLKEL GIVLEDTPRG PRWKRS