Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2484 |
Symbol | |
ID | 3831586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2589818 |
End bp | 2591248 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637830406 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_431309 |
Protein GI | 83591300 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000672423 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0231509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATCTCT ACAATACGCT GACGGGACGC AAGGAAGAGT TCACCCCAGT TGAACCGGGC CGGGTACGAA TGTATGTCTG TGGCCCGACA ACCTACAACT ATATCCATCT GGGTAACGCC CGGCCCATGG TGGTCTTCGA TACCCTGCGA CGTTACCTGG AGTACCGGAA CTATGATGTT TTATACGTAC AGAACTTTAC TGACATTGAC GATAAGGTGA TTAACCGGGC CCGGGAGGAG CACCAGGCTC CCCTGGTCAT TGCCGAACGG TATATTGAGG AATTCTTCAA GGACGCCGAT GCCCTGAACG TCAAGCGGGC TACCCTTTAT CCCCGCGTTA GCCAGCATAT CGACGCCATT ATCGCGGCCA TAGCCACCCT TGTTGAGCGT GGTTTCGCCT ATGTCGCCGA TGGGGATGTT TATTTCGAAG TCGAAAAGTT TCCTGCCTAC GGCCGCCTGT CAAAGCGCAC CCCGGGGGAG ATGCGGGCGG GGGCACGGGT GGAGGTCAAT ACCAGCAAAC GCAATCCCCT GGATTTCGCC CTGTGGAAGG CGGCCTGCCC CGGCGAACCA TCATGGGAAA GCCCATGGGG ACCGGGGCGA CCGGGATGGC ATATTGAGTG CTCGACCATG GCCCTCAAAT ACCTGGGCCC GGGCTTCGAT ATCCATGGAG GCGGCGCCGA CCTCATTTTC CCTCATCACG AGAATGAAAT TGCCCAGGCT GAGGCCCAGA CAGGGTGCAC CTTTGCCCGC TTCTGGCTCC ACAACGGCTT TATAACTGTA AACCAGGAAA AAATGTCCAA GTCCAAGGGT AACTTCTTCC TGGTGCGGGA CATCCTCAAA CGTTTCCGGC CCCTGGCGGT GCGCCTCTAC CTGCTGGCGA CCCATTACCG CAGTCCCATT GACTTCGATG ATGCGGGCCT GCTGGCGGCG GAGAGGGGCC TGGAGCGTCT GGAAAATACC CGCCGTCTCC TGGGCGAAGC CCGCTGCCAG CTAACTGGCA CCGGGGCGGA GACCACGGTG CCAGCAAGAA CGTCGGCCCT GGCCGGAAGG GCGGAAGAAT TACGCCAGGA GTTCATCTCC GCCATGGACG ACGACTTTAA TACCGCCCGG GCCCTGGCAG CCCTTTATGA CCTGGCCCGG GAGATCAACT CCTACCTCAA CGGGACAACA ACCATCGACC CAGCGGCCCT GAGAACGGCG GCTATAACCT TTGAGCAACT GGGGGGAGAA GTACTGGGCC TCTTTGGTCA GGCCCGGCAG CAGGTAGATG ACGAACTCCT AAGCGGGCTT ATGGACCTCA TCCTACAGGT TCGCCAGGAG GCCCGCCAGC GGCGCGACTG GGCCACGGCC GATACCATCC GGGACCGGTT GAAGGAGCTG GGGATCGTCC TGGAGGATAC CCCCCGCGGC CCGCGTTGGA AAAGGAGTTA A
|
Protein sequence | MYLYNTLTGR KEEFTPVEPG RVRMYVCGPT TYNYIHLGNA RPMVVFDTLR RYLEYRNYDV LYVQNFTDID DKVINRAREE HQAPLVIAER YIEEFFKDAD ALNVKRATLY PRVSQHIDAI IAAIATLVER GFAYVADGDV YFEVEKFPAY GRLSKRTPGE MRAGARVEVN TSKRNPLDFA LWKAACPGEP SWESPWGPGR PGWHIECSTM ALKYLGPGFD IHGGGADLIF PHHENEIAQA EAQTGCTFAR FWLHNGFITV NQEKMSKSKG NFFLVRDILK RFRPLAVRLY LLATHYRSPI DFDDAGLLAA ERGLERLENT RRLLGEARCQ LTGTGAETTV PARTSALAGR AEELRQEFIS AMDDDFNTAR ALAALYDLAR EINSYLNGTT TIDPAALRTA AITFEQLGGE VLGLFGQARQ QVDDELLSGL MDLILQVRQE ARQRRDWATA DTIRDRLKEL GIVLEDTPRG PRWKRS
|
| |