Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1935 |
Symbol | argS |
ID | 4810793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2310240 |
End bp | 2311934 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640107351 |
Product | arginyl-tRNA synthetase |
Protein accession | YP_001038346 |
Protein GI | 125974436 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0018] Arginyl-tRNA synthetase |
TIGRFAM ID | [TIGR00456] arginyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000950947 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATG TAGTGGAGAC TATAAAAAAA CAAATTAATG AAGTGGTTAA AAATTCAATA TCAAAGGCCG TTCAAAACGG AGAACTTCCG CAATTTACGG TGGACGAGCT GTTTATAGAA ATTCCGAAGG AAAAGGGACA TGGCGATTTT TCAACCAATA TTGCCATGCA GGCGGCAAAA ACGGTAAGAA AAGCACCCAG ACAGGTGGCT GAAATAATTA TAAAAAATAT GGACTTAAGC AATACATACA TTGACCGGGT TGAAGCGGCA GGACCGGGAT TTATCAATTT CTTCCTTACA AATGCATGGC TGTATGACGT TCTGAAGGTT ATCCAGAAAG AAAAAGAAAA TTACGGAAAT CTGGATATCG GACGTGGCCA GAAAGTCATG GTTGAGTTCG TCAGCGCCAA CCCCACCGGG CCTTTGCACA TGGGCAATGC CAGAGGCGGA GCGTTGGGAG ATTGTATTGC CAGTGTTCTT GAGAAAGCCG GATATGATGT GACCAGGGAG TTTTATATCA ATGATGCCGG AAACCAGATT GAAAAATTCG GCATTTCGCT TGAAGCAAGG TATATTCAGC TTTTGAAGGG AGAGGATGCC GTAGAGTTCC CGGAAGATGG CTACCATGGC GAGGATATTA TTGACCATAT GAAGGCGTAT ATTGAAGAAA ACGGGGACAA TCTGCTTTAC GTTGACAGTG AAGAAAGGCG CAAAACCCTT GTTGAATATG CTTTGCCGAA AAATATTGAA CGGATAAGAA AATCCCTGGA AAACTACGGC GTGGTCTTTG ATGTCTGGTT TTCTGAGCAG TCCCTGTATG ACAACGGCGA AGTCCGGGAG ACATTGGATA TTCTCAAAGA AAAGGGATAT ACCTTTGAAA AGGACGGAGC TGTCTGGTTT AAAGCTTCCG CCCTGGGAGC GGAAAAAGAT GAGGTAATTG TGAGGAACAA CGGAATCCCG ACATATTTTG CGGCCGACAT AGCCTATCAC CGCAACAAAT TTTTAAAGCG CAAGTTTGAC AGGGTTATAA ACTTGCTGGG TGCGGACCAC CACGGACATG CGGCAAGAAT GAAATGTGCT TTAAAAGCCT TTGACATTGA CCCGGACAAG CTTGATATTG TAATATTCCA GCTGGTGCGC CTTTACAGAA ACGGCGAGAT AGCCAGAATG TCCAAAAGGA CGGGCAGGGC AATATCCCTG GACGATCTTT TGGAGGAAGT CGGAAGAGAT GCCGCAAGAT TCTTCTTTAA CACGAAAGCT TCAGGAAGCC ACCTGGACTT TGATTTGGAC CTTGCAGTTA AAAAATCAAA CGAAAATCCG GTATATTATG TTCAGTATGC TTACGCCCGA AGCTGCAGCA TGCTGAGACT TTTAGAGAGC GAGGGCTTTA AAGTGCCGGA TGTTGACTCG GTCGACCTTA CAGTTTTGAA GGCACCTGAG GAAATTGAGC TGATGAAAAA GCTTTCCGAA TACCCTGAAG AGATAAGAAT TTCGGCCCAG ACTTTGGAGC CCAGCAGGCT TACAAGATAT GTTCTTGATG TTGCATCAAA TTTCCACAGC TTTTACAATG CCTGCAGGGT AAAAGGCGAG GAAGAGAATT TAATGTATGC AAGAATGATA CTTGTGGACA GTACAAGACT TGTTATAAAA AATGTGCTGG ATGTGCTCAG CATAACGGCT CCTGAAAAAA TGTAG
|
Protein sequence | MTNVVETIKK QINEVVKNSI SKAVQNGELP QFTVDELFIE IPKEKGHGDF STNIAMQAAK TVRKAPRQVA EIIIKNMDLS NTYIDRVEAA GPGFINFFLT NAWLYDVLKV IQKEKENYGN LDIGRGQKVM VEFVSANPTG PLHMGNARGG ALGDCIASVL EKAGYDVTRE FYINDAGNQI EKFGISLEAR YIQLLKGEDA VEFPEDGYHG EDIIDHMKAY IEENGDNLLY VDSEERRKTL VEYALPKNIE RIRKSLENYG VVFDVWFSEQ SLYDNGEVRE TLDILKEKGY TFEKDGAVWF KASALGAEKD EVIVRNNGIP TYFAADIAYH RNKFLKRKFD RVINLLGADH HGHAARMKCA LKAFDIDPDK LDIVIFQLVR LYRNGEIARM SKRTGRAISL DDLLEEVGRD AARFFFNTKA SGSHLDFDLD LAVKKSNENP VYYVQYAYAR SCSMLRLLES EGFKVPDVDS VDLTVLKAPE EIELMKKLSE YPEEIRISAQ TLEPSRLTRY VLDVASNFHS FYNACRVKGE EENLMYARMI LVDSTRLVIK NVLDVLSITA PEKM
|
| |