Gene Information |
Locus tag | Msed_2077 |
Symbol | cysS |
ID | 5105057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1995195 |
End bp | 1996592 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507967 |
Product | cysteinyl-tRNA synthetase |
Protein accession | YP_001192141 |
Protein GI | 146304825 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0215] Cysteinyl-tRNA synthetase |
TIGRFAM ID | [TIGR00435] cysteinyl-tRNA synthetase |
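The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes the Start bp and End bp values are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1995195, 1996592)
print(length)                     # 1398
print(protein_length_aa(length))  # 465
```

Under these assumptions, both values agree with the Gene Length (1398 bp) and Protein Length (465 aa) fields in the record.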
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.786603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCAAGTAT TTAACACGCT GGGAAAGAGG CTCCAGAGCT TCGAGCCCCA CGAGCCCAAC ACCGTGAAAA TGTACGTGTG CGGTCCCACC GTCTACGACG AGGTTCACAT TGGACATGGT AGGACCTTTG TGGCCTTCGA TGCCATGAGC AGGTATCTAA GGGTAAAGGG TTACAACGTG GTAAGGGTCC AGAACATCAC GGACATAGAC GATAAAATAA TCAATAAGGC AAGGGAACTG GGGAAGAGTT GGAACGAAGT TTCAGAGTAC TACTCGAAGA GCTACCTGGA ACACATCGGT GCCCTCAAGG TAAAGATAGA CATGCATCCT AAGGTCACTA CCCACATCAA GGAGATTATC GACTTCGTTC AAAGGTTAAT TGACAGTGGG CATGCCTACG TTGCCAACGG GAGCGTCTAT TTTGATGTGG ACACTTACCC GGGTTATGGG GAGCTATCCA ACGTGAAGAA GGAGGAGTGG GATCAGGGGG AGGAGATAGT TAAGGAAAAA AGACACCCCT ACGACTTCGC GCTCTGGAAG GCGTATAAGC CTGGGGAACC ATACTGGGAG TCTCCTTGGG GTAAGGGTAG ACCTGGATGG CACATAGAGT GCTCCACCAT GTCCACGAGG TATCTAGGAA CAAAGATCGA TATTCACGGT GGAGGAATGG ACCTGGTGTT TCCCCATCAC GAGAACGAGA GAGCCCAAAC CGAATCCCTC ACCGGATCAA CTTGGGTAAA GTATTGGATG CATGTGGCCT TTCTCACAAT AAGGAAGGAG AAGATGTCCA AGTCCAAGGG CAACATTGTC CCGCTTAAGG AGGCACTGAG CAAGTATGGG CCATCTACGC TGAGGTACTG GTTTCTATCA TCCCAGTACA GGAACCCCAT AGAGTATAGC GAAGAGATCC TAGAACAAAG CTCTAGGTCC CTCCAGAGGC TTAAGGATGC CATATCCGTG CTGAGGAAAA TAATTCAGAA GGGACCAGCC CACTACGCGA AGGAGGAGGA CGTAAAGGTC CAAGAGGAAA TAGTAAGGGC TATCTCAAGG TTCGACGAAC ACATGGAGAA CGATTTTGAT ACGTCTAACG CATTGACATC AATTCACGAA ATAGCCTCAA TAGTTTTCTC AAAGCTCCAA TACAGCGAAG ACGTGTTTGG GGCTTTAATA GCGCTGGACG GATTCAGGAA GTTCAATGAG GTCTTCGCAG TTATGGATGA GGAATTTTCG GCGGAGCTAG ATAGGTTAAC CAAGGTGATC GACGCAGTGA TAGAAGTCAG GAATTACCTG AGAAAGAAAC AGATGTACGA TCTATCGGAC CAGATCAGGG ATATCCTCTC CAGGAGCGGA GTAAAAATAC TGGACTCCAA GGAAGGCTCT ACTTGGAGAT TTCAGTGA
|
Protein sequence | MQVFNTLGKR LQSFEPHEPN TVKMYVCGPT VYDEVHIGHG RTFVAFDAMS RYLRVKGYNV VRVQNITDID DKIINKAREL GKSWNEVSEY YSKSYLEHIG ALKVKIDMHP KVTTHIKEII DFVQRLIDSG HAYVANGSVY FDVDTYPGYG ELSNVKKEEW DQGEEIVKEK RHPYDFALWK AYKPGEPYWE SPWGKGRPGW HIECSTMSTR YLGTKIDIHG GGMDLVFPHH ENERAQTESL TGSTWVKYWM HVAFLTIRKE KMSKSKGNIV PLKEALSKYG PSTLRYWFLS SQYRNPIEYS EEILEQSSRS LQRLKDAISV LRKIIQKGPA HYAKEEDVKV QEEIVRAISR FDEHMENDFD TSNALTSIHE IASIVFSKLQ YSEDVFGALI ALDGFRKFNE VFAVMDEEFS AELDRLTKVI DAVIEVRNYL RKKQMYDLSD QIRDILSRSG VKILDSKEGS TWRFQ
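The gene sequence as displayed begins with TTG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. A minimal sketch of how the protein sequence could be derived from it under translation table 11 follows. It assumes the codon assignments of table 11 match the standard code and that an alternative start codon such as TTG is read as methionine at the first position; the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and only the first 30 bases of the record are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon in the first position encodes methionine.
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


prefix = clean_sequence("TTGCAAGTAT TTAACACGCT GGGAAAGAGG")
print(translate(prefix))  # MQVFNTLGKR
```

The translated prefix matches the first ten residues of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the reported GC content of 49%.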