Gene Information |
Locus tag | Csal_3050 |
Symbol | |
ID | 4028046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3398731 |
End bp | 3399927 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637968262 |
Product | tyrosyl-tRNA synthetase |
Protein accession | YP_575093 |
Protein GI | 92115165 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0162] Tyrosyl-tRNA synthetase |
TIGRFAM ID | [TIGR00234] tyrosyl-tRNA synthetase |
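
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the record above.
START_BP = 3398731
END_BP = 3399927
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1197
print(expected_protein_length(GENE_LENGTH_BP))  # 398
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.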
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000057737 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGATG TCGCTGATGC GCTGGCGCTG CTCAAGCGCG GGACCCATGA AATCCTGCTG GAGGACGAGC TGGTAAAGAA GCTCGAGTCC GGCCGCAAGC TGCGTATCAA GGCAGGCTTC GATCCCACTG CGCCGGATTT GCATCTCGGG CATAGCGTCC TGTTGACCAA GATGCGCCAG TTTCAGGAGC TGGGCCATCA GGTAGTGTTC TTGATCGGCG ATTTCACCGG GCGTATCGGT GACCCCACGG GGAAAAACGT CACACGCAAG CCGCTGACCG AGGCGGAGGT CAAGGCGAAT GCCGAGACCT ACAAGGAGCA GGTGTTCAAG ATCCTCGACC CGGCGCTGAC CGAGGTGCGT TTCAATGCCG AATGGATGAG CAAGCTCTCC GCGGCCGACA TGATCGAGCT GGCGGCGCAG TCCACCGTGG CGCGCATGCT GGAACGCGAT GATTTCGAGA AGCGCTATAC CGCCAATCAA TCGATCTCCA TCCACGAGTT CCTCTACCCA TTGGTCCAGG GCTACGATTC GGTCGCACTC GAGGCCGACA TCGAGCTGGG CGGCACCGAC CAGAAGTTCA ACCTGCTGAT GGGCCGCGAG ATCCAGAAGC ACTTCGGCCA GGCGCCGCAG GTGGTCATGA CCATGCCGTT GCTGGAAGGA CTGGATGGCG TCCAGAAGAT GTCCAAGTCA TTGGGGAATT ACGTAGGTGT CGATGACACT CCCGGCGTGA TGTTCAACAA GCTGGTGTCG ATGCCGGACA GCCTGATGTG GCGTTACTTC GAATTGCTGT CACTGAAGTC CAACGACGAG ATAGAGAAGA TCCGTCGTTC CATCGAGTCA GGCGCCAACC CGCGTGATAT CAAGATGGAA TTGGCACGGG AACTGATCGC CCGTTATCAC GGCGAAGAAG CCGCCGCCAA CGCGCACAAG TCCGCGGGCA ACCAGCTGGC CGAGGGCGAG CTGCCGGATG ACCTGCCGGA CGTCGAAGTG GCGTTCGAGG GTGAGCAGGC CCCCATCGCG GCGGTGCTCA ACCGCTCAGG GCTGACCAAT AACAGCGCCC AGGCCAAGGA CATGCTCAGT AACGGACGCG TCAAGGTCGA TGGGGAAGTG GTCGCCAAGG ATGCCATGCT CGCCACGGGG CAGCGTTATG TCATTCAGGC CGGCAAGAAG CGCTACGCGC GCGTGACGCT CGTCTGA
|
Protein sequence | MTDVADALAL LKRGTHEILL EDELVKKLES GRKLRIKAGF DPTAPDLHLG HSVLLTKMRQ FQELGHQVVF LIGDFTGRIG DPTGKNVTRK PLTEAEVKAN AETYKEQVFK ILDPALTEVR FNAEWMSKLS AADMIELAAQ STVARMLERD DFEKRYTANQ SISIHEFLYP LVQGYDSVAL EADIELGGTD QKFNLLMGRE IQKHFGQAPQ VVMTMPLLEG LDGVQKMSKS LGNYVGVDDT PGVMFNKLVS MPDSLMWRYF ELLSLKSNDE IEKIRRSIES GANPRDIKME LARELIARYH GEEAAANAHK SAGNQLAEGE LPDDLPDVEV AFEGEQAPIA AVLNRSGLTN NSAQAKDMLS NGRVKVDGEV VAKDAMLATG QRYVIQAGKK RYARVTLV
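
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation such as GC content. The sketch below shows one way this might be done; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example runs only on the first two blocks of the gene sequence, so its result is not the 61% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
prefix = clean_sequence("ATGACCGATG TCGCTGATGC")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.55
```

Passing the full Gene sequence field through the same two functions would be the way to compare against the GC content field of the record.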