Gene Information |
Locus tag | Msed_2152 |
Symbol | |
ID | 5104891 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 2066156 |
End bp | 2067214 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640508043 |
Product | tyrosyl-tRNA synthetase |
Protein accession | YP_001192215 |
Protein GI | 146304899 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0162] Tyrosyl-tRNA synthetase |
TIGRFAM ID | [TIGR00234] tyrosyl-tRNA synthetase |
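The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 2066156
end_bp = 2067214

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1059 bp) and Protein Length (352 aa) fields.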
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000510287 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000433253 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGATAACTA GAAATACCGC GGAGGTTGTT ACTCCCGAGG AATTAAAAGA AGCCTTGGAA AGCGGTAGAA AACTTAAGGG ATATCTTGGT TTTGAGCCCA GCGGTCTATT CCATATAGGA TGGTTAATCT GGGCTCAAAA GGTGAAGGAC CTATTGCAGG CCGGAGTGGA CATGAGCCTC CTGGTCGCAA CGTGGCATGC CTGGATAAAT GACAAACTAG GGGGGAATAT GGAAATGATA AAGCTAGCAG GCGATTATGC TATTACCGTA TTGGACAGTT TTGGAATTAG CAGGAGCAAG GTTCACGTCA TAGATGCTGA GGATATGGTG AAGGACAAGG ATTACTGGTC ATTGGTAATA AGGGTGGCAA AGAACACGAG CCTGGCTAGG ATGAAGAGGG CACTCACTAT CATGGGAAGG AAGGCCGATG AAGCTGAGCT AGATTCCTCT AAGCTGATTT ATCCTGCGAT GCAGGTGAGT GATATCTTCT ATATGGACTT AGATATAGCG CTGGGTGGAA CGGATCAAAG GAAAGCTCAC ATGCTTGCAA GGGACGTAGC TGAGAAGCTT GGCAAGAAGA AGGTAATAGC AATTCACACG CCACTCCTGG TTGGCTTACA GGGAGGGCAG AGGATGAACC CTGGAGTGGA CGAAGATGAC GCCTTGGCTG ACATAAAGAT GAGTAAATCC AAGCCTGAGA CTGCCATATT CATCAACGAC GAGCCTGAAG AAGTGGAGGG TAAATTGATG ACAGCATACT GTCCCAAGGG AGTTGTGGAG AATAACCCGG TGTTACAAAT TAACAAGTAC ATCCTATTCC AGGTCGATGA TAGGGGACTT AAGGTAGAGA GGGATGCTAA GTTTGGCGGG GATGTACAGT TCAACACCTA TGAAGAGCTG GAGAAAGCCT ACGCTGAAGG GAAATTACAT CCCAAGGACC TTAAGGTTGC AACTGCAAGA AAGCTTAACC AGATAATAGA TCCTTTAAGG AAGTCTATTA AATCTAGACC TGAATATGAT AAACTAGCAA AAGAAATAGC AAGGAGTGTT AGCAGGTGA
|
Protein sequence | MITRNTAEVV TPEELKEALE SGRKLKGYLG FEPSGLFHIG WLIWAQKVKD LLQAGVDMSL LVATWHAWIN DKLGGNMEMI KLAGDYAITV LDSFGISRSK VHVIDAEDMV KDKDYWSLVI RVAKNTSLAR MKRALTIMGR KADEAELDSS KLIYPAMQVS DIFYMDLDIA LGGTDQRKAH MLARDVAEKL GKKKVIAIHT PLLVGLQGGQ RMNPGVDEDD ALADIKMSKS KPETAIFIND EPEEVEGKLM TAYCPKGVVE NNPVLQINKY ILFQVDDRGL KVERDAKFGG DVQFNTYEEL EKAYAEGKLH PKDLKVATAR KLNQIIDPLR KSIKSRPEYD KLAKEIARSV SR
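The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, where TTG can act as an alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first ten codons and the last three codons of the gene sequence. The `translate` helper and the `ALT_STARTS` set are hypothetical names for this example, and `ALT_STARTS` lists only a subset of the start codons that table 11 allows.

```python
from itertools import product

# Amino acids in NCBI order (bases T, C, A, G; first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the table 11 start codons, enough for this record.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above.
head = translate("TTGATAACTA GAAATACCGC GGAGGTTGTT")
# Last three codons of the gene sequence above (AGC AGG TGA).
tail = translate("AGCAGGTGA")
print(head, tail)
```

The first call yields the same ten residues that open the protein sequence, and the TGA stop codon at the end of the gene is not translated.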