Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1921 |
Symbol | |
ID | 5103308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1868233 |
End bp | 1869675 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640507809 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_001191985 |
Protein GI | 146304669 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00408] prolyl-tRNA synthetase, family I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATAT CAAGGGAGAA GTGGTCCTCT AATTTTAGCG AATGGTTTGA TTGGGTTATC TCTCAGGCTG AAATATACGA CTATGGAAGA TACCCCGTTA AGGGATCTGG CGTATGGATG CCCTACGGCT TCAAGATAAG GCAGAACGTG ACCACCCTCA TTAGGAAATT ACTTGACGAA ACAGGTCATG AGGAGGTGCT CTTTCCACTT CTTATTCCTG AGACGCTCCT CAAGAAGGAG GCAGAACACA TTAAGGGCTT CGAGAAGGAA GTTTTCTGGG TCACTCACGG AGGGGAGGAC GAACTCGAGG AAAGGTTTGC CCTCAGACCA ACGTCAGAGG TTGCGATTAC CCTGATGGAA TCCCTCTGGA TAAAGGGTTA CTCGCAATTA CCAAGAAAGT TCTACCAGAT CGTTAGCGTG TTTAGGTATG AGACCAAGGC CACAAGACCC ATGATCAGGG TTAGGGAGCT CTCCACGTTT AAGGAAGCCC ATACTGTTCA CGAGACCTTT GAGGATGCGG CCAGGCAAGT GGATGAGGCA GTAGAGATTT ACAGTAAGTT CTTTGATATT CTGGGAATTC CATACCTCAT CTCTAGGAGA CCGGAATGGG ATAAGTTCGC TGGGGCAGAG TACACCATAG CTCTCGACAC CATAATGCCC GATGGAAGAG CTCTACAGAT AGGGACGGCG CATCATCTGG GCCAGCACTT CACCAAGGCA ATGGACTACA AGGTCCAGAG GGCCGATGGT TCTCACGTTC ATCCACATCA GACAAGTTAC GGGATATCTG ACAGGGTAAT AGCAACTGTG ATCTCCATAA ACGGTGATGA TCACGGCCCC ATACTACCAC CTGTGGTAGC TCCCATTGAG GGTGTCATCA TACCGATACC TGGAAAGAGT GAAGAGGACA CCGAGAAAAT CAACAAGTAT GCCATGGAAG TGGAGTCCGT TCTCAAGAAC AGCGGAATCC GCGTGGCCCT TGACGCCTCT GAGGATAAGA CTCCTGGAGA GAAGTATTAT ATCTGGGAGT TAAAGGGCGT TCCAATCAGA ATAGAGATAG GACCTAGGGA GCTAAACTCT GGCACTGCCT TCCTTAAGAG GAGGGATACG CTAGAGGGCA AAAGCGTGAA GAGGGAGGAA CTGGTAAAGG AATTCAGGAA CCTTGAGGAT CAAATCTCCG CCGACCTTAG GAAGAGGGCA TGGGAACAGT TCAAGGAGAG GGTTAAGAGG TTCCAGAGCT TGGATGAGGC TAAAAAGTTC CTGGAGAACA GGGGAGGCAT AGCTGAGGTT CCATGGTGCG GACAGGACTC ATGCGGACTT AAGATCGAGG AACAGGTCCA GGCTAGGGTT TTGGGTACTC CCTTGAAACC TGAACCTAGC GGTAACTGCG TCGTATGTGG AAAACCTTCA ACCAACATCC TTCGAATAGC AAAAACTTAT TAG
|
Protein sequence | MKISREKWSS NFSEWFDWVI SQAEIYDYGR YPVKGSGVWM PYGFKIRQNV TTLIRKLLDE TGHEEVLFPL LIPETLLKKE AEHIKGFEKE VFWVTHGGED ELEERFALRP TSEVAITLME SLWIKGYSQL PRKFYQIVSV FRYETKATRP MIRVRELSTF KEAHTVHETF EDAARQVDEA VEIYSKFFDI LGIPYLISRR PEWDKFAGAE YTIALDTIMP DGRALQIGTA HHLGQHFTKA MDYKVQRADG SHVHPHQTSY GISDRVIATV ISINGDDHGP ILPPVVAPIE GVIIPIPGKS EEDTEKINKY AMEVESVLKN SGIRVALDAS EDKTPGEKYY IWELKGVPIR IEIGPRELNS GTAFLKRRDT LEGKSVKREE LVKEFRNLED QISADLRKRA WEQFKERVKR FQSLDEAKKF LENRGGIAEV PWCGQDSCGL KIEEQVQARV LGTPLKPEPS GNCVVCGKPS TNILRIAKTY
|
| |