Gene Ssol_1255 details


Gene Information

Locus tag: Ssol_1255
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1164356
End bp: 1165636
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 32%
IMG OID:
Product: histidyl-tRNA synthetase
Protein accession: ACX91491
Protein GI: 261601888
COG category:
COG ID:
TIGRFAM ID:
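
The length fields above follow from the coordinates by simple arithmetic. A minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the function names are hypothetical, chosen only for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the record above.
START_BP = 1164356
END_BP = 1165636

length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1281 426
```

Both values agree with the Gene Length and Protein Length fields in the record.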


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.505298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAAGT TTGAAACAGT AAGAGGCATG AAAGATTATA TTGGAATCGA TGCAGAAAAA 
ATTAGATACT TGGAATCTAC CTTTAGAGAC TTAGCAAAAA AATATGGATA TTCTGAAATT
ATAACGCCAG TAGTAGAAGA ATTTAAACTG TTTGAGTTGA AGGGTGGGGA GGAACTAAGA
CAAACGATGT ATGTGTTTAA GGACAAGGCA GATAGAGAGA TATCGTTACG ACCTGAAATA
ACACCAAGTG TAGCAAGAGC ATACATACAA AATTTACAGA GTTCGCCAAA GCCGATAAGG
CTATTTTACT TTGGTACCGT TTATAGGTAT GACGAACCCC AGTACGGCAG ATATAGAGAG
TTCAGACAAG CCGGAATAGA AATGATAGGT GATTCTTCCA TCTTAGCTGA TGTAGAAGTA
TTAGATTTAT TGTACAATTT TTATGATAAG CTTAATCTTT CTAAGGATAT AACAATTAAA
ATAAATAACA TTGGTATATT TAGAAAAATA ATGGATAAAT ATAATATCGA AGATAATCTA
CAAGAGCATG TTCTGCATTT AATAGATAAG AATAAGGTTG ACGAAGCTTT AGTTATTCTT
GAAAAAAATA TAAAGAATAA GGATATAATG GACTTTTTAA ATATGATCCT TACTAAAAAA
GAGGCAAAAC TAGAAGATAT AGAATCCTTA GCTGAATTAG AGGAAGTTTC AAAATTAGAT
ATTAAAAACG AATTTGAATA TCTACTTCGA TTATCTAGAA TTTTAAGCAG CTTAAATGTA
AAATTTAAGG TTGACCTAGG TTTTGTAAGA GGATTAGCTT ATTATACTGG ACTAATATTT
GAGGTTCTTC ATCCCTCTGT TCAGTTTAGC ATTGCTGGAG GAGGAAGATA TGATAAACTT
ATAGAGCTCT ATGGTGGCTT ACCCTCACCA GCAATAGGAT TCGCTATAGG AGTTGAGAGA
ACTTTATTAG TAATTAAAGA TCTGAAAGTT GAAGAACCAA TAAATGTGAT AGTAGTAGGC
ATCTCAGAGG AGGCAATACC AGCTATGTTT ACGGTATCCA GAATGTTAAG AAAGGAAGAA
TATAAGGTAG TAATAAATAC TAAAGATCAG CCTCTCTCTA AACTATTACC TTATTATGCT
TCCCAAGGAT TTAAACTCGC AATAATAATA GGTAAACAAG AACTTGAGAA AAATATGATA
ACAGTTAGAA ATTTAATTAC ACGAAAACAG ATTTCTATCC CACTAGAGAA CGTTCTAGAT
GCAATAAAAC AAACGTTATA A
 
Protein sequence
MVKFETVRGM KDYIGIDAEK IRYLESTFRD LAKKYGYSEI ITPVVEEFKL FELKGGEELR 
QTMYVFKDKA DREISLRPEI TPSVARAYIQ NLQSSPKPIR LFYFGTVYRY DEPQYGRYRE
FRQAGIEMIG DSSILADVEV LDLLYNFYDK LNLSKDITIK INNIGIFRKI MDKYNIEDNL
QEHVLHLIDK NKVDEALVIL EKNIKNKDIM DFLNMILTKK EAKLEDIESL AELEEVSKLD
IKNEFEYLLR LSRILSSLNV KFKVDLGFVR GLAYYTGLIF EVLHPSVQFS IAGGGRYDKL
IELYGGLPSP AIGFAIGVER TLLVIKDLKV EEPINVIVVG ISEEAIPAMF TVSRMLRKEE
YKVVINTKDQ PLSKLLPYYA SQGFKLAIII GKQELEKNMI TVRNLITRKQ ISIPLENVLD
AIKQTL
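
The sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the gene sequence could be stripped of whitespace, checked for GC content, and translated. The helper names are hypothetical. The codon table is the standard one, on the assumption (true for translation table 11) that it assigns the same amino acids to codons and differs only in permitted start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGGTAAAGT TTGAAACAGT AAGAGGCATG AAAGATTATA TTGGAATCGA TGCAGAAAAA"
print(translate(clean_sequence(first_line)))  # prints: MVKFETVRGMKDYIGIDAEK
```

The printed residues match the first two blocks of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.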