Gene Ssol_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1431 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1320909 
End bp1322069 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content36% 
IMG OID 
Producttryptophanyl-tRNA synthetase 
Protein accessionACX91664 
Protein GI261602061 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.494991 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTTTC TTACTACAAT GCCAGATGAA TTTACTGTAA CTCCTTGGGA AGTTAAGGGT 
AAAGTTGATT ATGATAAACT AATTGTTCAA TTTGGTACTC AGAAAATTAC AGAAGAGCTG
AAACAAAGAA TTAAGAACTT AGCTGGGGAT TTGCATGTCA TGCTCAGGAG AAACGTATTT
TTTTCTCATA GGGATTTAGA TTTAGTTTTA AATGACTATG AGAAAAGTAA AGGATTCTTC
CTATATACTG GAAGAGCGCC TTCCTTAGGT ATGCATATAG GACATCTGAT ACCATTCATA
TTTACCAAAT GGCTACAAGA GAAATTTAAT GCTAATTTAT ACATTGAGAT AACTGACGAC
GAGAAGTACA TGAGAAATCC AGAATTTACA TTAGATCAAA CTAGGAGTTG GGCTTACGAT
AATATTTTAG ATATAATCGC TGTTGGCTTT AATCCCGATA AAACGTTCAT CTTCCAAGAT
ACAGAGTACA TAAGGAATAT GTATCCTATA ACAGTGAAAA TAGCAAAGAA GCTGACGTTT
TCAGAAGTAA GAGCTACTTT TGGATTAGAC GCATCCTCAA ATATAGGTCT CATATTTTAC
CCAGCCCTAC AGATAGCTCC TACCATGTTT GAAAAGAAGA GATGTCTAAT ACCAGCCGGT
ATAGATCAAG ATCCCTATTG GAGATTGCAA AGGGATATAG CGGAAAGCCT TGGGTATTAT
AAGGCTGCGC AGATACATAG TAAATTCCTT CCCCCACTCA CGGGTCCAGA GGGCAAGATG
AGTTCTTCAA ACCCAGAAAC GGCAATATAT CTTGTAGATG ATCCTAAAAC CGTGGAAAGG
AAAATCATGA AATACGCATT TTCAGGGGGA CAACCCACAA TAGAGTTACA TAGGAAATAT
GGCGGAAACC CGGAAATAGA TGTTCCCTTT CAGTGGTTAT ATTACTTCTT TGAGGAGGAT
GATAATAGGA TTAAGGAGAT TGAGGAGGAG TATAGATCAG GCAAGATGTT AACCGGTGAG
TTAAAACAGA TATTAATAGA CAAACTAAAT AATTTCTTAG AAGAACACAG AAGAAGGAGG
GAAGAAGCAA AAGAACTTGT ACATGTATTT AAATATGATG GTAAATTAGC TAAGCAGATG
TGGGAGAAGA TTCACGAATA G
 
Protein sequence
MYFLTTMPDE FTVTPWEVKG KVDYDKLIVQ FGTQKITEEL KQRIKNLAGD LHVMLRRNVF 
FSHRDLDLVL NDYEKSKGFF LYTGRAPSLG MHIGHLIPFI FTKWLQEKFN ANLYIEITDD
EKYMRNPEFT LDQTRSWAYD NILDIIAVGF NPDKTFIFQD TEYIRNMYPI TVKIAKKLTF
SEVRATFGLD ASSNIGLIFY PALQIAPTMF EKKRCLIPAG IDQDPYWRLQ RDIAESLGYY
KAAQIHSKFL PPLTGPEGKM SSSNPETAIY LVDDPKTVER KIMKYAFSGG QPTIELHRKY
GGNPEIDVPF QWLYYFFEED DNRIKEIEEE YRSGKMLTGE LKQILIDKLN NFLEEHRRRR
EEAKELVHVF KYDGKLAKQM WEKIHE