Gene Ssol_1878 details

Gene Information

Locus tag: Ssol_1878
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1669581
End bp: 1672031
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: valyl-tRNA synthetase
Protein accession: ACX92090
Protein GI: 261602487
COG category:
COG ID:
TIGRFAM ID:
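
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the values listed here, and that Gene Length includes the stop codon. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the gene length includes the stop codon.
    """
    span = end_bp - start_bp + 1                      # inclusive span in bp
    codons, remainder = divmod(gene_length_bp, 3)     # total codons, leftover bases
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        # One codon is the stop codon and encodes no amino acid.
        "protein_matches_length": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields for Ssol_1878.
result = check_gene_record(1669581, 1672031, 2451, 816)
print(result)
```

For this record, 1672031 - 1669581 + 1 = 2451 bp, and 2451 / 3 = 817 codons, of which 816 encode amino acids.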


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTAAACC AAGATGAAAT TTTAAAGAAG ATGGAAGAGT GGCCAAAACA CTATAACCCT 
AAGGAAATTG AGGAAAAATG GCAAAAGATA TGGCTAAGTG AGGAATATTG GAGAGACGTA
TTTAGATTCA GAGATGAGGA CGATAAAGCA CCAAGATTTG TTATTGATAC TCCTCCCCCT
TTTACTAGCG GAGAACTTCA TATGGGGCAC GCGTATTGGG TTACAATAGC TGATACTATA
GGTAGGTTTA AGAGATTAGA AGGATATAAT GTACTATTAC CTCAAGGATG GGACACACAA
GGTTTACCTA CTGAACTAAA AGTGCAATAT AAGTTAGGAA TCCCGAAAGA TAATAGGCAA
CTATTTCTCC AGAAATGTAT AGAATGGACA GAAGAAATGA TTAAGAAAAT GAAGGAGGCA
ATGATTAGAC TAGGTTATAG GCCAGAATGG GAAAGATTTG AATATAAGAC ATATGAGCCA
AAATATAGAA AAATTATCCA GAAAAGTCTT ATTGACATGT ATAAAATGAA TTTAATTGAG
ATGAGAGAAG GCCCAGTAAT TTGGTGCCCT AAGTGTGAGA CTGCATTAGC ACAAAGTGAA
GTAGGTTACT TAGAGAAGGA GGGAATTCTT GCCTATATAA AATTTCCATT AAAAGAAGGA
GGAGAAATAG TAATAGCGAC TACCAGACCT GAATTACTAG CTGCTACACA AGCTATCGCC
GTAAATCCAA TGGATGAAAG ATATAAGAAT TTAGTAGGAA AAATAGCATT GGTACCAATA
TTTAATATCG AGGTCAAAAT AATATCTGAC GCGGATGTGG AAAAGGAATT CGGAACCGGA
GCAGTAATGA TAAGTACTTA TGGCGATCCC CAAGATATAA AGTGGCAATT GAAATACAAC
TTACCGATTA AGGTTATAGT TGACGAAAAA GGAAGGATAA TAAATACAAA TGGAATACTT
GATGGATTGA AAATTGAACA AGCTAGAAAT AAAATGATAG AACTCCTAAA GACTAAAGGA
TACCTTGTTA AAGTAGAGAA GATAAAGCAC AATGTACTAT CACACGTTGA GAGAAGTGAT
TGTCTATCTC CAGTAGAATT CTTAGTTAAA AAGCAAATAT ACATTAAAGT TTTAGATAAG
AAGCAAAAAT TATTAGAAGA ATATAAAAAG ATGAAATTTA AACCGGCTAG AATGTCCTAT
TATCTCGAGG ATTGGATAAA GAGTATAGAG TGGGATTGGA ATATAACTAG GCAAAGGATT
TATGGTACGC CATTACCGTT TTGGTACTGC GAAAATGGGC ATTTAGTACC AGCTAAAGAA
GAGGACTTGC CAATAGATCC TATCAAAACT AGCCCGCCAT TAGAGAAATG TCCATTATGC
GGATCAGAAC TTAAACCAGT TACCGATGTT GCAGACGTGT GGATAGACTC TAGCGTAACA
GTCCTTTACC TAACCAAGTT CTATGAAGAT AAAAACGTTT TCAATAGGAC TTTCCCAGCA
TCACTTAGAC TTCAAGGTAC TGATATAATT AGAACTTGGT TATTCTATAC CTTCTTTAGG
ACTTTAATGT TAGCTAATAA TGTACCTTTT ACTACAGTTC TTGTTAATGG TCAAGTCCTT
GGACCAGATG GAACTAGAAT GAGTAAAAGT AAGGGAAATG TAGTATCACC ATTAGATAGA
GTTAATGATT TTGGAGCAGA TGCGATTAGA ATGGCTCTTC TAGACGCAAG TATTGGTGAC
GATTTTCCAT TTAAATGGGA TATAGTGAAA GGAAAGAAGA TGTTATTGCA AAAATTATGG
AATGCAAGTA GACTAGTCTA CCCTTTCATA GCAAAACAAA GACTTGATAA ACCTAAAAGC
CTACATATAG TAGACAAATG GATCTTACAA GAACATAAGA AATTCGTAAC TAAAGCAATA
AATGCATATG AGAATTACGA CTTTTATTTA GTACTTCAAG AGCTATATAA CTATTTCTGG
GAGATCGTAG CTGACGAGTA TTTGGAAATG ATAAAGCATA GGTTATTTGA TGACGATAAC
TCTGCAAAAT ATACTATACA GAGAATAATA AGAGATATAA TCATATTGCT TCATCCTATC
GCACCTCATA TAACAGAGGA AATTTACTCA AGGCTATTTG GCCACAAGAA GAGTGTTCTC
CTAGAAGAAT TACCAAAAGT AGATGATATT GAGGAGAATA AAAGAATAGA TGAACTTGGA
GAAGTAATAA AGAAAACGAA CTCGCTCATA AGATCAGAGA AGATAAAGAA TAGATTATCA
ATGAATACTC CAGTTAGTGT AAAATTGTAC GCTTCTAAGC AAGTTATTGA ATTAATTAAT
GAAGTGAAAG ATGACGTAAT GAAGACATTA AAGGTAACTA ATCTTGAACT AATAGAATCG
AATGAAGAAA AAGTGGAAAT TAAAACTGCT AATCAGTCCA TGGGAGTTTA G
 
Protein sequence
MLNQDEILKK MEEWPKHYNP KEIEEKWQKI WLSEEYWRDV FRFRDEDDKA PRFVIDTPPP 
FTSGELHMGH AYWVTIADTI GRFKRLEGYN VLLPQGWDTQ GLPTELKVQY KLGIPKDNRQ
LFLQKCIEWT EEMIKKMKEA MIRLGYRPEW ERFEYKTYEP KYRKIIQKSL IDMYKMNLIE
MREGPVIWCP KCETALAQSE VGYLEKEGIL AYIKFPLKEG GEIVIATTRP ELLAATQAIA
VNPMDERYKN LVGKIALVPI FNIEVKIISD ADVEKEFGTG AVMISTYGDP QDIKWQLKYN
LPIKVIVDEK GRIINTNGIL DGLKIEQARN KMIELLKTKG YLVKVEKIKH NVLSHVERSD
CLSPVEFLVK KQIYIKVLDK KQKLLEEYKK MKFKPARMSY YLEDWIKSIE WDWNITRQRI
YGTPLPFWYC ENGHLVPAKE EDLPIDPIKT SPPLEKCPLC GSELKPVTDV ADVWIDSSVT
VLYLTKFYED KNVFNRTFPA SLRLQGTDII RTWLFYTFFR TLMLANNVPF TTVLVNGQVL
GPDGTRMSKS KGNVVSPLDR VNDFGADAIR MALLDASIGD DFPFKWDIVK GKKMLLQKLW
NASRLVYPFI AKQRLDKPKS LHIVDKWILQ EHKKFVTKAI NAYENYDFYL VLQELYNYFW
EIVADEYLEM IKHRLFDDDN SAKYTIQRII RDIIILLHPI APHITEEIYS RLFGHKKSVL
LEELPKVDDI EENKRIDELG EVIKKTNSLI RSEKIKNRLS MNTPVSVKLY ASKQVIELIN
EVKDDVMKTL KVTNLELIES NEEKVEIKTA NQSMGV
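
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the tool that produced this record. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and it treats only ATG, GTG and TTG as initiation codons read as methionine, which is a simplification of the full table 11 start set. The helper names `clean`, `translate` and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO_ACIDS,
))
# Simplified set of start codons (assumption: a subset of table 11 starts).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, reading an initial start codon as M
    and stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "TTGTTAAACC AAGATGAAAT TTTAAAGAAG ATGGAAGAGT GGCCAAAACA CTATAACCCT"
last_line = "AATGAAGAAA AAGTGGAAAT TAAAACTGCT AATCAGTCCA TGGGAGTTTA G"
print(translate(first_line))  # MLNQDEILKKMEEWPKHYNP
print(translate(last_line))   # NEEKVEIKTANQSMGV
```

The gene begins with TTG rather than ATG, which is why the first residue of the protein is M even though TTG normally encodes leucine. Passing the full gene sequence to `gc_content` would give the fraction that the record reports rounded as a percentage.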