Gene Ssol_2001 details


Gene Information

Locus tag: Ssol_2001
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1796460
End bp: 1797668
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: ACX92211
Protein GI: 261602608
COG category:
COG ID:
TIGRFAM ID:
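
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1796460
end_bp = 1797668

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1  # the stop codon encodes no residue

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the sketch reproduces the reported gene length of 1209 bp and protein length of 402 aa.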


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTAGAGA AAAATTTATT ACCGGAAATA TTACTGGCTA TACATATGCC GTTAAATAAA 
GGGTTAACTA GGGTTAAAGC TATCGTGATA ATAATTGTTG TTATAATCGC AGTGATAGCT
GGGGTTGTAG GATATTATTT AATTAATCAT CCTTCTAATT CTGTAACTAC TTCATCTTCA
TCTACTACAA CTAGTTCTTC CCTATCTAGT ACTAGCATAT CTTCATCTAC TACTAATATT
ACTTCATCTC AAGGTATTAC AGTCTTCGTA GCGGGTGCTT ATCTTGCAAT TCTCAACTAC
CTAGCTGACC AATTTCAGAA CGCTACTGAA ATTCCAGTTC ATGTTGTAGG TAGTGGCTCC
TTCGCATTAG CTTCACAAAT AGCTTCCCAG ACTCCAGTTC CAGCAAACGT TTTCATTCCA
GTTGCCTATA TTCAAGCTGT TGAGTTAACT GGCAGTAGGA ATCCCGGTTG GGCTATAGCT
TTTCTATCAG ATCAGATGAC AATAGTTTAC TCTAACTACA CTACCAAATC TCCTTATTGG
TCCCAACTAT ACTCCAATTA CACCATGGCT ATGGAGACCA ACAATACTAA GTATTGGTAT
AATTTCTTCT ACTTATTGAC CACCAGGTTC AGTCTGGGAA TTGCTAATCC TAACACTGAC
CCAGAGGGAT TATATGCGTA TTTGATACTT CAAATGGCAA GTTATTTATA TGCTAATCAT
AATATAAGCT ACTTTGTGCA TCTCGTTAAA GCGAATCCAA ATGTCAAAGT AGCCCCTAGT
ACAGCTAACT ATGTAGCGCC CTTAAAGGCG GGTACTTTAG ACTTCACATT CTCTTATGTT
TCCTATGCTG TATCTCAAGG ATTGGAATAT CTAAAACTAC CTCCTTGGTT AAGTTTTGGT
TATTATCCGA ACGAGACGAC ATGGTACAGT CAATTTGCTT ATAATATAAG TGTAAATGGC
CAAACATTAA CAATTCATGG AAATCCAGTT TACTTATACA TTACCATTCC ATTAAACGCT
TCGAATATAC AAACTGCATA TCAGTTTATT GGCTTCGTAC TGGGTCATGA ATCTCAACTT
ACCAGATTTA ATGTAATTCC AATACAACCA GCTTTATTGT ATAATGAAAC TAGTAATATT
CCGCAGCCTA TATTGAACTT GTTAAAATCT GGTGAGTTGA AGTATGCGGG TAATTTCTCT
GAAGTTTAA
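
The gene sequence is displayed in blocks of ten bases separated by spaces, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not state how its own GC content figure was calculated or rounded, so the result may differ slightly from the listed value.

```python
def clean_sequence(raw: str) -> str:
    """Join space- and newline-separated sequence blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Usage with the first row of the gene sequence as displayed above.
first_row = "TTGTTAGAGA AAAATTTATT ACCGGAAATA TTACTGGCTA TACATATGCC GTTAAATAAA"
print(len(clean_sequence(first_row)))
print(round(gc_content(first_row) * 100), "%")
```

Pasting all rows of the gene sequence into one string and passing it to `gc_content` gives the value for the whole gene.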
 
Protein sequence
MLEKNLLPEI LLAIHMPLNK GLTRVKAIVI IIVVIIAVIA GVVGYYLINH PSNSVTTSSS 
STTTSSSLSS TSISSSTTNI TSSQGITVFV AGAYLAILNY LADQFQNATE IPVHVVGSGS
FALASQIASQ TPVPANVFIP VAYIQAVELT GSRNPGWAIA FLSDQMTIVY SNYTTKSPYW
SQLYSNYTMA METNNTKYWY NFFYLLTTRF SLGIANPNTD PEGLYAYLIL QMASYLYANH
NISYFVHLVK ANPNVKVAPS TANYVAPLKA GTLDFTFSYV SYAVSQGLEY LKLPPWLSFG
YYPNETTWYS QFAYNISVNG QTLTIHGNPV YLYITIPLNA SNIQTAYQFI GFVLGHESQL
TRFNVIPIQP ALLYNETSNI PQPILNLLKS GELKYAGNFS EV
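
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes NCBI translation table 11, whose amino acid assignments match the standard code and which accepts TTG among its alternative start codons; a start codon in the first position is read as methionine. The function name `translate` and the constants are hypothetical, and this is an illustration rather than the method used to produce the record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # any start codon initiates with methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First thirty bases of the gene sequence, as displayed in the record.
print(translate("TTGTTAGAGA AAAATTTATT ACCGGAAATA"))  # MLEKNLLPEI
```

The output matches the first ten residues of the protein sequence shown above, including the leading M that comes from the TTG start codon.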