Gene Ssol_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2237 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2016902 
End bp2018503 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content37% 
IMG OID 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionACX92426 
Protein GI261602823 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.822439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTC CCCCAATATT AAGACTGAAG CTACTTTGGG TTGCGCTAAT TATATTCATT 
TTTCTTTTAG GGAGTATTGC ATATGGTTTC TTTACAGCGC CTAAGGCAAA CCCTTATACA
CCAGCTGGTC AGTATGCAGC TTCACCATAT GCAGTGCCTT CGTGGGCTTC AATCTTCTAC
GGTAATCTTC CACCTGACAT AAAAGTACCT AATGATTATG ATCTGATTGC TGCGAAGACA
GCATCTGTTA TAAATTATTG GCATTTGTCT AACTTTACTT TTAACGGAGA TGCTGTTATA
ATAATTTATA ATAGTAGCTA TGGACCCAAG GGAGAAACTA ACTTCCAAAA GACACTTTAT
TCTTATGGAA ACACTGGAAA CGGGAGTGTA GAGATAATAA TAGAAGGAAC TAATCCATTA
AATTTAACTC TGTATCATGA CTTTCTTTAT AACTATTTAT TGCCTAGAGA GACAAAATTT
GGAGACTATG AATTCTATAT CATCCAAGCC AGTATTTCCG CTTATGCTAC AAACGCTTAC
TATACTTTTA ATGGATATGT GATAAATCCG TCAAATGCCA CATTTTGGTT GTTCTTAGCT
GGGAACTACT TGCCAACAAA TTTGGTTACG CTCTCAACTG TATTTAAGTA CTTAGGGAAT
GGAGGATGGA ATTACATATT AGCATCTTCG GCATCAGCTG GAGAAACTCC ATGGTTCTAT
ACCTCAAATA TACCTCCTAA TGAAAGCGCA GTAGCTTCTG TTATAATGCT TCAAAGCATG
TTTAATAGTA CCGGAAATTA CAAGGTGGAA TTCACCATAA ATTATATCCC AAACGGACCT
AACTCCAAAT TAGTAGTTTA CTTATCTGAC CTTTATTTCG AATTTCTTGG TAGTAGATAT
GGAGTATTGG GTACTGATAA TAATGGGGCT AGTGTGTTTG CAGAGTATTC ACAAGGTGGA
ATATTCGACC TTGAACTCGC AATACTTGCT GGGTTGGCAA TAGTATTCAT AGGTGCGGTA
TTTGGCTTAT TTGCTGGTTA TTATGGTGGG AAGTTGGATC AAATATTAAC CTCATTTACT
GATTTCATAC TACTACTCCC TGGGTTAGCA ATACTAATAG TCTTAATAAC CATATTCCAA
CAAATCTTTA CAGTTTTCCC TAAGGATATT TTAATCATAA TAGTTTTAGT AATTCTAAGC
TGGCCTCCTA CGTCTAGAAC AATAAGAGGA CAAGTTCTCC AAGTTAGAAA TATGGCATTC
GTTGAAGCCG CTAAGGCTTT AGGAATGTCC AATATGGAAA TCATAAGAAA GCACGTGTTA
AGGCACGTTT TCCCAATAAT TATAGCACAG CTAATCTTCG ATATTCCAGC TGTGATAGGT
ATAGAGTCAG CCCTAGACTT TCTAGGTATT GGAATACTTA AATTCCCAAC ATGGGGTAAT
ATGTTAGGGT TCTCAATTAA TGCCTCACTA GACGCTCCTG GATTTGCATG GTGGTGGATT
CTGACACCAG GTATAGCGTT ATTTCTCTTG GGCGTTAGTC TATTTTATAT AGGTGAGGCG
ATAACTAGGT ATTATGGAAG TTTAGTTGGT GAGACCCATT GA
 
Protein sequence
MRIPPILRLK LLWVALIIFI FLLGSIAYGF FTAPKANPYT PAGQYAASPY AVPSWASIFY 
GNLPPDIKVP NDYDLIAAKT ASVINYWHLS NFTFNGDAVI IIYNSSYGPK GETNFQKTLY
SYGNTGNGSV EIIIEGTNPL NLTLYHDFLY NYLLPRETKF GDYEFYIIQA SISAYATNAY
YTFNGYVINP SNATFWLFLA GNYLPTNLVT LSTVFKYLGN GGWNYILASS ASAGETPWFY
TSNIPPNESA VASVIMLQSM FNSTGNYKVE FTINYIPNGP NSKLVVYLSD LYFEFLGSRY
GVLGTDNNGA SVFAEYSQGG IFDLELAILA GLAIVFIGAV FGLFAGYYGG KLDQILTSFT
DFILLLPGLA ILIVLITIFQ QIFTVFPKDI LIIIVLVILS WPPTSRTIRG QVLQVRNMAF
VEAAKALGMS NMEIIRKHVL RHVFPIIIAQ LIFDIPAVIG IESALDFLGI GILKFPTWGN
MLGFSINASL DAPGFAWWWI LTPGIALFLL GVSLFYIGEA ITRYYGSLVG ETH