Gene Ssol_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2044 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1833289 
End bp1835232 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content36% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX92252 
Protein GI261602649 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGAAA GAGAAGCCAA AGGAGCAAGA GATTTAGGTA TTTCTTCAGA TAAACAATTA 
AGGAGAAGCT TAGGAAAATT TGAGCTATTA TACCTCTCAT TAGGAGGAAT TATAGGATCT
GGATGGTTAT TTGCATCTTT AAGTACAGCA GCTTATGCTG GCGGTGCAGC TATATTGAGC
TGGATAATTG CTGGAATTTT AGTAATGTTT GTGGGTTTAG CTTATGCTGA AATAGGTGCA
GCAATTCCAA AAAGTGGTGG AATAACAAGA TATCCGCATT ATACTCACGG AGGGCTAGTA
GGGTATATAA TTACTTGGGC TAATTTCCTT TCCGCTGCAT CAGTGCCAGC TATTGAGGCA
GCAGCAGCAA TAGAATATAT AGGATCTTAT TACCCTCAGC TAATAACTTC TGGAACTTTT
GATGGAACTA CCGTAACAAT TTTGACACCA TTAGGCATAG GATTAGCTGG TCTATTGTTA
ATTTTCTTCT TCTTTTTAAA TTACTTCGGA GTTAACATTT TAGGAAAAGT GACTCATGGT
GCAGGTTGGT GGAAATTATT GGTGCCAACA ATAACTTTTT TAGCATTATT AGCGTTGGAC
CTGCACTCAG CTAACTTTAC ATTAGGTGGA GGTTTCTTCC CATCTGCACA ATATGTAAAA
GGAGGTTCCT CTGGAATTTA TGGTTTTAGT GCAGTTCTCT TTGCTATTCC TTCTACCGGA
GTAATTTTCG CTTATCTAGG ATTTAGACAA GCAGTAGAAT ATGGGGGTGA AGGTAAAAAT
CCCAGTAAAG ATATACCATT TGCGGTGTTA GGTTCATTAC TTATAGCTAT TGCGTTATTT
ACGTTACTTC AAGTTAGCTT TATAGGAGGA ATAGATTGGA GCAAATTATA TCTTGTTAAT
AAGACTACTG GTATATTAAT CCCCGTTGTA CCAGGGAATT GGTCAGCATT AAGTACAGCA
GTCACAGCAT CTAACGTCTC AATATCTTCT GGCCCATTCT TAGTTCTAAC TCAGATTGCC
CCAGTCTCTG GTCTAGCTGC AGCGTTTTTC ACGGCTTTAG CGGTTTTATT AACTATTGAT
TCTGTAGTTT CACCATCCGG AACTGGATGG ATTTATACTG GAACTTCAAC TAGAACACTA
TACGCATTTG CTAGTAATGG TTATTTACCA GAAATTTTCT TAAAGATTGG AAAGACTAAA
ATACCAACTT ATTCACTAAT TGCAACATTA ATAGTAGGTT TCATATTCTT ACTACCATTT
CCATCGTGGT ATGCTTTAGT AGGTTTCATA TCTTCAGCGA CAGTATTAAC TTACATTATG
GGCGGAATTG GATTAGCAGT CTTAAGAAAA CACGCTAAGG AATTAAATAG ACCATTTAGA
GTACCCGCAT CAGTAATAAT TGCACCAATA GCAACACTAG CAGCTGGTTT AATAGTTTAC
TGGTCAAGTT TTGCTATTCT CTTTTACGTA TTTACTGGAA TATTCTTAGG TCTTCCATTA
TTCTTTATAT TCTATTCCAA TAGAATATTA GGGATAAATA AGGCATATTC AATAGTTGTT
GGCGTTATTA ACTTAGTAAT AGATCTAGTA ATGGCATTCT TGTTATTTGA TGAGACTAGC
GGACTTGGTG CTGCTAACAA TCTCTTCTTC GGAATTTATA TAGTAGTTAT TGCAGCTATG
CTAATTGGCG ATATGTTATT CTTGCTGAAA ACTGTACCTC CAGATGTTAA GAGAGAAATT
AATGCCGGAT GGTGGTTAAT TTACTTTATC TTAGCAATTT ACATAATATC ATATTTTGGA
GGATTTGGGC TATATACTAT AATACCATTT CCATATGATA CTATAGTTGC TGCTATTGTA
ATACTCATTG GATACTTCTG GGCTATAAGA AGTGGTTTCA GAACTCAAGC AATTCAAGAT
ATAATTCAGG CAACTAGAGA ATAA
 
Protein sequence
MGEREAKGAR DLGISSDKQL RRSLGKFELL YLSLGGIIGS GWLFASLSTA AYAGGAAILS 
WIIAGILVMF VGLAYAEIGA AIPKSGGITR YPHYTHGGLV GYIITWANFL SAASVPAIEA
AAAIEYIGSY YPQLITSGTF DGTTVTILTP LGIGLAGLLL IFFFFLNYFG VNILGKVTHG
AGWWKLLVPT ITFLALLALD LHSANFTLGG GFFPSAQYVK GGSSGIYGFS AVLFAIPSTG
VIFAYLGFRQ AVEYGGEGKN PSKDIPFAVL GSLLIAIALF TLLQVSFIGG IDWSKLYLVN
KTTGILIPVV PGNWSALSTA VTASNVSISS GPFLVLTQIA PVSGLAAAFF TALAVLLTID
SVVSPSGTGW IYTGTSTRTL YAFASNGYLP EIFLKIGKTK IPTYSLIATL IVGFIFLLPF
PSWYALVGFI SSATVLTYIM GGIGLAVLRK HAKELNRPFR VPASVIIAPI ATLAAGLIVY
WSSFAILFYV FTGIFLGLPL FFIFYSNRIL GINKAYSIVV GVINLVIDLV MAFLLFDETS
GLGAANNLFF GIYIVVIAAM LIGDMLFLLK TVPPDVKREI NAGWWLIYFI LAIYIISYFG
GFGLYTIIPF PYDTIVAAIV ILIGYFWAIR SGFRTQAIQD IIQATRE