Gene Ssol_2139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2139 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1924011 
End bp1925264 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content35% 
IMG OID 
Productamino acid permease-associated region 
Protein accessionACX92342 
Protein GI261602739 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.578825 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGTAGAA GAAAACTTTC AGTTTTTGAA GCCTTTTCAC TGTCCTTTGG AGGACAAGCG 
CCCTTTACCT CAATAATTAC ATTTGGTACT ATAGGTCTAC AACTTGGTGG ATCCTTCTTA
ACTATTGCCA CAATTATTGG GACGATTCTA GTGTTAGTGA ATGGCATAGT AATATATAGA
TTGTCGTTAA GGTATTCCCA ACATGGAGGA TACTTCACCT ATGCCTTCTA TTCGCTTACC
GAAAGGCTAG GATTAGTAAC TGGATGGCTA TTTTTACTTT ATGCATTTAG TTATGGTGGT
ACATTATTAG CTGGTTCAAT TTACATTATA ACAAGTTATT TAAAGATAAG TGCTGACCTA
GTTGCATTTC TAGTCATATT ATTTTCAGCA TTTCTGGTTA TAAGGGGTTT AGATGTTTCC
GTTAAATACG CCGAGTTCAT AAGTATTGCT GAGATAGTTG CAATAATTGT CAGCTCAGTT
GTGCTATTGT TAGGGACGAA ACCAAGTTTT AACTTAACGA TTCCGACTAA TCCCTTTCTA
GTTATACTTT ACGCCATAGG AATGCCCATA GGTTATGGAA ATTTGAACCC AATGAGTGAA
GATATAAAGA ATGCGAAGAA AATTGTGGGG ATAATTACTG TTATCGTGAT ACTTTTAGGT
GGATTGCTAT CAGCTTTGCT TTTTTACGCC AGTGCGCTAT ATGGGACTGA TTTGATAGAA
ATTCTTTTAG ATAAGGTTGG ATTCATATTT CCCTATTTGA TCTTCTCAGC TTTAAATGGT
GGAATATTGG GTGGAATAGC CTATATTATA GCGATGTCTA GGATCCTTCA TGCAATGTCA
TTAAAGAATC TTATGCCATC GATTATTTCA TCGGTTAAAT ATAATAGACC ATATAACGCT
GAGGTCATAT CACTCATTAT CTATACTGTT ATTTTGTTTC TCCTAACTCA CTTCGTTGGG
CTATATACAA CCTTTCTAGT TTTAGGGGGA CTTACGGTAT TAAGCTATTT GATAATATCA
CTTTCAGCCA ATCTTTCGCT ATTTAGGATA GCGTTGAAAA AACTAAGGAA GAGAAAAATG
GAAATAACGA TTGCGATCAC TTCTACGTTA TTATCCTTAA TAATATTAGT GTATTCGATA
CAAGAAAACA CTCCCATAAT TAACTACATA TTCTTCGCTT GGATAATTGC TGGGTTTATC
TATGCAGAAG TACTTGAAAT AGCAGGAAAT AATGGAAAAG ATGATGAAGA TTAA
 
Protein sequence
MSRRKLSVFE AFSLSFGGQA PFTSIITFGT IGLQLGGSFL TIATIIGTIL VLVNGIVIYR 
LSLRYSQHGG YFTYAFYSLT ERLGLVTGWL FLLYAFSYGG TLLAGSIYII TSYLKISADL
VAFLVILFSA FLVIRGLDVS VKYAEFISIA EIVAIIVSSV VLLLGTKPSF NLTIPTNPFL
VILYAIGMPI GYGNLNPMSE DIKNAKKIVG IITVIVILLG GLLSALLFYA SALYGTDLIE
ILLDKVGFIF PYLIFSALNG GILGGIAYII AMSRILHAMS LKNLMPSIIS SVKYNRPYNA
EVISLIIYTV ILFLLTHFVG LYTTFLVLGG LTVLSYLIIS LSANLSLFRI ALKKLRKRKM
EITIAITSTL LSLIILVYSI QENTPIINYI FFAWIIAGFI YAEVLEIAGN NGKDDED