Gene Ssol_2571 details


Gene Information

Locus tag: Ssol_2571
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2362647
End bp: 2363786
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: peptidase U32
Protein accession: ACX92684
Protein GI: 261603081
COG category:
COG ID:
TIGRFAM ID:
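
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Ssol_2571.
length = gene_length_bp(2362647, 2363786)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.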


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.746078
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTTGG TAGTTGCGAC AAATTTTGAT GATTCCTTGT TAGAAGGATT AAAGAGGTAC 
CCGGATGTTA AATATATTTT CGGTAGTTTT AAGAGGACTA TAACAGGACA TGGTAGGGCT
GGTTTTATTG TACCCCATAT TAAGGAAGAA CAATTCGAGA CTCATATTAG TTTAGCGCAC
TCCTATGGTA TAAAATTTCT TTACACAATG AATACTAACA CGTTATTAGG CAAGGAATAT
GATACGGAAT TTATTGGTAA AGTAATGAAA GAGGTTGACA AGTTAGTAAA TTTCGGAGTT
GATGGTTTTA TAGTTGCATT GCCGTTTCTG ATAAGACTTA TAAGAACTGA ATACCCTGAC
TTGGAAGTTT CTGCGTCTTC CTTTTCTAGA ATTCGGAACG TAAGGGAAGT TGAGGAGTAT
ACGAATTTAG GCGTTAACAC TATAATTATG CATGAGGATG CAAATAGGGA TTTCAAATTA
CTGAAAGAAG TGGCTGCATT ATCAAGAGCT AATAGATTTG AAATAGAATT AATACTTAAT
AATTCTTGTC TTTATGGATG TCCATTTAGA CTTACACATG ATAATATTTC CTCAGTCACT
TCAATGGTAA ACGGAGTAAA TGACGTTTGG TTTGAGTACC CCGTACTGTT ATGTGCAACC
GATGTTTTAA ACGACCCAGC GAATTTGATA AGGAGTAGGT GGATTAGACC AGAGGATATA
AAATACTATG AGGAGATAGG GATAAATAGA TTTAAAATTG CAGGTAGAAA TAAAAAGACG
GATTGGATAT TAAGAGTAGT AAAAGCTTAT GCTGAGAGGA AGTACGAAGG AGATCTCTTA
GATCTCGTTA GCTATCCTCA AGGGAGAGCA GCTACTAAGG CAGTTCAGAT GGTTAATGGA
CCTTCATCCT ACTTTATACT GACTTCGGTA AGGATAGATA ATACTAAGTT CCCTAAGGGA
TGGATAAAGT TCTTTTTCAC TAACGATTGT GATACGAGAA GCTGTAAGGA ATGTAAATAT
TGCGATATCG TAGCTGAAAG AGTAATGACT GTAAATGGAG AACCGTTTAA GAGCAGTGAA
TGGAGCATAA GGCAACCTTA TCCGATCAAT ATAATACCGA AATTTAAAGA AAGAAAATAA
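
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as GC content. The sketch below is one possible way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied with its block spacing.
first_row = "ATGAAGTTGG TAGTTGCGAC AAATTTTGAT GATTCCTTGT TAGAAGGATT AAAGAGGTAC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

To compare against the GC content field of the record, pass the full 1140 bp sequence rather than a single row; the value for one row alone may differ from the gene-wide figure.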
 
Protein sequence
MKLVVATNFD DSLLEGLKRY PDVKYIFGSF KRTITGHGRA GFIVPHIKEE QFETHISLAH 
SYGIKFLYTM NTNTLLGKEY DTEFIGKVMK EVDKLVNFGV DGFIVALPFL IRLIRTEYPD
LEVSASSFSR IRNVREVEEY TNLGVNTIIM HEDANRDFKL LKEVAALSRA NRFEIELILN
NSCLYGCPFR LTHDNISSVT SMVNGVNDVW FEYPVLLCAT DVLNDPANLI RSRWIRPEDI
KYYEEIGINR FKIAGRNKKT DWILRVVKAY AERKYEGDLL DLVSYPQGRA ATKAVQMVNG
PSSYFILTSV RIDNTKFPKG WIKFFFTNDC DTRSCKECKY CDIVAERVMT VNGEPFKSSE
WSIRQPYPIN IIPKFKERK
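
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by NCBI translation table 11). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence from the record.
first_row = "ATGAAGTTGG TAGTTGCGAC AAATTTTGAT GATTCCTTGT TAGAAGGATT AAAGAGGTAC"
print(translate(first_row))  # MKLVVATNFDDSLLEGLKRY
```

The output matches the first 20 residues of the protein sequence above. Translating the last 21 bases of the gene (`AAATTTAAAGAAAGAAAATAA`) gives `KFKERK`, which matches the end of the protein and shows the terminal `TAA` stop codon being dropped.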