Gene Ssol_0385 details


Gene Information

Locus tag: Ssol_0385
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 347627
End bp: 349147
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 33%
IMG OID:
Product: peptidase M61 domain protein
Protein accession: ACX90670
Protein GI: 261601067
COG category:
COG ID:
TIGRFAM ID:
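
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both assumptions are consistent with the figures in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a span whose start and end are both included (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length // 3 - 1


length = gene_length_bp(347627, 349147)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```

Both results match the Gene Length and Protein Length fields reported for Ssol_0385.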


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTTT ATATAAATCC TAGGAACAAG TATATAGAGG TTTATGGAGA GGGAAGAGAG 
GGCATAATTA TTTTTCCAAC ATGGGTTCCC GGTTCTTATG TTATTAGAGA TACTGAGAGA
TATGTTGTAG AAATAGATGG GATTAGAATA GGGAAGAACA GATTCTTCGT TAAGGATAAG
TTCAGATATT TAATTCAAGC TCTTAGTAAA GATCAAAGGG AAGCAATCTC AACGAGCGAC
TATTTATTTA TAAATCCAGC ATCAGTATTC CCATTTCAGA CAATTGAAGA GGAGTACTGT
GTTAAAATTA ACGTTAGGTG GCCCATTCAT ACTACGCTAA AAAAAGTCGG AGATTGGTTT
TGTGCTGATA ATTACAATGA ATTTGCGGAC TCACCGATTC AAGCTTCACC TAAATTAAAG
TTGATCGAAA TTGATAGAAA TCATAAAATT TCTACTATTG ATGAGCTTGA TAAACCCATA
GTAGAATCTC TAGGTAAATG TACATTTGAA ATAGATCAGA ATATATTTTC CGATTCCTCA
ACAAATGAGT ACATATTCTT TTTTAGAAGA TCCGATACCG ATTTCGGCGG AATAGAGCAT
GAGAGATCTT CCTCCATTGT AATCTCATGG AACTATAAAG ATCTTATTCG TTTGTTTATT
CATGAGTATT TCCATAGATA TAATGTAAAG AAGATTAGAC CAAAAGATTT AAAGATAAAT
TACGAAAGTG AGACTTATAC TGAATTACTG TGGGTTGCAG AGGGTCTTAC AGAATATATT
GCAGTAATAG TCCCTTTGAG AACAAAAGTA GTAAAAGTTG ACGATACCTT AAATTACATT
GCTAATACAC TAGCATGGCT TACCTTCCCC GGAATAAGGA GAATGAGCTT AGCAGAGTCT
TCCTATACTA CATGGATAAA GTATTATCGC CGTGATAATA ATTTCTTAAA TGTAGGTATT
TCTTATTATC AGCTAGGTCT TATAGTTGGT CTTATAATGG ATCTAGAGAT GATTGACAGC
GGTAACTCAA TATATAATTT CTTTAGAGAA TTATATAAGA TAAGGGAATA TACGTATGGA
AACGTTAGAG ATATAGCAGA GGGGTTAGGA GTTCAAAATA TAGACGAATT GGTGTTTTCA
AGAAATCCAC CTATATTTAA CAGATTGTCG AAATTTTTTA AACTGAGTTT CGTGGACAAA
GGTTTTCCAT ACTGTGGTTT AATGCTAGAT AATAAGAAGG TAACTTTTGT TGAAGATGAC
TCACCAGCAG ATAAAGCAGG GATTATTCCG GGAGATGAAA TAGTTGGGAT AAACGGAATA
TCTTCTAGTA ATTTAAATTT AGAATGTAAG GATAAACTGG AGTTAGTTAT TAATCGAGAA
GGGAGGCTTA TAGAGTTTAC GATTATACCA GATGGAAATC CCGGACACAA TTTGGTTATA
AAAGGAAATG GAGAGATCTT CAAGAAATGG TCTGGTGGTA TAGAATATGG AGAAGGAAAG
TCAAATATAA GAATTATCTA G
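
The gene sequence is laid out in blocks of ten bases, so the spacing has to be removed before any computation. The sketch below shows one way a GC percentage like the one in the Gene Information section could be computed. It is an illustration only, not the database's own method, and it uses just the first line of the sequence as input. The function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above (60 bases).
first_line = "ATGAGATTTT ATATAAATCC TAGGAACAAG TATATAGAGG TTTATGGAGA GGGAAGAGAG"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_percent(first_line):.1f}")  # 33.3
```

Passing the full gene sequence to `gc_percent` gives the value for the whole gene.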
 
Protein sequence
MRFYINPRNK YIEVYGEGRE GIIIFPTWVP GSYVIRDTER YVVEIDGIRI GKNRFFVKDK 
FRYLIQALSK DQREAISTSD YLFINPASVF PFQTIEEEYC VKINVRWPIH TTLKKVGDWF
CADNYNEFAD SPIQASPKLK LIEIDRNHKI STIDELDKPI VESLGKCTFE IDQNIFSDSS
TNEYIFFFRR SDTDFGGIEH ERSSSIVISW NYKDLIRLFI HEYFHRYNVK KIRPKDLKIN
YESETYTELL WVAEGLTEYI AVIVPLRTKV VKVDDTLNYI ANTLAWLTFP GIRRMSLAES
SYTTWIKYYR RDNNFLNVGI SYYQLGLIVG LIMDLEMIDS GNSIYNFFRE LYKIREYTYG
NVRDIAEGLG VQNIDELVFS RNPPIFNRLS KFFKLSFVDK GFPYCGLMLD NKKVTFVEDD
SPADKAGIIP GDEIVGINGI SSSNLNLECK DKLELVINRE GRLIEFTIIP DGNPGHNLVI
KGNGEIFKKW SGGIEYGEGK SNIRII
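
The protein sequence can be checked against the gene sequence by translating it. The following is a minimal sketch that builds the codon table by hand rather than using a bioinformatics library. The sense-codon assignments of translation table 11 are the same as those of the standard genetic code, and this gene starts with ATG, so no special start-codon handling is included. `translate` is a hypothetical helper, and only the first and last stretches of the gene are used as input.

```python
from itertools import product

# Codons in TCAG order, paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAGATTTT ATATAAATCC TAGGAACAAG"))  # MRFYINPRNK
print(translate("TCAAATATAA GAATTATCTA G"))           # SNIRII
```

The two outputs match the first ten and last six residues of the protein sequence. The last line of the gene sequence is in frame because the 25 lines before it hold 1500 bases, a multiple of three.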