Gene Ssol_0549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0549 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp495833 
End bp497146 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content37% 
IMG OID 
Productpeptidase M20 
Protein accessionACX90825 
Protein GI261601222 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACGAGG AACTTTATAC TTTAATTGAA TTTCTAAAGA AACCCTCCAT ATCTGCAACT 
GGAGAGGGAA TAGATGAAAC AGCAAACTAT CTTAAGGAAA CTGTTGAGAA GTTATTAGGT
GTAAAGGCGA ATCTTGAGAA GACTAAAGGT CATCCCGTAG TATACGCTGA AATTAACGTT
AATGCCAAAA AGACACTACT TATTTACAAC CATTATGATG TCCAACCGGT GGATCCAATA
AGTGAGTGGA AAAGAGCGCC CTTTTCAGCA ACAATTGAAA ATGATAGAAT TTACGCTAGG
GGAGCCTCTG ACAATAAAGG AACATTAATG GCAAGACTAT TTGCTATTAA ACACTTACTA
GATAAGAACG AATTAAATGT TAACGTGAAG TTACTTTACG AGGGAGAAGA GGAAATAGGT
AGTGTGAATT TGGAGGACTA TATCGAAAAG AATACAAATA AACTGAAGGC AGACTCAGTC
ATAATGGAGG GAGCTGGCTT AGACCCCAAA GGAAGGCCAC AAATAGTACT AGGGGTAAAA
GGATTATTAT ACGTTGAACT AGTTCTTGAC TATGGAACTA AAGATCTACA CTCTTCTAAT
GCACCATTAG TCAGAAATCC ATGCATAGAT CTAGCTAAGA TAATATCTAC ATTGGTAGAC
ATGGGAGGAA GAGTGTTAAT TGAAGGGTTT TATGATGACG TGAGAGAATT AACAGAAGAG
GAAAGAGAGC TAATAAAGAA ATACGATATC GATGTAGAGG AATTAAAGAA GGCGTTAGGG
TTTAAGGAAT TAAAGTATAA TGAAAAGGAA AAGATTGCTG AGGCATTACT AACTTACCCA
ACATGTAATG TTGATGGGTT CGAATGCGGG TATACTGGAA AGGGTAGCAA AACTATCGTA
CCACATAGAG CATTTGCAAA ATTAGATTTT AGGCTAGTAC CTAATCAAGA TCCATATAAA
GTTTTCGAGT TACTAAAAAA ACACCTTCAA AAGGCTGGTT TCAATGGGGA GATATTAGCA
CATGGCTTTG AATATCCTGT TAGAACTTCG GTTAACTCTA CAGTAGTCAA AGCAATGATA
GAATCCGCTA AAAAAGTATA TGGTACTGAA CCACAAGTAA TTCCTAATTC AGCCGGCACT
CAACCCATGG GGTTGTTTGT GTATAAGCTA GGGATAAGGG ATGCAGTTAG CGCAATAGGT
GCTGGAGGAT ATTACTCAAA TGCTCATGCA CCCAATGAAA ACATTAAGAT AGATGACTAT
TATAAAGCTA TAAAACATAC CGAGGAATTT CTAAAATTAT ACCCAATACT ATAA
 
Protein sequence
MDEELYTLIE FLKKPSISAT GEGIDETANY LKETVEKLLG VKANLEKTKG HPVVYAEINV 
NAKKTLLIYN HYDVQPVDPI SEWKRAPFSA TIENDRIYAR GASDNKGTLM ARLFAIKHLL
DKNELNVNVK LLYEGEEEIG SVNLEDYIEK NTNKLKADSV IMEGAGLDPK GRPQIVLGVK
GLLYVELVLD YGTKDLHSSN APLVRNPCID LAKIISTLVD MGGRVLIEGF YDDVRELTEE
ERELIKKYDI DVEELKKALG FKELKYNEKE KIAEALLTYP TCNVDGFECG YTGKGSKTIV
PHRAFAKLDF RLVPNQDPYK VFELLKKHLQ KAGFNGEILA HGFEYPVRTS VNSTVVKAMI
ESAKKVYGTE PQVIPNSAGT QPMGLFVYKL GIRDAVSAIG AGGYYSNAHA PNENIKIDDY
YKAIKHTEEF LKLYPIL