Gene Ssol_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0847 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp790751 
End bp792154 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content37% 
IMG OID 
ProductCarboxypeptidase Taq 
Protein accessionACX91095 
Protein GI261601492 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.202747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGAAG ACATTTGGGC AATTGAACAC GCAATAAGCT TACTGGATTG GGATATCCAA 
ACTTACATGC CCCAATCTGG GATTAAGGCT AGGGGAGAGG CTTTAGCCAG GCTAAGTAAC
TTAAGGAGGA AATTGTTGTT AGGCATTAGA GGCGAGATAG AAAAGTTAGA GCCAAAAAAT
GATATTGAAA AGGGTTTAAA AAGAGTTTTA GAAAGAGAAT ATAAGTACTA TGACGCTGTG
CCAGAAGAGT TGGATATGAA ACTTCATAGA ATAACATCTG AAGCTACTGT AGTTTGGAGA
AACGCTAAAG CTAAAGGCGA TTTTAACGCA TTCAAACCTT ATTTAGAGCA AATACTTGAG
ATTAAGAGAG AGATAGCACA TAAGCTAGGG TATAAGGATC ATCCATATAG TGCACTTTTA
GATAGGTATG AAGAAGGGTT TACTGTCACC GATGCTGAAA GGGTATTCAA CGAGTTATTA
CCCGGTTTGT CTAAGATTCT CAATAAGATC GATGATAAGT TTACTAGAAA ATATCATTTT
GAGGATGAAA AATATGATGT TTTTCAGATG AGTAAAACCA TAGAGGCAAT AGCTTATGAG
GTACTAAAGA TGCCTAAGGA TAGATTTAGA ATAGACGTTT CTCCTCATCC TTTCACAGTA
TCAATGAGTA GAAATGATGT TAGAATAACA GTAAGGTATG AAGGATATGA TTTCAAGAGA
GTTCTTTATT CTCTAGTGCA CGAGAGCGGG CATGCAATAT ATGAGTTACA AATAGATCCG
AGTCTAGAAT ACTCTCCTTT AGCAAATGCT CCTTCCATGG GCCTTCATGA GTCGCAATCG
AGATTCTGGG AAAACGTAGT AGGAAGGAGT TATGGCTTTA TTAAAACCAT TTATCCCTTG
CTAAACGTTA AGGATAGCAT TGATGATGTA TATTACTATG TTAATGGCGT TAAGAGGCAA
CCAATTAGGG TTGACGCTGA TGAAGTTACT TATAACTTTC ATATTGCAAT CAGATACGAG
ATAGAGAAGA GGGCAATAGA GGGTAGTTTA GAAGCTAGCG AATTCCCCTC ACTATTTAAT
GATTTGATGG ACAAATACCT AAATATAAGG CCTAAGAATG ATGGAGAGGG AGTATTACAA
GACGTTCATT GGAGTCAAGG CTCTTTTGGT TACTTCCCTA CTTATACATT GGGAAATGTG
ATAGCTGGTA TGGTATACTA CCATATGAAG AGTGAGAGAG GTTTCGATAT TAGTAATATA
GAGGGGATAA AGAATTGGCT AAGAGAGAGA ATTCATAAAT ACGGATCAAT ATATTCACCA
AAGGAGTTAC AAATGAGGTC ATTTGGTGAG GCATATAACC CATCTAGGCT ATTAGATTAT
ATGAGAGAGA AATATAATGC GTGA
 
Protein sequence
MYEDIWAIEH AISLLDWDIQ TYMPQSGIKA RGEALARLSN LRRKLLLGIR GEIEKLEPKN 
DIEKGLKRVL EREYKYYDAV PEELDMKLHR ITSEATVVWR NAKAKGDFNA FKPYLEQILE
IKREIAHKLG YKDHPYSALL DRYEEGFTVT DAERVFNELL PGLSKILNKI DDKFTRKYHF
EDEKYDVFQM SKTIEAIAYE VLKMPKDRFR IDVSPHPFTV SMSRNDVRIT VRYEGYDFKR
VLYSLVHESG HAIYELQIDP SLEYSPLANA PSMGLHESQS RFWENVVGRS YGFIKTIYPL
LNVKDSIDDV YYYVNGVKRQ PIRVDADEVT YNFHIAIRYE IEKRAIEGSL EASEFPSLFN
DLMDKYLNIR PKNDGEGVLQ DVHWSQGSFG YFPTYTLGNV IAGMVYYHMK SERGFDISNI
EGIKNWLRER IHKYGSIYSP KELQMRSFGE AYNPSRLLDY MREKYNA