Gene Ssol_0507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0507 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp452567 
End bp454342 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content37% 
IMG OID 
ProductAcylaminoacyl-peptidase 
Protein accessionACX90786 
Protein GI261601183 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAG AAGACTACTA CTATTCGATT AAATTAGTAC CAGAAATAAC AATAGAGAAT 
GGAAAACTAT TTCACGTAGA GACATGGATA GAAGAGGATA AATACAAATC ATCAATTTAT
TTGAACCTCA AGAGGATAAC TTTTCAAGGA AATGAATCAT CACCTAAGTT CAATAATGAT
AAGCTTTACT TTATAAGAAA CGAAGAGGCT AAATCATCCT TACTTGAAGC ACAACTCTAT
GGCGAGCCTA AAGTAATATT TACATTTTCT GGCAAAATAT CCAAATATGA GTTCCATAAT
AAGGGAATAT TAGTGATAGC AGAGGAAAAC ACAGATAAGA CATTACCTTT TAGAGCCGAG
AAGATAAAGT ATAGGTTTGA TAGCAGAGGT TTATTGAGAG CTAGGCAATC ACTTTACCTA
TTTGATGGTA AGGATCTGAG AAGGCTCGTT ACAGGGAACT TTGACGTTAC AGATTTAGCT
ACAAACGGTA ATAGGGTCGT AATTTCAGCA ACTAAGGACG GAGATGATTA TGGATTAGGA
AATTTGTATG AGGTAAATAT AGAAACTGGG GAGTTAAACA GAATAACGAA GGAGGATGGG
ACTGTACAAG CAATAGCTAT GAATAGCGAG GGGAAAATAG CGTTTTTAGG GCATAGGAAA
GGGCTAACCC CATGGGCTTC TCTCGAAATA ATGCTACCAG AAGAAGGAAA GAGTTACATG
TGTGGAAAGA CTTGCGGAAA TAAAGTGTTA ACTGATTTAT TTGATGGCGT AAAGGATAGG
ATCGTATTCG AAAAAGATCT AATACTCTCC TTAGGTCAAG AGGGAGGTAC GTCACATATT
TATCAAATCT CTGATAATAA GGTAGATAAG GTAACTAGTG GAAATATAAT GGTAAGAGGA
TTTGATTATA GTAATAGTGA ACTAGCTTAC TTCTATTCCA CTCCAGAAAA GCCTGTAATA
TTAAAATATA GAGATATAGA ATATGATCCG AACCCTAACA TCAAAGGATA CACTCCAGAG
AGAATAACAG TAAACTCTAA CGGAGTAGAA GTGGAGGGAT GGAGTATAAT TAAAGATCCT
AACGCACCAA CAATATTATT CATCCATGGG GGACCGCATA TGGCGTATGG TTATGGTTAT
TTCATAGAAT TCCAGTTCTT CGTAGATAAT GGGTTTAACG TAATATATGC AAATCCTAGA
GGCAGTCAAG GATATGGGGA GGAATTCGCC AAGGCTTGTG TGGGGGATTG GGGTGGAAAG
GATTTCGAAG ATCTAATGAA CTTCGTGAAT ACCGTTAAGG AAAGGTATAG TTTAAAAGGT
AAATTCGGTA TTACTGGTGG TTCTTATGGA GGCTTTATGA CCAATTGGAT AGTAACGAAG
ACTAGTATGT TTTCGGCTGC AATTAGCGAA AGGAGTATAT CGAATCTAGT TAGTATGTGT
GGTACTAGTG ATATAGGTTT TTGGTTTAAT GCGATTGAAT CCGGGATTGC AGATCCATGG
AGTACTGAAG GCATTGAGAA ACTAATGAAA ATGTCGCCAA TTTATTATGT GAAAAACGTT
AAAACACCTA CCATGTTAAT TCACGGAGAG GAAGATTATA GATGCCCAAT TGAACAAGCT
GAACAATTTT ATGTTGCATT GAAGATGCAA GGAGTCCCTA CAACGTTGGT AAGATACCAA
GGTGATAGTC ATGAACACGC TAGAAGAGGG AAGCCTAAGA ATATGATAGA TAGATTGAAG
ACTAAATTAG AATGGTTTAG TAAATATTTA CTCTAA
 
Protein sequence
MKPEDYYYSI KLVPEITIEN GKLFHVETWI EEDKYKSSIY LNLKRITFQG NESSPKFNND 
KLYFIRNEEA KSSLLEAQLY GEPKVIFTFS GKISKYEFHN KGILVIAEEN TDKTLPFRAE
KIKYRFDSRG LLRARQSLYL FDGKDLRRLV TGNFDVTDLA TNGNRVVISA TKDGDDYGLG
NLYEVNIETG ELNRITKEDG TVQAIAMNSE GKIAFLGHRK GLTPWASLEI MLPEEGKSYM
CGKTCGNKVL TDLFDGVKDR IVFEKDLILS LGQEGGTSHI YQISDNKVDK VTSGNIMVRG
FDYSNSELAY FYSTPEKPVI LKYRDIEYDP NPNIKGYTPE RITVNSNGVE VEGWSIIKDP
NAPTILFIHG GPHMAYGYGY FIEFQFFVDN GFNVIYANPR GSQGYGEEFA KACVGDWGGK
DFEDLMNFVN TVKERYSLKG KFGITGGSYG GFMTNWIVTK TSMFSAAISE RSISNLVSMC
GTSDIGFWFN AIESGIADPW STEGIEKLMK MSPIYYVKNV KTPTMLIHGE EDYRCPIEQA
EQFYVALKMQ GVPTTLVRYQ GDSHEHARRG KPKNMIDRLK TKLEWFSKYL L