Gene Ssol_1894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1894 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1683157 
End bp1684416 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content35% 
IMG OID 
Productenolase 
Protein accessionACX92106 
Protein GI261602503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAACC GTTTTTCCAT AGAGAAGGTT AAGGGATTAG AAATCGTAGA TTCTAGAGGT 
AATCCCACTA TAAGAGTTTT CATAAGAACT AGTGATGGTG TCGAATCCTT TGGAGACGCA
CCAGCAGGGG CTTCTAAAGG GACAAGAGAG GCGGTAGAAG TTAGGGATGA AAATGGGCTT
ACAGTAAAGA GGGCAGTAGA CATTGTAAAT TACATAATAG ATCCTGCATT ACATGGAATT
GATGTAAGAG AACAAGGGAT AATCGACAAA TTACTAAAAG ATATAGACTC CACTGAGAAT
AAGTCTAAAT TAGGAGGAAA CACAATAATT GCAACATCAA TAGCTGCATT AAAGACTGCT
TCTAAGGCCT TAGGTCTAGA GGTTTTTAAA TACATATCTG GGCCTAGATT ACCTAAAATC
CCAATACCTT TACTTAATAT AATAAATGGC GGTTTACATG CTGGAAATAA GCTAAAAATA
CAAGAATTCA TTATAGTGCC AATTAAGTTC AATACTTTTA AAGAAGCTCT TTTCGCTGCG
ATAGACGTTT ATAGAACCCT AAAAGGGTTA ATAACGGAGA GGTATGGTAA AATTTACACA
GCAGTTGGAG ATGAAGGGGG ATTCTCTCCA CCTTTAGAAG ATACTAGAGA GGCCTTGGAT
CTAATATATA CTTCCATAAA TAATGCAGGT TATGAAGGAA AAATATATAT GGGAATGGAT
GCTGCAGGGA GCGATTTCTA CGATAGTAAA AAAGAGAAAT ATATAATTGA TGGTAGAGAA
TTGGATCCTA ATCAATTACT TGAATTTTAT CTTGACTTAG TTAAACAATA TCCCATAGTG
TACTTGGAAG ATCCGTTTGA AGAGAACTCT TTTGATATGT TTAGCCAACT ACAAAATAAG
CTGAGTTCAA CAATAATTAC TGGAGATGAC CTATATACTA CAAATATAAA ATATCTAAAA
ATAGGTATAG AAAAGAGATC GACTAAGGGT GTTATAGTTA AGCCTAATCA AGTCGGTACA
ATATCTGAGA CGTTTGAATT TACTAATTTG GCTAGGAGAA ACTCAATGAA GTTAATAACA
AGTCATAGAA GTGGAGAGAC TGAGGACAAT TTCATAGCAG ACTTTGCGGT GGGAATTGAG
TCAGATTTCA TAAAGGTTGG TGCACCGGCG AGAGGAGAGA GAACTAGCAA ATATAATAAG
CTATTAGAAA TAGAAAATAA ATTTGGATTA GAATACGAAG GAAAATATTT TTATCTTTAA
 
Protein sequence
MINRFSIEKV KGLEIVDSRG NPTIRVFIRT SDGVESFGDA PAGASKGTRE AVEVRDENGL 
TVKRAVDIVN YIIDPALHGI DVREQGIIDK LLKDIDSTEN KSKLGGNTII ATSIAALKTA
SKALGLEVFK YISGPRLPKI PIPLLNIING GLHAGNKLKI QEFIIVPIKF NTFKEALFAA
IDVYRTLKGL ITERYGKIYT AVGDEGGFSP PLEDTREALD LIYTSINNAG YEGKIYMGMD
AAGSDFYDSK KEKYIIDGRE LDPNQLLEFY LDLVKQYPIV YLEDPFEENS FDMFSQLQNK
LSSTIITGDD LYTTNIKYLK IGIEKRSTKG VIVKPNQVGT ISETFEFTNL ARRNSMKLIT
SHRSGETEDN FIADFAVGIE SDFIKVGAPA RGERTSKYNK LLEIENKFGL EYEGKYFYL