Gene Ssol_0859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0859 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp801311 
End bp802747 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content40% 
IMG OID 
ProductAldehyde Dehydrogenase 
Protein accessionACX91106 
Protein GI261601503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCGT ATCAGGGATT GGCTGACAAG TGGATTAAGG GCAGTGGGGA AGAATACCTT 
GATATTAATC CGGCTGATAA GGATCACGTA TTAGCTAAGA TAAGATTATA TACAAAAGAT
GACGTTAAAG AAGCTATAAA CAAGGCTGTA GCCAAATTCG ACGAATGGTC AAGGACTCCA
GCACCTAAAA GAGGCTCAAT ATTACTTAAG GCAGGGGAAT TAATGGAACA AGAAGCCCAA
GAGTTTGCGC TATTGATGAC ATTAGAGGAG GGTAAGACTC TCAAGGATAG TATGTTTGAA
GTGACAAGAA GTTATAATTT ACTGAAATTT TATGGAGCAT TAGCATTTAA GATATCTGGG
AAAACGCTTC CTTCAGCAGA TCCTAATACT AGGATATTTA CAGTAAAGGA ACCCTTAGGC
GTAGTAGCTT TAATTACGCC GTGGAATTTC CCATTATCAA TACCAGTATG GAAATTGGCT
CCAGCCTTGG CTGCGGGTAA CACTGCAGTA ATAAAACCAG CGACGAAAAC ACCGTTAATG
GTAGCCAAAT TGGTAGAAGT GTTGTCTAAA GCTGGATTGC CAGAGGGTGT CGTGAATTTA
GTAGTTGGTA AGGGAAGTGA AGTCGGAGAT ACCATAGTAA GTGATGATAA TATAGCTGCA
GTATCATTTA CTGGATCAAC CGAGGTAGGT AAGAGAATTT ACAAACTCGT AGGAAATAAA
AATAGAATGA CAAGAATTCA ACTAGAGCTA GGAGGTAAAA ACGCGTTATA TGTGGATAAG
AGCGCTGACT TAACGTTAGC TGCTGAATTA GCCGTAAGAG GAGGATTTGG ACTAACCGGT
CAATCATGTA CTGCAACTAG TAGGTTAATA ATTAACAAGG ATGTATATAC TCAATTTAAA
CAAAGACTAC TAGAAAGAGT TAAGAAGTGG AGAGTAGGAC CGGGTACTGA AGATGTTGAT
ATGGGTCCAG TTGTAGATGA AGGTCAATTT AAGAAAGACT TAGAATATAT AGAATACGGA
AAGAATGTGG GAGCAAAACT AATTTATGGT GGAAATATAA TACCAGGGAA GGGATATTTC
CTAGAACCTA CAATTTTCGA AGGAGTCACA TCTGATATGA GGCTATTTAA AGAAGAGATT
TTCGGTCCAG TACTTAGTGT CACTGAGGCA AAAGATTTAG ATGAGGCTAT AAGGCTAGTT
AACGCTGTAG ACTATGGACA TACAGCTGGA ATAGTCGCAA GCGATATCAA GGCGATTAAC
GAGTTCGTTA GTAGGGTAGA GGCAGGAGTT ATAAAGGTTA ATAAGCCAAC AGTCGGACTG
GAATTGCAAG CACCATTTGG TGGTTTTAAG AATTCTGGAG CCACTACGTG GAAAGAGATG
GGAGAAGATG CTTTAGAGTT CTACCTTAAG GAGAAGACAG TATACGAAGG CTGGTAA
 
Protein sequence
MKSYQGLADK WIKGSGEEYL DINPADKDHV LAKIRLYTKD DVKEAINKAV AKFDEWSRTP 
APKRGSILLK AGELMEQEAQ EFALLMTLEE GKTLKDSMFE VTRSYNLLKF YGALAFKISG
KTLPSADPNT RIFTVKEPLG VVALITPWNF PLSIPVWKLA PALAAGNTAV IKPATKTPLM
VAKLVEVLSK AGLPEGVVNL VVGKGSEVGD TIVSDDNIAA VSFTGSTEVG KRIYKLVGNK
NRMTRIQLEL GGKNALYVDK SADLTLAAEL AVRGGFGLTG QSCTATSRLI INKDVYTQFK
QRLLERVKKW RVGPGTEDVD MGPVVDEGQF KKDLEYIEYG KNVGAKLIYG GNIIPGKGYF
LEPTIFEGVT SDMRLFKEEI FGPVLSVTEA KDLDEAIRLV NAVDYGHTAG IVASDIKAIN
EFVSRVEAGV IKVNKPTVGL ELQAPFGGFK NSGATTWKEM GEDALEFYLK EKTVYEGW