Gene Ssol_0641 details

Gene Information

Locus tag: Ssol_0641
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 588332
End bp: 591271
Gene Length: 2940 bp
Protein Length: 979 aa
Translation table: 11
GC content: 39%
IMG OID:
Product: formate dehydrogenase, alpha subunit
Protein accession: ACX90914
Protein GI: 261601311
COG category:
COG ID:
TIGRFAM ID:
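
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal Python sketch. It assumes 1-based inclusive coordinates and a complete, unspliced CDS ending in a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 588332
END_BP = 591271
GENE_LENGTH_BP = 2940
PROTEIN_LENGTH_AA = 979


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2940
print(expected_protein_length(GENE_LENGTH_BP))  # 979
```

Both results agree with the listed Gene Length and Protein Length.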


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCTTG TGTCAATAAA ACTAGTTATC GATAATAAAG AGGTATTAGC TAATGAAGGA 
GAGACAATTC TATCAACACT AAAGAGGAAT GGTATTTACA TTCCACACAT ATGTTACAAT
GAAGGATTAG TTCCCATAGA GAGCTGTGAT TCATGCCTAG TTGAGGTTAA TGGGAAACTA
GTTAGAGCTT GTTCAACAAG AGTCGAAGAT GGAATGAGTA TCTCAGTTAA CTCTAAAAGA
GCTATGGAAG CAAGAAAGAC CGCAATCTCC AGAATACTAA GATATCACAA ATTGTACTGT
AGTATTTGTG AGAATAATAA TGGGGATTGC GTACTTCACG AGGCTGTAAT AAAATTGAAT
ATTAATTCTC AAAAGTACGT AGAAAAGCCT TATCAAACGG ATGAAAGTGG TCCCTTTTAC
ATATATGATC CATCACAATG TATCCTATGC GGTAGATGTG TTGAGGCTTG CCAAGATTTT
GCAGTAAATG AGGTAATATG GATTAATTGG GATCTCAATC CTCCAAGAGT AGTTTGGGAT
AACGGAAATC CCATAGGCAA CTCCTCATGC GTAAATTGTG GTACGTGCGT TACTGTTTGT
CCAGTCAACG CATTAATGGA AAAATCAATG TTGGGAGAGG CTGGCTATCT CACTTGGATT
AATAAGGATT TAAAGAAAAA AGCAATAGAG GCTATAGGGA AAGCTGAGGA TAACTTTAGC
TTATTAATGA CCTTTAGCGA GATAGAGGCT AAGGCTAGGG AATCACAAAT AAAGAAAACT
AAAACAGTCT GTATATACTG TGGTGTTGGT TGTTCATTTG AGATTTGGAC TAAGGGTAGG
AAAATATTAA AAATTGAGCC AAAACCTGAG TCACCAGCCA ATGGTATTCT AACTTGCGTA
AAGGGTAAAT TCGGCTGGGA TTTTGTAAAT AGCTCGGAAA GAATTACTAA GCCCTTAATA
AGGGAGGGTG ATAAGTTTAG GGAAGCTAGT TGGGATGAGG CAATCTCGTA CATAGCTAAA
AGATTGAAGG AGATCAAGGA GAGATATGGT CCAGATTCCA TAGGTTTCAT AGCCTCAGAT
AAGATGAGCA ATGAAGAGGC GTACTTACTA CAGAAACTAG CAAGAGCTAT AATAGGTACT
AATAATGTAG ATAATTCAGC AAGGTATTGC CAATCTCCAG CAACTGTTGG GTTATGGAGA
ACTGTCGGTA TAGGTGCAGA TTCAGGAACA ATTAGGGATA TCGAAAACGC TAATTTGATT
GTAATTGTTG GTCACAACAC AACTGAGAGT CATCCAGTAA TAGGAAGCAA GGTAAAGAGA
GCTAAAAAGA TAAACGGTTC AAAGATCGTG GTAATTGACG TTAGAAAACA TGAGATTGCT
GAAAAGGCTG ACCTGTTTAT CAAACCTAAG CCTGGAACTG ATGCAGCAGT TTTAGCTGGT
GTTGCTAAAT ACCTTATTGA CCAGGGGTGG ATTGATAAAG AGTTCATTGA TAAGAGGGTT
AATGGTTTTG AAGAGTTTAA GGAATCTATA AAGGGATTTA CATTAGATTA CGTTGAAGAT
ATAACTGGTG TCCCTAGAGA TCAAATAATT AAACTTGCTG AAATGATCCA TAATGCTAAT
AGTGTGGCGG TATTATGGGG AATGGGAGTA ACTCAACATT TGGGTGGAGC TGATACTTCA
ACGATAATTT CAGACCTATT GCTTATAACT GGGAATTATG GGAAACCCGG TAGTGGAGCT
TTCCCAATGA GAGGTCATAA TAACGTCCAA GGAGTTAGCG ATTTCGGTTG CTTACCCAAT
TATTTACCAG GGTATCAAAA ACTAGAGGAT GAAAATGTAA TAGCGAAATT CGAAGAAGCT
TGGGGTGTGA AATTAAATAG AAATCCTGGA CTACAGATAC CCCAAATGAT AGAAGGTGTA
TTGGAAGGGA AAATCCACGC ATTATATATA GTCGGTGAAG ATACTGTGAT GGTTGATTGT
GGGACTCCTT TAACTAGACA AGCATTAGAG AAAGTCGACT TCCTAGTGGT ACAAGACATG
TTTATAACTG AGACTGCGAA GTTAGCTGAC GTAATATTAC CAGCTGCTGC TAGCCTAGAG
AAAGATGGTA CTTTTGTGAA TACTGAAAGG AGGATACAAA GGTTCTACAA GGCTATGGAA
CCAATTGGTG ATTCTAAACC TGACTGGGAA ATAATACAAA TGGTTGCAAA CGCACTAGGA
GCGAATTGGA GTTATAATCA TCCGGCAGAA ATAATGAACG AGATTGCTAA ACTAGGCCCA
ATATTTGCTG GCGTCAATTA TTCGAGATTA GAAGGATTTA ATAGCCTACT GTGGCCAGTT
AATGAAGATG GGAGTGATAC GCCATTGCTC TATACAAACG CATTTGCTAC TAAAGATGGC
AAGGCAATAC TTTACCCATT AAGCTGGAAA CCACCAGAAC TTAAGGATGA AGTTCACAAA
GTAACTGTAA ATACTGGAAG GGTCTTAGAG CATTTCCATG TAGGTAATAT GACTAGGAGA
GTTGAGGGGT TAAGGAGAAA GGTTCCAGAA ACATTTGTAG AGGTTTCTAA AGAGTTAGCC
TCTAAATACT CAATCAAAAA CGGTGATCTT GTGCTTGTTA AGTCTAAATT TGGTGGAGAG
ATTAAAGCAA GGGCTATAGT TAGTGATAGA GTAGAAGGTG AAGAGATCTT TATACCACTA
TATGCATCAG ATCCTTCCAA GGGTGTAAAT AACTTAACAG GGTTAGTAAT AGATAAGGCT
AGTGGTACCC CAGGGTATAA GGATACTCCA GTTGTTATTG AGAAAATAGA GGAGGGTAAA
GGTGAGAGTC CTTTACCTTT AGATAACTGG AGATTTCATG TCAATGAAAG GAGGAGACAA
ATAGGTATAG AGGTGGAGAA AAAATGGAAG AGGGAGGAGT TCAAGCCATT GACGGGTTAA
 
Protein sequence
MSLVSIKLVI DNKEVLANEG ETILSTLKRN GIYIPHICYN EGLVPIESCD SCLVEVNGKL 
VRACSTRVED GMSISVNSKR AMEARKTAIS RILRYHKLYC SICENNNGDC VLHEAVIKLN
INSQKYVEKP YQTDESGPFY IYDPSQCILC GRCVEACQDF AVNEVIWINW DLNPPRVVWD
NGNPIGNSSC VNCGTCVTVC PVNALMEKSM LGEAGYLTWI NKDLKKKAIE AIGKAEDNFS
LLMTFSEIEA KARESQIKKT KTVCIYCGVG CSFEIWTKGR KILKIEPKPE SPANGILTCV
KGKFGWDFVN SSERITKPLI REGDKFREAS WDEAISYIAK RLKEIKERYG PDSIGFIASD
KMSNEEAYLL QKLARAIIGT NNVDNSARYC QSPATVGLWR TVGIGADSGT IRDIENANLI
VIVGHNTTES HPVIGSKVKR AKKINGSKIV VIDVRKHEIA EKADLFIKPK PGTDAAVLAG
VAKYLIDQGW IDKEFIDKRV NGFEEFKESI KGFTLDYVED ITGVPRDQII KLAEMIHNAN
SVAVLWGMGV TQHLGGADTS TIISDLLLIT GNYGKPGSGA FPMRGHNNVQ GVSDFGCLPN
YLPGYQKLED ENVIAKFEEA WGVKLNRNPG LQIPQMIEGV LEGKIHALYI VGEDTVMVDC
GTPLTRQALE KVDFLVVQDM FITETAKLAD VILPAAASLE KDGTFVNTER RIQRFYKAME
PIGDSKPDWE IIQMVANALG ANWSYNHPAE IMNEIAKLGP IFAGVNYSRL EGFNSLLWPV
NEDGSDTPLL YTNAFATKDG KAILYPLSWK PPELKDEVHK VTVNTGRVLE HFHVGNMTRR
VEGLRRKVPE TFVEVSKELA SKYSIKNGDL VLVKSKFGGE IKARAIVSDR VEGEEIFIPL
YASDPSKGVN NLTGLVIDKA SGTPGYKDTP VVIEKIEEGK GESPLPLDNW RFHVNERRRQ
IGIEVEKKWK REEFKPLTG
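
The gene and protein sequences above can be cross-checked against each other with a short Python sketch. It builds the codon table by hand, assuming that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. The helper names (clean, translate, gc_fraction) are illustrative, and only the first 30 bases of the gene are used here.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
gene_prefix = "ATGAGTCTTG TGTCAATAAA ACTAGTTATC"
print(translate(gene_prefix))  # MSLVSIKLVI
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to translate and gc_fraction would extend the same check to the whole record.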