Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_0641 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 588332 |
End bp | 591271 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | formate dehydrogenase, alpha subunit |
Protein accession | ACX90914 |
Protein GI | 261601311 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTTG TGTCAATAAA ACTAGTTATC GATAATAAAG AGGTATTAGC TAATGAAGGA GAGACAATTC TATCAACACT AAAGAGGAAT GGTATTTACA TTCCACACAT ATGTTACAAT GAAGGATTAG TTCCCATAGA GAGCTGTGAT TCATGCCTAG TTGAGGTTAA TGGGAAACTA GTTAGAGCTT GTTCAACAAG AGTCGAAGAT GGAATGAGTA TCTCAGTTAA CTCTAAAAGA GCTATGGAAG CAAGAAAGAC CGCAATCTCC AGAATACTAA GATATCACAA ATTGTACTGT AGTATTTGTG AGAATAATAA TGGGGATTGC GTACTTCACG AGGCTGTAAT AAAATTGAAT ATTAATTCTC AAAAGTACGT AGAAAAGCCT TATCAAACGG ATGAAAGTGG TCCCTTTTAC ATATATGATC CATCACAATG TATCCTATGC GGTAGATGTG TTGAGGCTTG CCAAGATTTT GCAGTAAATG AGGTAATATG GATTAATTGG GATCTCAATC CTCCAAGAGT AGTTTGGGAT AACGGAAATC CCATAGGCAA CTCCTCATGC GTAAATTGTG GTACGTGCGT TACTGTTTGT CCAGTCAACG CATTAATGGA AAAATCAATG TTGGGAGAGG CTGGCTATCT CACTTGGATT AATAAGGATT TAAAGAAAAA AGCAATAGAG GCTATAGGGA AAGCTGAGGA TAACTTTAGC TTATTAATGA CCTTTAGCGA GATAGAGGCT AAGGCTAGGG AATCACAAAT AAAGAAAACT AAAACAGTCT GTATATACTG TGGTGTTGGT TGTTCATTTG AGATTTGGAC TAAGGGTAGG AAAATATTAA AAATTGAGCC AAAACCTGAG TCACCAGCCA ATGGTATTCT AACTTGCGTA AAGGGTAAAT TCGGCTGGGA TTTTGTAAAT AGCTCGGAAA GAATTACTAA GCCCTTAATA AGGGAGGGTG ATAAGTTTAG GGAAGCTAGT TGGGATGAGG CAATCTCGTA CATAGCTAAA AGATTGAAGG AGATCAAGGA GAGATATGGT CCAGATTCCA TAGGTTTCAT AGCCTCAGAT AAGATGAGCA ATGAAGAGGC GTACTTACTA CAGAAACTAG CAAGAGCTAT AATAGGTACT AATAATGTAG ATAATTCAGC AAGGTATTGC CAATCTCCAG CAACTGTTGG GTTATGGAGA ACTGTCGGTA TAGGTGCAGA TTCAGGAACA ATTAGGGATA TCGAAAACGC TAATTTGATT GTAATTGTTG GTCACAACAC AACTGAGAGT CATCCAGTAA TAGGAAGCAA GGTAAAGAGA GCTAAAAAGA TAAACGGTTC AAAGATCGTG GTAATTGACG TTAGAAAACA TGAGATTGCT GAAAAGGCTG ACCTGTTTAT CAAACCTAAG CCTGGAACTG ATGCAGCAGT TTTAGCTGGT GTTGCTAAAT ACCTTATTGA CCAGGGGTGG ATTGATAAAG AGTTCATTGA TAAGAGGGTT AATGGTTTTG AAGAGTTTAA GGAATCTATA AAGGGATTTA CATTAGATTA CGTTGAAGAT ATAACTGGTG TCCCTAGAGA TCAAATAATT AAACTTGCTG AAATGATCCA TAATGCTAAT AGTGTGGCGG TATTATGGGG AATGGGAGTA ACTCAACATT TGGGTGGAGC TGATACTTCA ACGATAATTT CAGACCTATT GCTTATAACT GGGAATTATG GGAAACCCGG TAGTGGAGCT TTCCCAATGA GAGGTCATAA TAACGTCCAA GGAGTTAGCG ATTTCGGTTG CTTACCCAAT TATTTACCAG GGTATCAAAA ACTAGAGGAT GAAAATGTAA TAGCGAAATT CGAAGAAGCT TGGGGTGTGA AATTAAATAG AAATCCTGGA CTACAGATAC CCCAAATGAT AGAAGGTGTA TTGGAAGGGA AAATCCACGC ATTATATATA GTCGGTGAAG ATACTGTGAT GGTTGATTGT GGGACTCCTT TAACTAGACA AGCATTAGAG AAAGTCGACT TCCTAGTGGT ACAAGACATG TTTATAACTG AGACTGCGAA GTTAGCTGAC GTAATATTAC CAGCTGCTGC TAGCCTAGAG AAAGATGGTA CTTTTGTGAA TACTGAAAGG AGGATACAAA GGTTCTACAA GGCTATGGAA CCAATTGGTG ATTCTAAACC TGACTGGGAA ATAATACAAA TGGTTGCAAA CGCACTAGGA GCGAATTGGA GTTATAATCA TCCGGCAGAA ATAATGAACG AGATTGCTAA ACTAGGCCCA ATATTTGCTG GCGTCAATTA TTCGAGATTA GAAGGATTTA ATAGCCTACT GTGGCCAGTT AATGAAGATG GGAGTGATAC GCCATTGCTC TATACAAACG CATTTGCTAC TAAAGATGGC AAGGCAATAC TTTACCCATT AAGCTGGAAA CCACCAGAAC TTAAGGATGA AGTTCACAAA GTAACTGTAA ATACTGGAAG GGTCTTAGAG CATTTCCATG TAGGTAATAT GACTAGGAGA GTTGAGGGGT TAAGGAGAAA GGTTCCAGAA ACATTTGTAG AGGTTTCTAA AGAGTTAGCC TCTAAATACT CAATCAAAAA CGGTGATCTT GTGCTTGTTA AGTCTAAATT TGGTGGAGAG ATTAAAGCAA GGGCTATAGT TAGTGATAGA GTAGAAGGTG AAGAGATCTT TATACCACTA TATGCATCAG ATCCTTCCAA GGGTGTAAAT AACTTAACAG GGTTAGTAAT AGATAAGGCT AGTGGTACCC CAGGGTATAA GGATACTCCA GTTGTTATTG AGAAAATAGA GGAGGGTAAA GGTGAGAGTC CTTTACCTTT AGATAACTGG AGATTTCATG TCAATGAAAG GAGGAGACAA ATAGGTATAG AGGTGGAGAA AAAATGGAAG AGGGAGGAGT TCAAGCCATT GACGGGTTAA
|
Protein sequence | MSLVSIKLVI DNKEVLANEG ETILSTLKRN GIYIPHICYN EGLVPIESCD SCLVEVNGKL VRACSTRVED GMSISVNSKR AMEARKTAIS RILRYHKLYC SICENNNGDC VLHEAVIKLN INSQKYVEKP YQTDESGPFY IYDPSQCILC GRCVEACQDF AVNEVIWINW DLNPPRVVWD NGNPIGNSSC VNCGTCVTVC PVNALMEKSM LGEAGYLTWI NKDLKKKAIE AIGKAEDNFS LLMTFSEIEA KARESQIKKT KTVCIYCGVG CSFEIWTKGR KILKIEPKPE SPANGILTCV KGKFGWDFVN SSERITKPLI REGDKFREAS WDEAISYIAK RLKEIKERYG PDSIGFIASD KMSNEEAYLL QKLARAIIGT NNVDNSARYC QSPATVGLWR TVGIGADSGT IRDIENANLI VIVGHNTTES HPVIGSKVKR AKKINGSKIV VIDVRKHEIA EKADLFIKPK PGTDAAVLAG VAKYLIDQGW IDKEFIDKRV NGFEEFKESI KGFTLDYVED ITGVPRDQII KLAEMIHNAN SVAVLWGMGV TQHLGGADTS TIISDLLLIT GNYGKPGSGA FPMRGHNNVQ GVSDFGCLPN YLPGYQKLED ENVIAKFEEA WGVKLNRNPG LQIPQMIEGV LEGKIHALYI VGEDTVMVDC GTPLTRQALE KVDFLVVQDM FITETAKLAD VILPAAASLE KDGTFVNTER RIQRFYKAME PIGDSKPDWE IIQMVANALG ANWSYNHPAE IMNEIAKLGP IFAGVNYSRL EGFNSLLWPV NEDGSDTPLL YTNAFATKDG KAILYPLSWK PPELKDEVHK VTVNTGRVLE HFHVGNMTRR VEGLRRKVPE TFVEVSKELA SKYSIKNGDL VLVKSKFGGE IKARAIVSDR VEGEEIFIPL YASDPSKGVN NLTGLVIDKA SGTPGYKDTP VVIEKIEEGK GESPLPLDNW RFHVNERRRQ IGIEVEKKWK REEFKPLTG
|
| |