Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1553 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1410618 |
End bp | 1413614 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | conserved hypothetical protein |
Protein accession | ACX91780 |
Protein GI | 261602177 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGAGC TCTACAAAGT CAGGTTAAAA AGCGGTGTAG ATTATCAAAA ACTCAAAGCT ACGGAATTAT TTGGTAATGT ATTCGAGTTG ATTTTCATAA AGAAGAGTAG GAACGAAATG GACTTTTATG TGAGGACTGC TGCAAAAGAG GAAATACTAA GACAATACTT CGTCCTTTTG AAAGCTGATG AGTATCCTAC AAATCGATTT GTCGCAGTTT TGAAACTGAA GAAAGAGAGC GATTTCTATG CAAATTTGGA ATATTCGAAT TTGTTAAACT TGATATCTTC TCTAGAAGAA GGGGAACAGA TTAGGATATG GGTAGTTTTG GAGCCGAGAT TGAATGACCT CTTCATAAAG AAAGCTGACA AATTAAAACT TCAAGCCCAA AAGGCAATGA TAGGAAAGAG GAAGAAAGAA CTTCTAGCAA ATATTTTGGA GTCGTATGCA AAAGATAACT TATATCTTCT TGATATAAAG ATCTTCAGCA ATCAAAAGAG TAGACTGAAG TTACTCTTCG ATTATGCCAA ACAGCTAATT CATACAAAAA GCAGGAAGCT AAGAATGGAG GTAAAGAAGG CTAAGAAGTT CAAGGAAAAA CAGCCGAAAA TAGGGGCATG GGAAGCCATT GCAAAGTATA AAGTAAGGCT ATGGATAGAC GAAGATAAAC TAAACGAGAA CTTGCCTTTG CCCTCTCCAA CTCAGGTACC AATTCCGATG AGTATCGGTG TTTCGCTACC GTACTTTAGG CTAGAAAGGA AAGATATCTA TTTGGGAGAC GATATACTTT ATGCAGAGAA AGTATTTCTA GACTGGACAG ATTTCCAACG TCACGCCATA ATTTACGGCT CAACTGGGTC AGGTAAATCA AACACGTTAG AAATCTTAGC ACAAGAATTA GCTAAATATG GCATAGTGAT ATTCCTTGAC CCTAACAGCC AGAGTGCTAG AAAGTTATCA CAAATAGCGA ACTACTATTT CACAATAACG AAGGACAGGC CGAACTACGG CATAAATATC CTCCAGCTCC CCCACATTTT CCAGGATAGA GAGAAGGACA TTGATTATCA GATTTCGAAA GTGTTACAAC TGTTCGATAA GTTGCTAAAC CTCGTTGACA CTGCAGTAAA CGTGAGGTAC ATCTTACAGG TACTGTTAAG GCAAATGTAC AGGGTGTCGG ACAGGATAAC GTTCAGGGAC GTATATGACG CTGTCATAGC GTTGCAACAG GGCACGCTTG ACCTAGACGT TAATGACGAA ACGTTTGAGC ATGAGAAAGA GTTACTTCAG CAAATGCAGG CTCAAAGTTT CATGTCTATA CTCTCTAGGC TGAAACTCCT TGTTGATAAC AACATCTTCA AAATCGTCAC GTCAGAGACT ACTATTGACT GGGACAGGGT CATTAATGAA ACAAAGAGAG GGCTGATAAT CTTCGATGTA GGGAAGAGCG CCGGGAACGA AGTGTCTGAG ATGATGCAGA TGATTATCAC GTTATCGCTT TTCAACTACG TCTTCCTGAG GGACGCTTTA GGGAAGGAGA AAATACCGAT CTTCCTGGTC ATTGATGAGG CCCAAAACGT TGCACACTTT GACTTCATAA ATGAGGTCTT GGCGGAGGCG AGGAAGTATG GACTACATCT AGTTCTTGCT ACACAGTCCT TCGTGAGGCT GCAGGCATTA GCGGGAGAGA ATAACGCCAG GGCGATAAAC GCTAACACAA ACGTAAAGCT GTTAATGAGG CTAACGGAGG GTAGTGACAT ATCTCAGCTG GCAAAGTCGG TAGGGGCTAA CCAAGAGATC GTTGAGGCTT TGCCGAAGCT GTCTATCGGA CAGGCTTTCC TCTTCCTTCT AGGAAAAACT GGGGAGTTTA CTGTACCTAA ACTGGTACAA ATTAGACCTT CTGAGCTACA AGATAAAGAG AAAGAACCTA CAAAGGGCTT CGAGCCTAAG GGGGTCAGTA AAGGACTAAC TAAGGAAACG ATTAACCCGG CGTTGGCACT ACTCAAGGAG CCTCCAGACG TTTTAGGACA GCTTATACTA TATACAGCCT TCGAGAAAGG CGAGTACGGT ATAACAATTA CAGACCTGAT AGCCCAATTA GGGATAAAGA GGGAAGTAGC ATTAGCTAAA CTGGCGGAGC TGGAGAAACT CGGCGCTGTG CAGATTGAAC AGAGGGGTAG GAGTAAGATA GTGAAATACG CGAAGGGGTT GTTTAGACTC AGGGGTATTG TAGAGAATGA GGAGGGAAAG AAAGTAGCAT TAAGAGTCTT GAGGAAGTAC CTGAAGGATG GGTACATTGT TGTTCCGGGG AGACAGGAGG GTGACATCAG ACCAGACTTT ATAGCACTGA CTTACGACAA GACCACTCTG AGGCCTAATT ATTCCAACAT CGTGATAATT GAAATAGAGT CTCCGAATGA GGTCGCTGTG CACGCTGAAC AAGTGAGAAA GAATATGCAG AAATACCTAT CGTTAGATGA GAGGACTAAG AGCATTATCA AGGAAATTCA CATCTGGACC TCTGAGGAAA AATTCGATAA ATTGAAAGAA ATCTACGACA ACTTCATTAA CGATAATTCT ATTCCTCAAG AGTATAAAAC AAAAGTCAAG ATATTCCCAG TAGAAATTAA ACAAAAAGTA AAACAAGGAG CTCTGAAAGA GAAGAAAACT AAGGCTGAAA CTGGGGAGTT TAACGGCAAA AGAGAAGAGA AGGCTGAAAG TATAGCTCGA CAAGCAGCCC AGGGAGCTCC AAACAATGCT AACAGTAAAC TGGGGAGTTT ACTCAAGATA GGGCATTTAG AGTTCCAAGT GTTGGACGAG GTAAACGACA AGGTAATAGT AAAAACTGGA GATAAAGACT ACAAGATAAG CAAGAAGGAC CTAATAGATT TAGAGGGGCT CAAGGACCTA ATAGTAGAAG CAAAAATTGA AAACGGTTAC CTTAAGGTCA AGACAAGTTT AGGCCTCATT CAGAAGATTT CTTTGGAGCC CTTATGA
|
Protein sequence | MMELYKVRLK SGVDYQKLKA TELFGNVFEL IFIKKSRNEM DFYVRTAAKE EILRQYFVLL KADEYPTNRF VAVLKLKKES DFYANLEYSN LLNLISSLEE GEQIRIWVVL EPRLNDLFIK KADKLKLQAQ KAMIGKRKKE LLANILESYA KDNLYLLDIK IFSNQKSRLK LLFDYAKQLI HTKSRKLRME VKKAKKFKEK QPKIGAWEAI AKYKVRLWID EDKLNENLPL PSPTQVPIPM SIGVSLPYFR LERKDIYLGD DILYAEKVFL DWTDFQRHAI IYGSTGSGKS NTLEILAQEL AKYGIVIFLD PNSQSARKLS QIANYYFTIT KDRPNYGINI LQLPHIFQDR EKDIDYQISK VLQLFDKLLN LVDTAVNVRY ILQVLLRQMY RVSDRITFRD VYDAVIALQQ GTLDLDVNDE TFEHEKELLQ QMQAQSFMSI LSRLKLLVDN NIFKIVTSET TIDWDRVINE TKRGLIIFDV GKSAGNEVSE MMQMIITLSL FNYVFLRDAL GKEKIPIFLV IDEAQNVAHF DFINEVLAEA RKYGLHLVLA TQSFVRLQAL AGENNARAIN ANTNVKLLMR LTEGSDISQL AKSVGANQEI VEALPKLSIG QAFLFLLGKT GEFTVPKLVQ IRPSELQDKE KEPTKGFEPK GVSKGLTKET INPALALLKE PPDVLGQLIL YTAFEKGEYG ITITDLIAQL GIKREVALAK LAELEKLGAV QIEQRGRSKI VKYAKGLFRL RGIVENEEGK KVALRVLRKY LKDGYIVVPG RQEGDIRPDF IALTYDKTTL RPNYSNIVII EIESPNEVAV HAEQVRKNMQ KYLSLDERTK SIIKEIHIWT SEEKFDKLKE IYDNFINDNS IPQEYKTKVK IFPVEIKQKV KQGALKEKKT KAETGEFNGK REEKAESIAR QAAQGAPNNA NSKLGSLLKI GHLEFQVLDE VNDKVIVKTG DKDYKISKKD LIDLEGLKDL IVEAKIENGY LKVKTSLGLI QKISLEPL
|
| |