Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2447 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 2250779 |
End bp | 2252830 |
Gene Length | 2052 bp |
Protein Length | 683 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | ACX92596 |
Protein GI | 261602993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAGAG TGGGCATAGA TATAGGAGGC GCTTTCACTG ACGTAGTGGT CTATAACGAA GAAAACGGAG AAATAAGTTG GGCAAAGGTA GAAACCACGC CAGATGACCC GTCAAACGGC GTTCTGGAGG CAATAGATGA AGCCAAAGTA AATTTAGGTT ATGTAAATAC TATAATTCAC GGTCAGACAT TAGCCATAAA CACAATAGTA GAAAGGAAAG GAGCAAAGGT TGGTCTAATC ACAACTAAGG GCTTTAGAGA TATCCTAGAA ATCCAAAGGG CTAATAGAAG AGATATGTAC AATTTCAGAT ATAAGAAACC TACTCCCTTT GTTCCGAGGT ATTTAAGACT AGAGGTCACC GAGAGGATAA AGAGTAATGG CGATATTTTG ACGTCTCTAA ACGAAAACGA GGTTGTTGAG GCGATTAAAA AACTAAAAGC GGAAAACGTA GAAGCCATTG CTGTAAGTTT TATCAATTCT TACGTTAATC CCATACATGA ACTTAAGGTA GGAGAGATTA TTAAGAGAGT TGATCCTAAT ATTATAGTAA CGTTATCTCA TGAGGTCACT AGAGAGTGGA GAGAGTACGA GAGAACTAGT ACCGCTGTAC TTAACGCATA CGTTATGCCA AAAATGAGTA AATACTTAAG TAAACTTGAA AATGAATTTA AAAATAGAGG TTTTAAAGGG AATTATTTCG CTATGCTTTC CAACGGAGGT ATGGCCACAT TCGACTACGC TAAAAGATTT CCAATATATA CCTTAGAATC TGGTCCAGTA GCAGGAGTTA TTGGGGCAAT TAAGATAAGC GACATATTAG GAGAAAAAAA CATTATAGCA ATGGATGGTG GAAGTACAAC AACTAAGGCT AGTCTAGTGA GAAATCTGGA ACCCAATATA AATACTGATT ACTATGTTGG AAGAGATAAG TACAACCCAG GGTATCCAGT AAAAGTTCCA ACATTGGATA TAGTAGAAAT AGGTAATGGT GGAACGAGTA TTGCTCGGAT TGATGAAACA AGTAACTTAA AAGTAGGACC CAGAGCAGCT GGTGCTTACC CAGGTCCAGT AGCTTATGGG AAAGGAGGTA AAGATGTTAC TGTGACAGAT GCTTACATAG TATGTGGATT TCTAAATCAA GAGGAACTAC TTGGAGGGAA AATAAAGGTT AATAAAAGAC TTGCTGAGGA GGCTATTTCA AATATTGCTA AATACTATAA CATGTCTATT GAAGAAGTCT CCTATGGTAT AGTTAAAATT GCTAATGATA ATGCTGTAAA TGCAGTTAGG TTAATATCAG TTCAAAGAGG ATATGATCCA AGAGAGTTTA CGCTAGTAGC ATACGGTGGC TCTGGCCCTA TGTTTGCTCC ATTTGTTGCA GAAGAATTAG ACATAAAGAA GATAATAGTA CCATTTCTCC CAGCAGGAGT ATTCTCCGCA TGGGGTATGC TTGTTTCAGA CATTAGGCAT GACCTTGTTT TATCCTATCC TTTGAGGATT GATAAGGAAA GTAGTGTAGA TTTGATAAAT GAAAAATTTA ATGAATTAGA GAGCAAAATA AGGTCCATAT TAATATCGGA GGGATTCAAA GAGAAAGATA TCATAATGCT AAGATACGCT GAGATGAGAT ATTATGGCCA AGAACATACT GTAAAAGTGA GTGTAATGCC AGGGGAAATT GGGAATAGGG AATTGGAGGA GATAGAAAGA AGGTTCCATG AAGCTCACGA AATCGCATAT GCGTTTACCT TAGATAGTCC AATCGAAATA GTAAACTTTC ATGTGAGTGG TATAGTAAAA GCTAAGACTA TTGTATTAAA GAGGATAGAG AGGGATAATT CAAGTATTGA TAAGGCGTTG GTCGGAAAAA GAAAGGTATT CTATGATGGA AAATATGAGG AGTGGAATGT GTATAATAAA GAATATTTAC CTATTAATTA TCAAATAGTT GGTCCAGCAA TTATAGAAGA TCCTACTTCT ACATCGTTAG TATTAGAAGG GCAAACGGGA ATGTTAGATA GTTATGGCAA TCTAATTATT GAGAGGGATT AA
|
Protein sequence | MIRVGIDIGG AFTDVVVYNE ENGEISWAKV ETTPDDPSNG VLEAIDEAKV NLGYVNTIIH GQTLAINTIV ERKGAKVGLI TTKGFRDILE IQRANRRDMY NFRYKKPTPF VPRYLRLEVT ERIKSNGDIL TSLNENEVVE AIKKLKAENV EAIAVSFINS YVNPIHELKV GEIIKRVDPN IIVTLSHEVT REWREYERTS TAVLNAYVMP KMSKYLSKLE NEFKNRGFKG NYFAMLSNGG MATFDYAKRF PIYTLESGPV AGVIGAIKIS DILGEKNIIA MDGGSTTTKA SLVRNLEPNI NTDYYVGRDK YNPGYPVKVP TLDIVEIGNG GTSIARIDET SNLKVGPRAA GAYPGPVAYG KGGKDVTVTD AYIVCGFLNQ EELLGGKIKV NKRLAEEAIS NIAKYYNMSI EEVSYGIVKI ANDNAVNAVR LISVQRGYDP REFTLVAYGG SGPMFAPFVA EELDIKKIIV PFLPAGVFSA WGMLVSDIRH DLVLSYPLRI DKESSVDLIN EKFNELESKI RSILISEGFK EKDIIMLRYA EMRYYGQEHT VKVSVMPGEI GNRELEEIER RFHEAHEIAY AFTLDSPIEI VNFHVSGIVK AKTIVLKRIE RDNSSIDKAL VGKRKVFYDG KYEEWNVYNK EYLPINYQIV GPAIIEDPTS TSLVLEGQTG MLDSYGNLII ERD
|
| |