Gene Information |
Locus tag | Ssol_1945 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 1730198 |
End bp | 1731646 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | |
Product | protein of unknown function UPF0027 |
Protein accession | ACX92156 |
Protein GI | 261602553 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
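The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (which is consistent with the values listed) and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1730198, 1731646)
print(length)                           # 1449
print(expected_protein_length(length))  # 482
```

Both results agree with the Gene Length and Protein Length fields of the record.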
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0738266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATTA ACATTACTAG AGTTAGCACA TACGAGTGGC GTATTGATAA AGGCGCTCAA GAGTGTATGA AAGTTCCCGT TACAATATTT GCAGATGACG TTCTAATTGA AAAAATGAAA CAAGATATGA CATTAAGACA AGCAACTAAC GTAGCTTGTT TACCCGGTGT CCAAGAGTCA ATTTACGTTT TACCAGATGG TCATCAAGGT TACGGTTTTC CTATAGGCGG TATAGCAGCT ACTGCGATCG AGGAAGGAGG AGTGGTAAGT CCTGGAGGTA TAGGATATGA CATAAATTGT GGTGTCAGAT TACTCAGAAC TAATTTAGAC TATAAAGATG TAAAGCCAAA ATTAGCTCAG TTGGTTGAGG AGCTGCATAG AAACGTGCCG AGCGGTGTAG GAAGTGAGGG TAAAGTAAAA TTGACATATC AGCAATTAGA TCAAGTATTA GCGGAAGGAG TTGCGTGGGC GGTTGATAAG GGCTTTGGAT GGAAAGAAGA CATGAATCAC ATGGAACAAC GTGGTAGCTG GGAGCTAGCC GATCCTTCAA AAGTAAGTCC GATAGCAAAA CAAAGAGGTG CCTCTCAGTT AGGAACTTTA GGAGCTGGTA ATCATTTCTT GGAAATTCAA GTTGTTGATA AGATATTTGA TCCCCAAATT GCTAAAGCAA TAGGGGTAGA TCACGAAGGT CAAGTAATGG TTATGGTTCA TACGGGTTCA AGAGGTTTAG GTCATCAAGT AGCTAGTGAT TATTTACAGA TAATGGAAAG AGCAATGAAG AAGTATAACA TTCAATTGCC AGATAGAGAA CTAGCTGCAG TTCCCTTTGA GAGTAGAGAG GGTCAGGATT ACTTTCATGC AATGGCATCT GGAGCTAATT TTGCGTGGAC GAATAGACAA TTGATTACGC ATTGGACGAG AGAGAGTTTC GGTAGAGTAT TTGGTGTCGA CCCAGAAAAA TTAGATCTTA GCATAGTTTA TGATGTAGCT CATAATATAG CTAAAATTGA GGAGTATGTA ATTGGTGGAG AAAGGAAGAA GGTATTAGTA CATAGGAAAG GTGCTACTAG GGCTTTCCCG CCTGGTAGTC CGGAGATTCC CGCTGATCAT AGAAATATTG GCCAGATTGT TTTAATCCCA GGTAGTATGG GCACTGCTAG TTATGTTATG GCTGGAATAC CAGAAGGTAG AAGGACATGG TTTACTGCGC CTCATGGTGC TGGTAGGTGG ATGTCTAGGG AAGCTGCAGT GAGAAATTAC CCTGCTAACG TAGTAGTTGA AACTTTAGCT GAAAAGGGTA TAGTAGTAAG GGCTGCTACT AGAAGGGTAG TAGCTGAAGA AGCACCGGGA GCCTACAAAG ATGTTGATAG GGTAGCTAAA GTTGCTCATG AAGTTAAAAT TGCTAAATTA GTTATGCGAT TAAGACCCAT AGGGGTTACC AAAGGATGA
|
Protein sequence | MQINITRVST YEWRIDKGAQ ECMKVPVTIF ADDVLIEKMK QDMTLRQATN VACLPGVQES IYVLPDGHQG YGFPIGGIAA TAIEEGGVVS PGGIGYDINC GVRLLRTNLD YKDVKPKLAQ LVEELHRNVP SGVGSEGKVK LTYQQLDQVL AEGVAWAVDK GFGWKEDMNH MEQRGSWELA DPSKVSPIAK QRGASQLGTL GAGNHFLEIQ VVDKIFDPQI AKAIGVDHEG QVMVMVHTGS RGLGHQVASD YLQIMERAMK KYNIQLPDRE LAAVPFESRE GQDYFHAMAS GANFAWTNRQ LITHWTRESF GRVFGVDPEK LDLSIVYDVA HNIAKIEEYV IGGERKKVLV HRKGATRAFP PGSPEIPADH RNIGQIVLIP GSMGTASYVM AGIPEGRRTW FTAPHGAGRW MSREAAVRNY PANVVVETLA EKGIVVRAAT RRVVAEEAPG AYKDVDRVAK VAHEVKIAKL VMRLRPIGVT KG
|
| |
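The sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be stripped before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned, checked for GC content, and translated. It assumes an unspliced plus-strand CDS, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
prefix = "ATGCAGATTA ACATTACTAG AGTTAGCACA"
print(translate(prefix))             # MQINITRVST
print(f"{gc_fraction(prefix):.2f}")  # 0.33
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to the same functions would let the GC content and Protein Length fields be checked against the record.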