Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_1147 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | - |
Start bp | 1068065 |
End bp | 1069696 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | thiamine pyrophosphate protein central region |
Protein accession | ACX91385 |
Protein GI | 261601782 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGC CCAAAAGAAA AGAGGAGACT GTAGGCGTAG AAATGAAAGG CGAAGAAGCC CTTGCATACG TTTTAAATGA TATTGGCTTA ACTAAGGTAT TTACAACTTA CTCCCTACCA AATATTGTTA AGGAAATGTT AAAGAAGTAC AATATTGAAG TTGATTTTTC TATTTCAGTA AAAGACGCAA GCTGGCTAGC TTATGCTTAT GCAATGGAAA ATAATTCAGT AGGGACTATA ATTCAGATAC CGGGCTCGAA GTTAACTGAT GCAGTAGATG TTATAGCTCA AGCCTATATG GAGTCAGTTC CCCTTCTTAT AATATCTAGC GTAAGATCAC ATAAAGACAC AGGAAGGGCC AGAATTGGTG AATTTAGAAC GATCGATGAT TTATCAAACA TTTTATCTCC AATAATTAAG ACTAAAGAAA GAGTTATCAG TATAGAAGAA ATAACTGTTA CAATAGAGAA AGCCTATAAG GAAGCAGTTA GTAATAGACC AAGACCTTCT TATGTTGAAA TATCTGAAGA CTTATTCAGG GCAAAAGCTT ATCCGTTATC CACTGCGGGG CAAAAGCCAG AGAAGAGGAC TCCAGATAAG AATAGTGTAG CCAAAGTAGC TGAACTTTTA ACTAATGCTA AATTACCAGT TATAATTGCG GGATATGGAG TAGTTTTAAG CGACGCAGAA GACATGTTAG TCGAACTGGC TGAATTAATA GATTCACCAG TAGTTACTAC ATTTAAGGCT AAAGGTTCAA TACCATCTAA CCATAAATTA TTTGCTGGAG AAGGATTAGG TGCCTTTAGC ACTAGTGCCG CAAACTATCT AATTGAGAAT GCTGATGTAA TACTAGCGTT AGGCACTAGG TTTACTCAGT TAAGTACAGC TGGCTGGTCA TTAAAGTATA AAGGCATCCT AGTGCATAAC AATGTTGATG GTGAGGATAT AGGCAAAGTT TTCATGCCTC ATGTTCCAAT AGTAGCTGAT ACCGGGTTAT TCTTAAGAGA ACTATTAACT CAATTAAAGG CTAAGATAAA GGAGAAGATA AATAGAGGGG CTAGTGATAT TATTTATAAA ACTCAACGTC AAGCATACCC AATTACATCT CATAATGATA TATGGCCTAT AGATGTGGTG AAAATGCTAA GCAGTATAGG TGGCTTTGAG AAAGTCTATG TGGATATTTC GGCTACAACA ATAGATTTAG TTAGATTACC GATAAATGCT AAGAAGACGT GGTATACTGC AGAATCTTTA CTAGAGAGAG GTATCGCGGT AGGTGGTATT ATAGCTTCTA AATACGTTGC ATATGGAATT ACTGATATAG AGGGAATATT ACCTCATTTA TCATTACTAA AATATAAGAT GGATAAGATT AAAGGAAAGT TAATTATATT AAATGATGGG GGCGCAAATT ACATTGAAGT TTCTAACTCT GATTTACCTA CTATTGCTAG ATCACAGACT AGTTTCAATG CCAATTTTGA TGAGATTGCA GAAAAAGCCT TAGGTGGTGT AACTGTTAAT ACATTAACTG AGTTAGAAGA GGCTTTGAAA TCTGTCGATA AGAAAATCAT AAATGTAAAG ATTGATCCTA ACTTCGAGTC GGTAATACTT TCAAGAATTT AA
|
Protein sequence | MSQPKRKEET VGVEMKGEEA LAYVLNDIGL TKVFTTYSLP NIVKEMLKKY NIEVDFSISV KDASWLAYAY AMENNSVGTI IQIPGSKLTD AVDVIAQAYM ESVPLLIISS VRSHKDTGRA RIGEFRTIDD LSNILSPIIK TKERVISIEE ITVTIEKAYK EAVSNRPRPS YVEISEDLFR AKAYPLSTAG QKPEKRTPDK NSVAKVAELL TNAKLPVIIA GYGVVLSDAE DMLVELAELI DSPVVTTFKA KGSIPSNHKL FAGEGLGAFS TSAANYLIEN ADVILALGTR FTQLSTAGWS LKYKGILVHN NVDGEDIGKV FMPHVPIVAD TGLFLRELLT QLKAKIKEKI NRGASDIIYK TQRQAYPITS HNDIWPIDVV KMLSSIGGFE KVYVDISATT IDLVRLPINA KKTWYTAESL LERGIAVGGI IASKYVAYGI TDIEGILPHL SLLKYKMDKI KGKLIILNDG GANYIEVSNS DLPTIARSQT SFNANFDEIA EKALGGVTVN TLTELEEALK SVDKKIINVK IDPNFESVIL SRI
|
| |