Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ssol_2747 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 2513560 |
End bp | 2514861 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | |
Product | thiamine biosynthesis protein ThiC |
Protein accession | ACX92833 |
Protein GI | 261603230 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0789329 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATCA TAGATGAGGC AAAAAGGGGT CAGATAACAG ATGAGATGAG GGCAATAAGT AAGCTAGAAG GTATACCAGT AGAGAAGGTC AGAAATAGGA TAAGTGAAGG TAAAATAATG TTGATAAGAA ATGCGAAGTA CCCCAGTAGA AAACTTGTCC CAATAGGTAA GGGACTAACT ACTAAAGTTA ACGTAAACAT AGGCACTTCA AGTGAGGTTG TAGACTTAGA CATGGAATTA CAGAAGGTAA AGGTTGCAAA CAAGTGGGGA GATACTCTAA TGGATTTATC AACGGGAGGA GACTTAGATG CCATAAGGAG GGATATAATA AAGGCATCAG ATCTACCAGT TGGTACAGTC CCAGTTTACC AAATTTTCAT AGAGTCCTTT AAGAAGAAGT CTGGAGGAGC GTATTTTACT GAAGATGAAT TACTAAACAC AGTGGAAAAG CACTTAAAGG ATGGGGTTGC ATTCATGACA ATTCACGCTG GAATAACTAA GGATTTAGCT ATTAGGGCGT TAAAGAGCGA TAGGATTATT CCAATAGTCT CAAGGGGAGG GGACATGATA GCTGGTTGGA TGATACACAA TAACTCGGAG AACCCCTATA GAAAGAACTG GGATTACGTG TTAGAAATGT TTAAGGAATA TGATGCTGTA ATCTCTTTGG GAGATGCCTT AAGACCTGGA GCTACTGGAG ACGCTCATGA CGAGTTCCAA ATTGGAGAAC TATTGGAAAC CGCTAGGCTA GTTAAAAGTG CGTTGCAAAA GGGTGTGCAA GTTATGGTTG AGGGACCGGG ACACGTACCG CTGAACGAAA TAGCTTGGGA CGTTAAGTTA ATGAAGAAGT TAACTGGTGG TGTACCATAT TACGTTTTGG GTCCCTTGCC CATTGATGTA GGTGCACCCT ATGATCACAT AGCCTCTGCA ATAGGTGCGG CAATATCATC AGCTAGTGGT GTTGACTTAT TATGTTATCT AACCCCAGCT GAGCACTTAG GGTTACCAAC TGTTAAGCAA GTTGAGGAGG GGGCAATCGC TTATAGGATT GCTGCCCATG CAGGTGATGT GGTTAAGTTA GGCAGGAAAG CTAGGAAGTG GGATGACGAG GTCAGTTATT ATAGGGGGAA ACTGGATTGG GAGAACATGA TTTCTAAGCT AATAGATCCG CAAAGGGCTT ATCAAGTTTA CACTCAGTTT GGTACTCCTA AAGTGAAAGC TTGTACCATG TGTGGTGGAT ATTGTCCAAT GATGTGGGCT ATGGATCAAG TTAGGAAGAT AGGTTCTTCA TCATCCCTAT AA
|
Protein sequence | MGIIDEAKRG QITDEMRAIS KLEGIPVEKV RNRISEGKIM LIRNAKYPSR KLVPIGKGLT TKVNVNIGTS SEVVDLDMEL QKVKVANKWG DTLMDLSTGG DLDAIRRDII KASDLPVGTV PVYQIFIESF KKKSGGAYFT EDELLNTVEK HLKDGVAFMT IHAGITKDLA IRALKSDRII PIVSRGGDMI AGWMIHNNSE NPYRKNWDYV LEMFKEYDAV ISLGDALRPG ATGDAHDEFQ IGELLETARL VKSALQKGVQ VMVEGPGHVP LNEIAWDVKL MKKLTGGVPY YVLGPLPIDV GAPYDHIASA IGAAISSASG VDLLCYLTPA EHLGLPTVKQ VEEGAIAYRI AAHAGDVVKL GRKARKWDDE VSYYRGKLDW ENMISKLIDP QRAYQVYTQF GTPKVKACTM CGGYCPMMWA MDQVRKIGSS SSL
|
| |