Gene Information |
Locus tag | Ssol_0984 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sulfolobus solfataricus 98/2 |
Kingdom | Archaea |
Replicon accession | CP001800 |
Strand | + |
Start bp | 925282 |
End bp | 926463 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | glycosyl transferase family 2 |
Protein accession | ACX91228 |
Protein GI | 261601625 |
COG category | |
COG ID | |
TIGRFAM ID | |
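The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it agrees with the stated 1182 bp) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
START_BP = 925282
END_BP = 926463


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1182
print(protein_length(length_bp))  # 393
```

Both results match the Gene Length (1182 bp) and Protein Length (393 aa) fields of the record.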
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00499556 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGATA TTCTGCTAAT AACGTTAAGT GGTATAGTAT CCTCGTGGAC TGTGTATAAT GGTATATTAT CAATACTCGG GGTAACGTGG AAACCGTTCG AGAGCAAAAA TCATAGTGGA ATAACATTTA GTCTGATAGT TCCAGTCAAA AATGAGGAAA GAGTGCTTCC TAGATTATTA GATCGGCTCG TAAATCTAGA ATATGACAAG TCTAAATATG AAATAATAGT AGTAGAAGAT GGTTCTACTG ATAGAACCTT CCAAATTTGT AAAGAGTACG AGATAAAGTA TAACAACCTT ATAAGATGCT ATAGTCTTCC GAGAGCTAAC GTACCGAATG GGAAAAGTAG AGCGTTAAAT TTTGCCTTAA GGATTAGTAA GGGAGAAATT ATAGGTATAT TCGATGGAGA TACAGTACCA AGATTAGATA TTTTGGAGTA CGTTGAACCA AAATTTGAGG ATATTACCGT TGGTGCAGTT CAAGGAAAAT TAGTTCCAAT AAATGTGAGG GAAAGTGTAA CAAGTAGATT AGCTGCTATT GAGGAATTAA TATATGAATA TTCAATAGCT GGAAGAGCTA AAGTTGGGCT CTTTGTACCA ATTGAAGGAA CTTGTTCTTT CATAAGGAAA AGTATAATTA TGGAGTTAGG CGGATGGAAT GAATATTCTC TCACTGAAGA TCTAGATATT AGCCTCAAAA TCGTTAATAA AGGCTGTAAG ATCGTTTATT CTCCCACAAC TATTAGTTGG AGGGAAGTTC CAGTTAGCTT GAGGGTTTTA ATTAGGCAAA GATTAAGATG GTATAGAGGG CATTTAGAAG TGCAATTAGG CAAACTTAGA AAGATCGATT TAAGAATAAT AGACGGTATA CTAATAGTGC TTACGCCATT CTTTATGGTC TTAAATCTGG TGAATTACTC TCTAGTATTA GTATACTCTT CTTCCCTATA CATTGTCGCA GCAAGCTTAG TTTCTTTAGC CTCTCTGCTT TCATTATTAC TTATAATCTT AATAGCAAGG AGACATATGA TAGAGTATTT CTATATGATT CCTTCGTTCG TTTATATGAA CTTTATTGTC GCACTAAACT TCACTGCAAT ATTTTTAGAG TTAATAAGAG CACCTAGAGT GTGGGTAAAA ACTGAAAGAA GTGCCAAGGT TACGGGGGAG GTCATGGGAT GA
|
Protein sequence | MLDILLITLS GIVSSWTVYN GILSILGVTW KPFESKNHSG ITFSLIVPVK NEERVLPRLL DRLVNLEYDK SKYEIIVVED GSTDRTFQIC KEYEIKYNNL IRCYSLPRAN VPNGKSRALN FALRISKGEI IGIFDGDTVP RLDILEYVEP KFEDITVGAV QGKLVPINVR ESVTSRLAAI EELIYEYSIA GRAKVGLFVP IEGTCSFIRK SIIMELGGWN EYSLTEDLDI SLKIVNKGCK IVYSPTTISW REVPVSLRVL IRQRLRWYRG HLEVQLGKLR KIDLRIIDGI LIVLTPFFMV LNLVNYSLVL VYSSSLYIVA ASLVSLASLL SLLLIILIAR RHMIEYFYMI PSFVYMNFIV ALNFTAIFLE LIRAPRVWVK TERSAKVTGE VMG
|
| |
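The relationship between the Gene sequence and Protein sequence fields can be sketched in a few lines. This is an illustrative example, not the database's own pipeline: it assumes the 10-letter blocks are purely cosmetic, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The helpers `strip_blocks`, `translate`, and `gc_fraction` are hypothetical names chosen for this sketch.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Translation table 11
# uses the same assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, dropping one trailing stop if present."""
    seq = strip_blocks(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = strip_blocks(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last twelve bases of the gene sequence above.
print(translate("ATGCTGGATA TTCTGCTAAT AACGTTAAGT"))  # MLDILLITLS
print(translate("GTCATGGGAT GA"))                     # VMG
```

The two printed fragments agree with the start (MLDILLITLS) and end (VMG) of the listed protein sequence. Passing the full Gene sequence field to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.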