Gene Ssol_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0447 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp399818 
End bp401212 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content36% 
IMG OID 
Productconserved hypothetical protein 
Protein accessionACX90730 
Protein GI261601127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.608112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTG ATAAGCTGAT TGATAAATTT AGAGTTAATT TAGATAAGTA CAAAAAAATG 
GGATTGAATC CGCTATCGCT TGCTACAGGA TGTGCAGTAA AAGTAGATTT AATAGACACA
GTTTACCCAG CTATACGGAA GATAAGAGAC GAATTGGTGA AAAGAAATAT AGAAATATTA
CCTAGAGAAG ATGCTGACAT TTTCGTAAGT AGAGAGAAGA TTTACATAAA GAGAGTGATT
AACGGCGGAG AGTTTGACGC AGATAGAGCC GTTAGCCTTA TTCAGGTTAA CCAAGAGACA
TCAGGGAATC CTGATAAGTT TGCTGAGTTC CTATTGAAAG TTTACACTTC GATAAAAACT
ACTAGAAAGC TCACAATAGG TAAAGGTCAT TCAATAGTTA CCTCAAATCC TAAGGGTGAA
GTGGCAGTAT TAGATCTATT CAGACTAGAA GGAGGAAAGG AGAGATCTTA CACTGTTGCA
AATAACGACA CTATTCAAAT AGTAGATCCT TTGGATGACC CTGGATCTCA GATGCAAGTT
GATGTGGCTA TTTCCAATTC TTTAAATGAC CTTTTTACTA AGGGTGTTTT TCAAGATTTA
AGGATGATTC CGGTCGTTGA TGCTCCAATG GATGATCTAA AGGAACAATT GCTGAAAAAC
GCTGAAAATT ATTCTAGGGA ATATTCAATT GAATTATTAA GCGATGTTCA ACCTAATTCC
AAAACATTAA TGATAGGAGC TACTGTAATA GGTAAGTCCG ATCATGAATT ACCAACATAT
TACAATAGGG TTAATGAAAA CATGGAAATC CTAGTGACCA GACCAGTTGG TGAATTAACG
CCAATAAATG TGTTTATGTG GATGCTGACT GTTCCCGAGT TAATAGAAGA TATGGAAGCT
AGGGGAATTA CTATACAGAG AGTAGAAGAA GCTAAGAGAA AAGCCCTAAT GTACATGAGG
AAACCTAATA ATGAAGTAGC TAAAATTATA TACGATCATC TACCACCCTT TGGAGGCTCA
TTTGACGAAA ATTCTCATAT AGCCATGACT ACTGATGTTA CCGGTCCAGG ATTATTTGTA
ATAAAGGAAT TTGCTGAAAA GGCACAAGTA GATGTGGAGT TGTTTGATAT CCCAGTAATA
GATCCGGATA TACATGAGTT CGCCACCGAG AATTTTATCA TACCAAATTC TACTGCAGGG
ACAAACGGAG CCATAGTTAT TTTCGCTCAT AAGAGAGTTA TAGATGAAAT TTTCGATGAA
TTGAAAAGGA AGTCGCAAGA ACCTTATATC ATAGGTAAGG TTACTGGGAA AGGAAATGGT
ACCGTTATAG TCCCACCAAC TATTACAAAG TACATTCATA GAAATAATGT GTTAAGACAG
TTCAAAATAA GGTGA
 
Protein sequence
MAIDKLIDKF RVNLDKYKKM GLNPLSLATG CAVKVDLIDT VYPAIRKIRD ELVKRNIEIL 
PREDADIFVS REKIYIKRVI NGGEFDADRA VSLIQVNQET SGNPDKFAEF LLKVYTSIKT
TRKLTIGKGH SIVTSNPKGE VAVLDLFRLE GGKERSYTVA NNDTIQIVDP LDDPGSQMQV
DVAISNSLND LFTKGVFQDL RMIPVVDAPM DDLKEQLLKN AENYSREYSI ELLSDVQPNS
KTLMIGATVI GKSDHELPTY YNRVNENMEI LVTRPVGELT PINVFMWMLT VPELIEDMEA
RGITIQRVEE AKRKALMYMR KPNNEVAKII YDHLPPFGGS FDENSHIAMT TDVTGPGLFV
IKEFAEKAQV DVELFDIPVI DPDIHEFATE NFIIPNSTAG TNGAIVIFAH KRVIDEIFDE
LKRKSQEPYI IGKVTGKGNG TVIVPPTITK YIHRNNVLRQ FKIR