Gene Ssol_0621 details

Gene Information

Locus tag: Ssol_0621
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 570096
End bp: 571706
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: Cytochrome b/b6 domain protein
Protein accession: ACX90895
Protein GI: 261601292
COG category:
COG ID:
TIGRFAM ID:
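
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 570096
end_bp = 571706

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.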


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTCTA AATTATCAGA TTGGTTTAAG GAGAGATTAG GACTAGATGA TCTACCCTTC 
TTCAGAACTC CAGATTATAT GTATAAAGTA GACTATTGGC TAGGTGCGCT AGTAGCTTCT
GCGTTCATTT ACACCGTAAT AACTGGGCTT ATCCTTTTGT TATATTATAA TGCCCAGGCT
GGATACAATT CAACGGAGTT CGTCATAAAT AGTGTACCAT ATGGTTCGGT TGTGCTTTAC
AGTCATTTAT ATGGTGCATA TGCTATGATA ATTCTAGCCT ATATTCACAT GTTTAGGAAT
TATTTCGCTG GAGCCTATAA GAAGCCTAGG GAACTATTAT GGATAATTGG AGTTATAATG
CTTATACTTA CTTTGGGTAC CGCGTTCTTA GGATATAGTT TAATAGGTGA TGCGTTGGCA
ACTAGTGCTG TGGATGTTGG AGAAGGTATA ATTAGTAGTG TACCAGGACT TTCCATCTTT
ATTCCGTTTC TATTTGGGAA TTATGATGCT GGGGATTACG GTAGGGTATT AGCATGGCAC
ATAATACTTA CCGCATTAAT AGGCTTACTA TTCGTCTTTC ACTTCTTCTT AGCAGAACAT
TACGGAATGA TGCCTTCTAG GAAAGTTAAG GATAAAGTAC CAGCGGTATA CGCAAAAGAA
GAGTGGCAGA AATTCAATCC ATGGTGGCCT AGGAATTTCG TTTATATGAT GTCTTTAATA
TTCTTAACTT GGGGATTCAT ACTCATAATA CCAAATGCCT TCGCATACCT TAATGGATTA
CCACAACAAC TGAATCCATT CTTAAATCCA AAACCAGCTC CTCCACCAAA CAGTCCAGCT
GCTGCGCATA TAACAACTTA TCCACCATGG TTCTTCTTGT TCTTATACAA GATAGCGGAC
TTCACGAGTG ATGTAGTAAT ATTCTTATTT ATTGGCGTGA TTATTCCACT AATATATCTT
ATCATAGTGC CATTCTTGGA TAGAACAGAA TACTTGCATC CAATGAAGAG GAAGGTATTC
GTAGGGATAG GTATTCTAAT TGTAACTTAC TTAACACAAA CAACCATATG GGGTGACTTA
ACACCTGGTG TAGAGATTCC AGTTAGTCAA CAAGTTTTAG TATATTTACC ACCAGCAATA
ATAGTAGCTT TAGGGCTTGC TGCCATTAGA CCAAAAACGG ATAATAAAGG GAATATTAAA
GGGACAATAG GTCCCATAAC AGCATTAGCA TTTATCTTAG TTTCATCTCT GTTTATCATA
GCTGGAATCG AGCTACTAGC TAACCCTACA ATTTTAACGG TTGGAGTATT TATTCCTATT
GCAGGAATAT TTATACTAAC TACTAAAAAG ATTGCACCTA TAGCACTAAA TGCGGAACAA
AGAAGTGAGA ATGCATCAGT GAGTGAAGAA AGAGCACCAG AGTGGAAGAA GAAACTGGCA
GAAGGCTTAG TGGCATTGCT ATTCATTTAC GCAATAATAT TGGCAGCGCA AATATGGACT
ATACCAGCTA CTGGATACTT CTCTAATTTA TTTGGTATAG ATTTAGGGCT AATCCTCCTA
ATGCTAGGTG AGGGTATCTC CCTATATCAT TACGTGATAT ATAGGAAATA G
 
Protein sequence
MSSKLSDWFK ERLGLDDLPF FRTPDYMYKV DYWLGALVAS AFIYTVITGL ILLLYYNAQA 
GYNSTEFVIN SVPYGSVVLY SHLYGAYAMI ILAYIHMFRN YFAGAYKKPR ELLWIIGVIM
LILTLGTAFL GYSLIGDALA TSAVDVGEGI ISSVPGLSIF IPFLFGNYDA GDYGRVLAWH
IILTALIGLL FVFHFFLAEH YGMMPSRKVK DKVPAVYAKE EWQKFNPWWP RNFVYMMSLI
FLTWGFILII PNAFAYLNGL PQQLNPFLNP KPAPPPNSPA AAHITTYPPW FFLFLYKIAD
FTSDVVIFLF IGVIIPLIYL IIVPFLDRTE YLHPMKRKVF VGIGILIVTY LTQTTIWGDL
TPGVEIPVSQ QVLVYLPPAI IVALGLAAIR PKTDNKGNIK GTIGPITALA FILVSSLFII
AGIELLANPT ILTVGVFIPI AGIFILTTKK IAPIALNAEQ RSENASVSEE RAPEWKKKLA
EGLVALLFIY AIILAAQIWT IPATGYFSNL FGIDLGLILL MLGEGISLYH YVIYRK