Gene Ssol_0467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0467 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp417106 
End bp418641 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content39% 
IMG OID 
Productcytochrome c oxidase subunit I 
Protein accessionACX90750 
Protein GI261601147 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAC TTTTAAATCA GATAAAGGAT ACTGTTATCC CTAAAAATAC GTTAAACGTA 
GTTTGGCTTT ACACTATAGG TTCTATATTC TGGCTAGGAG TTCTGGGAAT AGCAGCCATG
AATCTGAGAA CGTACTTGAC ATATAATAGT AATTCCCCCA ACGTGGGAGA ACTATACTAC
AGTGCATTAA CCATTCACGG CTGGGCTGGT ATGTTCGGAT TTGTACCTTT AGCTGCAGCT
GCTGTTATAG CATTTGCAAT GTATAAGAGT AACTTATCTA TAGTTCACAC TAAGTTAATG
GCAATTTTCT TCTGGTTGTC TAATATCTTC TTAGCGATTG GACTAGCTGG ATCACCAGAT
ATGGGTTGGT ATATGTACCC TCCATTGGCG ATAGAGTCTG GTAGTAACTT CCACGCTTTT
CTATTCTATA CTACACCAGC TCTAATGGGG ATGGCATATC TATCCTTAAC CTTAGCTTTC
ATACTACAGA CTGCAATGTT CATAATACTT ATTGCAGATG CATATGCAAC AAAGCCAAAA
AATGAAAGAC TAAACATTTT TGGAGCTTAT GGAGTAGCCT TTGCAATAGT TATTGCAATA
ACGCTTCCCG CTCTTGCTGC ATCAACGCTT TGGTACACAC TGTATTTCTT TGCAGGAGTT
CCAGTGAACC CATTACTGTG GGCATTGCTA TTTTGGTTCT ATGGGCATCC AGTAGTATAT
TACGTTCCAT TCCCGTTATT TGGTGCATTA TACTACTATA TTCCGAAATA TGCTGGAAGA
TCTCTCTTCA GTGAAAAGTG GGCTAGATGG AATATATACT TGCTATCAAT AGGCTCAATG
CTTATATGGG TTCACCACCT ACAAACATGG CCATTACCAA TAGACTTGAG AATATGGGTA
AACTTATCCA CATTAATTTT AGCCTCGGGA TCTGGTCTTA CCGTATTGAA TTTGGGACTC
ACAATACTTC TTTCTCAAAA ATATAATTGG AAAGATCCAG TAGGAATGGG AGGACTAATT
GCATTAGTAG GATTCATATT AGCTGGTGTT CAAGCACTCG TTTTACCAGA GAATTCAATA
AATCCACTAT TCCACAATAC CTATTATGTA GTAGGACATT TCCACTTAAT GATATGGACT
TTAATAGTAA TGGGCTTCAC TACAGTATTT TTGGATATGC TAAAAAGCAC ATTTGGTGGG
TTCAACTTTA GCGAGCAAGC CTCTAAATGG ATGAGAATTG GGATGATATG GTGGACTGCA
CCATTTTTAG GTGTAGGATA TACGATGTCA GTAATTGGTT ATCTAGGAAT GCTGAGAAGA
ATGATAGCGT ATCCATCAGT ATTTCAACCC TATAACCTAT TGGAATCAAT GCTAGCTGAA
ATAGGGATTC CGGGACTATT AGTCACAATC TTTGTGGCAA TTGTCGATGC ATTGGCTTAT
GCTTCTAAAC AAGGTATTTT TTCATCTCCT TCTCCCTCAA CTTCTCCCTC TAATTTACCC
GTAATGGCTA AAGATGGGGT GAAAAACAAT GGGTGA
 
Protein sequence
MSQLLNQIKD TVIPKNTLNV VWLYTIGSIF WLGVLGIAAM NLRTYLTYNS NSPNVGELYY 
SALTIHGWAG MFGFVPLAAA AVIAFAMYKS NLSIVHTKLM AIFFWLSNIF LAIGLAGSPD
MGWYMYPPLA IESGSNFHAF LFYTTPALMG MAYLSLTLAF ILQTAMFIIL IADAYATKPK
NERLNIFGAY GVAFAIVIAI TLPALAASTL WYTLYFFAGV PVNPLLWALL FWFYGHPVVY
YVPFPLFGAL YYYIPKYAGR SLFSEKWARW NIYLLSIGSM LIWVHHLQTW PLPIDLRIWV
NLSTLILASG SGLTVLNLGL TILLSQKYNW KDPVGMGGLI ALVGFILAGV QALVLPENSI
NPLFHNTYYV VGHFHLMIWT LIVMGFTTVF LDMLKSTFGG FNFSEQASKW MRIGMIWWTA
PFLGVGYTMS VIGYLGMLRR MIAYPSVFQP YNLLESMLAE IGIPGLLVTI FVAIVDALAY
ASKQGIFSSP SPSTSPSNLP VMAKDGVKNN G