Gene Ssol_1305 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1305 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1207119 
End bp1210505 
Gene Length3387 bp 
Protein Length1128 aa 
Translation table11 
GC content33% 
IMG OID 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionACX91541 
Protein GI261601938 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCCTC TCTCCTTTAT AATGTTAATA ATATCCTTTA TACTTGCAGT TAGTATATTT 
TTTGTTAGAT CAAGAGTAGC CTCATTTATT TTGAGTTTTG TATCAATAGC CATAAATATC
CCATTTTTGG TTTTTAGAGG TATTTTTGAA AACATATTCG TCTCGAAATA TGTTGGAGAC
TTCGGAATTG TAGTAAACAA TTTCAACTAT CCATTTATAA TAACTATTAT AATTATTACA
CTATTGTCCG CAATATATTC CTTAAGGTAT ATGGAACATA AGTTTGATGA GGAGAGAAAA
AGTAGCTGGG GTCTTTACTA TGCTTTGTAT ACTTTATTTG CCCTTTCGAT GCTATACACT
GTACTATCGA CTAATCTACT GGAATTATAT ATATTTTTGG AGATTTCGTT AATTTCGTCA
TTTTTACTGA TAGCGCTTTA TGGGTATGGT GATAGACGCA GAATAAGCCT TATGTATTTC
ATATGGACGC ACATTGGTAC AATATTACTC TTGGCTTCAA TAATAGTAAT AGGTCTTTCT
ACAGGCTCTA TGAACATTTA CGTTGATGCT TATACTTTTG CCAATTATTC AACTATATCC
TATGGAATCC TAGTATTTAT AATAGCAGTA GTTGGTATGT TTGTAAAAGG TGCTCAAGCT
GGTTTCAACA TATGGCTACC GTATGCCCAT GGCGAAGCAC CAACTCCTAT TTCAGTATTG
TTAAGTCCAA ATATGGTTGG TTTAGGAATA TTTGTAGTAA TAATTTACTA TTATCTATTT
CCCACTATGT CCTTCTTAGC TCCCATATTC ATAGCTTGGG CCATTATCAC GATGATTTAT
GGTGGTATTA ACGCTCTGGC TCAAAAAGAC TTTAAGAGGT TTTTAGCGTA TTCTAGTGTT
TCTCAAATGG GTTATATGCT ATTAGGTGCT TCTATAGCAT TTTTAAGCGG ATTATCAAAT
TCCATTATTT CGTTGCCTAT AGGAATATTG GCGAGTATTT TGATCTATGT ATCTCATGGT
TTTGGCAAAG CTATATTATT CATGAGCGCT GGTGCGTCAA TTACCGAACT TAATGAAAGG
AATATTGAAA AGTTAGGGGG ACTATATCTT TCTTCTCCCC TTCATTCAAC ATTAGCGTTT
ATAGGCGCAT TGAATCTTTT GGGTCTGCCT CCTACAATTG GACTGATAAG TGAAGCATTG
TTGATATTTT CATTAGGTAC TATATTAGAT AAGATAGGAA TAGTAGGTTT TATAGTGATA
GTAGCTTTTA TAATGATCGC AATAGGTACC TCTTCGGCTT ATATAGGTTA TTTGTTCAAG
ACTGTGTACG CAGGTAAAAA GGAAACCAAG AATATTGATA ACGTAAAAGA GTATTCTATT
CCAATGTTAT TAATAGGAAT ATTTAGCATA ATTACGTTCT TTATTCCTCA ATATGTATCA
CCTTCTTTAG TATTCACGTC ACTGTTTTCA AATTCTACAA TTTTGCCATT CATAGCGTTC
TTGCCCGTTT TAGGTTCGCT AATCGCGCTA ATAACTCCAA AGTCATTAAA TCAAGATTTG
AGAGGCGCAA TAGTAGTGGT ATCAATAGGT ATTTCAATGG TCTTATCTGC TGTGTTATTA
GTTAATAATT TAGGAAAACC CTTATTCGGA CACCCTCAGT TATCATATTC ATTTGGTTAT
CTACAATTTA GTGCTAATTT GCTTCAATCC ATTTTAGCCC TATTTGTATC TTCACTATCA
TTCTTTATAG CATTATATAG TATAGGATAC ATGAGAGAAG ATAACGTTTT AAGAAGGTAC
TGGGGTTTCT TTGGACTTTT CGTAACATCA ATGTTATCTG TTGTTTTAGC TAATAACGTT
TTATTATTTA TTGCTGGATG GGAGGGAACT AGTCTAGCAT CTTATGGTTT AATTAGTTAC
TGGCTAGATG ACAATGAAAG GAATGTAGTT GGAGATTTTG GAAGAAGAAT ATTCGGAATC
GAAAATGTCT CCAAACCAAC AACTAGTGGG ATTAGAGCTT TAATTTTCAC TCGTGTAGGT
GATGTTGGCT TGCTAGCGGT ACTAGGATAT TTGCTTTCAT TATCAAGTTA CAATTATATC
TTATATCCCA TTTCAAATGT ATCGTCCACA GTATTTTCTG CATTGTACGC TGTAGCCTCA
CATCCAGAAG GGTGGTTAAT ATTGCTAATA TTCTTTTTAG GTGGGCTTGC TAAAAGTGCA
CAATTTCCTT TCACGCAATG GCTATTGACT GCTATGACTG GTCCCACACC AGTTAGTGCT
CTAATACACG CAGCTACTAT GGTTAATTTA GGGGCTATTC TTACATTCCT CACCTATCAA
TTTATACCAA TAAATTCTAA CACTTATTTG TTCTTTGCCA TAATGGTAGG AATAACATTA
TTTACAGCAT TATACACAAG TATTAACGCT CTAGCCTCAA ATGAACAAAA GGTAATTTTA
GCTTATTCTA CAGCGGATCA AATATCGTTA ATGATATTCT CATCTTCTTT AGGTGCTTTA
CTTGGAAATG TAAGTTTAGG AATAATAATA GGGCTTATAC AAATGTTCGC ACACGGACTA
TATAAGGCAT CTCTCTTTAT GAACGCTGGT TCCGTAATAC ATTACACAGA AAGTAGATAT
GTAGCTTCAA AACCATTATT ATATAAGGAG ATTCCTTCTG TGTTTATTTT ACAACTAATT
GCTGCATTAA ACCTAGCCAA TTTGCCGCCT TTAATAGGAT TTTGGGCCCA CAATATCATA
GGTAATTTAG CTAGTAGTAC TTCAACTACG ATATTTTACT TGTATATAAT TCTGGAATTT
CTAGGTTCAA TCTATATATT AAGATATATT GCAAGAACGT TCTTATGGAA GGGAGAGACA
GATAAGACTC ATGAACATGG GCACTTAAGT ATCTTAATGG TTATAAGTCC TGCATTCTTG
ATATTAGGAT CAATAATACT AGGTGTACTT TACTTTAATC TGGCGACCTT CTTTAATATT
ATACAATTTT CTGAGATAGA TTTCCTTAGT TTAACATTAT CTATAATAGG TTGGATAATC
GCTTTTGCCA TATATACTAG ATATTTGAAT TTAAGTTCTG TAAAGCCCCT AATAGATTTT
GTTTATTATG GTTGGTATGT GAATCCAGCT TTTGATAAAT TTGGATATCT TTTCAAAGAT
TTTGCTGGTG CTCTATTCAA AAACTTTGAG AGAGGAGTTA TCGATTTAGG GCTGAACGAG
AGATTACCTA AATCTATAGT AAACTTTGGA TCAAGAATCT ATAATGTAGT CGAGGATCAT
ATTCTTGGGG ATTATATCAT GTTATATGCA TGGGGTATAG TTTTACTTTT AATTATAGTT
TTGATTCTAT TCGGGGTGAT TGGATAA
 
Protein sequence
MFPLSFIMLI ISFILAVSIF FVRSRVASFI LSFVSIAINI PFLVFRGIFE NIFVSKYVGD 
FGIVVNNFNY PFIITIIIIT LLSAIYSLRY MEHKFDEERK SSWGLYYALY TLFALSMLYT
VLSTNLLELY IFLEISLISS FLLIALYGYG DRRRISLMYF IWTHIGTILL LASIIVIGLS
TGSMNIYVDA YTFANYSTIS YGILVFIIAV VGMFVKGAQA GFNIWLPYAH GEAPTPISVL
LSPNMVGLGI FVVIIYYYLF PTMSFLAPIF IAWAIITMIY GGINALAQKD FKRFLAYSSV
SQMGYMLLGA SIAFLSGLSN SIISLPIGIL ASILIYVSHG FGKAILFMSA GASITELNER
NIEKLGGLYL SSPLHSTLAF IGALNLLGLP PTIGLISEAL LIFSLGTILD KIGIVGFIVI
VAFIMIAIGT SSAYIGYLFK TVYAGKKETK NIDNVKEYSI PMLLIGIFSI ITFFIPQYVS
PSLVFTSLFS NSTILPFIAF LPVLGSLIAL ITPKSLNQDL RGAIVVVSIG ISMVLSAVLL
VNNLGKPLFG HPQLSYSFGY LQFSANLLQS ILALFVSSLS FFIALYSIGY MREDNVLRRY
WGFFGLFVTS MLSVVLANNV LLFIAGWEGT SLASYGLISY WLDDNERNVV GDFGRRIFGI
ENVSKPTTSG IRALIFTRVG DVGLLAVLGY LLSLSSYNYI LYPISNVSST VFSALYAVAS
HPEGWLILLI FFLGGLAKSA QFPFTQWLLT AMTGPTPVSA LIHAATMVNL GAILTFLTYQ
FIPINSNTYL FFAIMVGITL FTALYTSINA LASNEQKVIL AYSTADQISL MIFSSSLGAL
LGNVSLGIII GLIQMFAHGL YKASLFMNAG SVIHYTESRY VASKPLLYKE IPSVFILQLI
AALNLANLPP LIGFWAHNII GNLASSTSTT IFYLYIILEF LGSIYILRYI ARTFLWKGET
DKTHEHGHLS ILMVISPAFL ILGSIILGVL YFNLATFFNI IQFSEIDFLS LTLSIIGWII
AFAIYTRYLN LSSVKPLIDF VYYGWYVNPA FDKFGYLFKD FAGALFKNFE RGVIDLGLNE
RLPKSIVNFG SRIYNVVEDH ILGDYIMLYA WGIVLLLIIV LILFGVIG