Gene Ssol_1994 details

Gene Information

Locus tag: Ssol_1994
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1788801
End bp: 1790123
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: NADH/Ubiquinone/plastoquinone (complex I)
Protein accession: ACX92204
Protein GI: 261602601
COG category:
COG ID:
TIGRFAM ID:
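
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information record.
start_bp = 1788801
end_bp = 1790123

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS is read as consecutive codons of 3 bp each.
codon_count, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1323 0 440
```

Under these assumptions the result agrees with the listed Gene Length (1323 bp) and Protein Length (440 aa).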


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATTG AACCCATACT CTTACTATTA ATTCCTATAA TAAGTAACGT AGGCTTTTTC 
AAACTCAAGC TAATTAAGAC CCTATCCGTA CTATCAGCAG TACTAACATC AATCATCAGC
ATCTTCCTTT ACTTCTTAGC ACCAATCAAA AACTCTTTCT TCTTCATCAC TAAGTTCACA
ACGTTTTTCC TATTGGTGAT TGCATCAATT TATCTCCTAT CTACCCTTTA CTCCATGAAT
TATATAAAAC CAAGCAAAAT AGTAAGTGAA AGATTATACT ACATTCTACT AAATTCCTTC
GCATCATCAA TGCTTTTTAC CGTTATTATG AACAATTACG GTCTAATGTG GGTTGGGGTA
GAACTCACAA CTGTGACCTC AGCTCTTCTA ATAATAGCGG AAGCATCTGA AACCTCACTG
GAAGCCACGT GGAGGTATAT AATCATAGTA TCCGCTGGAG TAACCTTAGC CCTATTCTCA
ATAATTTTCA TCTACTATAA TTACCATACA TTAACTGTAA CGGAAATACT TACAAAACCT
GAAAACAACA TAATAACTAA ACTTGCGGTA GCTCTAGCTT TAATAGGATT TGGAACAAAA
GCAGGAGTTT TCCCCATGTA CACGTGGTTA CCAGATGCTC ATAGCGAAGC GCCATCTCCA
ATAAGCGCAT TATTTTCTGG AGTACTACTT CCAGCCGCAA CTTACGTGGT GTACATGGTA
TATCAAGTAA ATCCATTAAC CAATATCTTT GTAATATTCA CAACTTTATC GATAATAACC
GCATCGATTA TCCTAACCTA TCAATGGCAT ATAAAGAGAA TGTTTGCATA CTCAACCATA
GAAAACATGA ACCTAGCCTT GCTTGGACTT ACAATAGGCC AACCCCTCGG AGCAATAATC
CTCCTTCTTG CACACGCATT TGGCAAAGCG GGTGCCTTTT ACTCTAGCGG AATTGTATTG
AAAGTCCTTG GAGAAAAGAG AATTGAAAAC ATAGGCGGTT TACATACAAA ACTAAAACTC
ACCTCAGTAT CGCTATTATT GTCCTCTCTA GCTGTAACTG GTACGCCACC CTTTGCAACT
TTCATAGGAG AATTCTTCAT ATTACAAACA TTAATTCAAA AAGGTTATAT TATAGAGTTT
ATATTAATAG TAATTTCTCT GGCAACAGCT TTCATCTCAA TAAACTATAA CGTTACCAAA
ATGATATTCA CTCAAAGAGA GCTGACAGTC TCAGAAGAAC CCAAGCTAAT CACGTTCATT
TCTCTTGTAT CATCTATAAT TCCACTCGTT CTCGGTATAC TTTTACTGGT GATCCTTTCA
TGA
 
Protein sequence
MNIEPILLLL IPIISNVGFF KLKLIKTLSV LSAVLTSIIS IFLYFLAPIK NSFFFITKFT 
TFFLLVIASI YLLSTLYSMN YIKPSKIVSE RLYYILLNSF ASSMLFTVIM NNYGLMWVGV
ELTTVTSALL IIAEASETSL EATWRYIIIV SAGVTLALFS IIFIYYNYHT LTVTEILTKP
ENNIITKLAV ALALIGFGTK AGVFPMYTWL PDAHSEAPSP ISALFSGVLL PAATYVVYMV
YQVNPLTNIF VIFTTLSIIT ASIILTYQWH IKRMFAYSTI ENMNLALLGL TIGQPLGAII
LLLAHAFGKA GAFYSSGIVL KVLGEKRIEN IGGLHTKLKL TSVSLLLSSL AVTGTPPFAT
FIGEFFILQT LIQKGYIIEF ILIVISLATA FISINYNVTK MIFTQRELTV SEEPKLITFI
SLVSSIIPLV LGILLLVILS
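
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the code below joins the blocks, computes a GC fraction, and translates codons. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in which codons may act as start codons). Only the first 30 nucleotides of the gene sequence are used here, so the GC value printed is for that fragment and not the 37% reported for the whole gene.

```python
# Build the standard codon table (amino acid assignments shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
fragment = clean_sequence("ATGAATATTG AACCCATACT CTTACTATTA")

print(len(fragment))        # 30
print(translate(fragment))  # MNIEPILLLL
print(round(gc_fraction(fragment), 3))
```

The translated fragment matches the first block of the protein sequence listed above (MNIEPILLLL). Applying the same functions to the full gene sequence would be the natural next step, with the caveat that the stop-codon handling here is a simplification.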