Gene Ssol_1966 details

Gene Information

Locus tag: Ssol_1966
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1750874
End bp: 1752217
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: Alpha-amylase
Protein accession: ACX92177
Protein GI: 261602574
COG category:
COG ID:
TIGRFAM ID:
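
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1750874
end_bp = 1752217

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1344 0 447
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.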


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0717936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAG TAATAGTAGG ATTTGAAGTT CACCAACCAT TCAGGATTAG AAGAGATTTC 
TTCTGGAACC CGCGATTTAG ACAAAAGCTA GAGGATAGAT TTTTCGATAC TGAGAGAAAT
AAAGAGATAT TTGAGAGAAT AAAGAAGAAC TGCTACATCC CTGCAACAAA CATAATACTA
AGCTCTATTG AAAGAGCCGA AGAAGAAGGA AATAACGTTA AATACTTCTT TTCAATTTCA
GGGACTTTCT TAGAGCAAGC GGAGAGATGG GGAAGAGAGG TAATAGAATT ATTTCAACAA
TTGGCATATA CACATAAAGT TGAATTTCTA GCGCAAACCT ATTATCATTC TGTAACCAGC
CTTTGGGAGG ATAAAAGTGA ATGGAAAGAG CAAGTTAAGA TGCATAAGGA TACGATAAAG
TCTTATTTTG GACAATATCC TACCACTTTT GAAAATACTG AATTAATTAC TAAAAAGGAT
ATTGTAGAAG AAGTTGAAAA AATGGGCTTT AAGATGATGT TAAGTGAGGG AACTAATAGA
AATTTAAATG GACGAAGTCC AAATTACGTC TATAAATTGA AGGGACATGA GATTAGAATG
TTGTTTAGGA ATTATACGTT AAGTGATGAT ATAGCCTTCA GATTTTCTAA TCCAAATTGG
GATCAATATC CGTTAACAGC TTCCAAGTAT GCTGATTGGA TAAGTAGAAG TGAGGGAAAT
GTAGGATTAA TATTCGTAGA TTACGAGACT TTTGGAGAAC ACCACAGAGA ACAAACTGGA
ATTTTAGAAT TTCTTAAATG GTTACCAATA GAGCTTAACA GTAAAGGAGT TGAAATGATG
ATGCCAAAGG AAGTTTACAA TGACGTCTAT GATGAAATAG AAATTGCTCA TACTACCTCG
TGGGCTGATA TAGAAAAAGA TGAGAAAAGT TGGTTGGGAA ATATAATGCA ATGGGCTTAC
GATGATGCGG TTAGAAGGGC TGAGATGCCC TCAAGGGAAT TGGGTAATGA GTATTTAAGG
GTCTGGAGAT ATTTTACTAC AAGCGATAAT TACTATTATC TTTATTTAGG GCATGGGAGT
CCAGCTGAAG TACATTCCTA TTTTAACGCC TTTGGATCCC CTATAGATGC GTTTATAAAT
GAATTTTATG CAATATCGAC ATTTATACAT GAAGAAATAA GTAAATTAAA TATTAAGAAT
GAGCCTTATA TATTCATATT AGGAGATAAG AGAGCGTCGA TAGCTTGGAA TGAAAAAGAG
TTCATGGAAA TTGTAATGAG AGATGAAAGG TTTAAAACTC ATTTGAAAAA CTTAAGGCTG
TGGTTAGGAA ATGAAAAGGA TTGA
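
The gene sequence is displayed in blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example is applied only to the first displayed line, not to the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGAAAAGAG TAATAGTAGG ATTTGAAGTT CACCAACCAT TCAGGATTAG AAGAGATTTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 3))  # 60 0.333
```

Applying the same helpers to the complete gene sequence would give a value that can be compared with the GC content field of the record.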
 
Protein sequence
MKRVIVGFEV HQPFRIRRDF FWNPRFRQKL EDRFFDTERN KEIFERIKKN CYIPATNIIL 
SSIERAEEEG NNVKYFFSIS GTFLEQAERW GREVIELFQQ LAYTHKVEFL AQTYYHSVTS
LWEDKSEWKE QVKMHKDTIK SYFGQYPTTF ENTELITKKD IVEEVEKMGF KMMLSEGTNR
NLNGRSPNYV YKLKGHEIRM LFRNYTLSDD IAFRFSNPNW DQYPLTASKY ADWISRSEGN
VGLIFVDYET FGEHHREQTG ILEFLKWLPI ELNSKGVEMM MPKEVYNDVY DEIEIAHTTS
WADIEKDEKS WLGNIMQWAY DDAVRRAEMP SRELGNEYLR VWRYFTTSDN YYYLYLGHGS
PAEVHSYFNA FGSPIDAFIN EFYAISTFIH EEISKLNIKN EPYIFILGDK RASIAWNEKE
FMEIVMRDER FKTHLKNLRL WLGNEKD
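
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal Python sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard table (the two differ in the permitted start codons). The `translate` helper is hypothetical, and the example covers only the first and last displayed lines of the gene sequence, both of which begin on a codon boundary because each full line holds 60 bases.

```python
from itertools import product

# Codon table built in TCAG order. Translation table 11 uses these same
# codon-to-amino-acid assignments as the standard table; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAAAGAG TAATAGTAGG ATTTGAAGTT CACCAACCAT TCAGGATTAG AAGAGATTTC"
last_line = "TGGTTAGGAA ATGAAAAGGA TTGA"
print(translate(first_line))  # MKRVIVGFEVHQPFRIRRDF
print(translate(last_line))   # WLGNEKD
```

Both outputs match the corresponding start and end of the protein sequence listed above.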