Gene Ssol_1233 details

Gene Information

Locus tag: Ssol_1233
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1148357
End bp: 1149550
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: orc1/cdc6 family replication initiation protein
Protein accession: ACX91471
Protein GI: 261601868
COG category:
COG ID:
TIGRFAM ID:
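
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming one stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1148357, 1149550)
print(length_bp)                  # 1194
print(protein_length(length_bp))  # 397
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields of the record.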


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGATA TAATTGATGA GGTCATTTCT TCATTCAAGA CATCAAGCAT CTTCATAAAT 
AGGGAATATT TGTTGCCTGA TTATATCCCA GACGAGTTAC CACATAGAGA GGATCAGATA
AGAAAGATTG CAAGTATTTT AGCTCCATTA TATAGGGAAG AGAAACCCAA CAATATTTTC
ATATACGGTC TGACTGGGAC GGGAAAGACA GCCGTAGTGA AGTTTGTTTT ATCTAAATTA
CATAAGAAAT TTCTTGGTAA ATTTAAACAT GTATATATTA ATACTAGACA GATAGATACG
CCATATAGGG TATTGGCTGA TCTGTTGGAA TCACTAGATG TAAAGGTTCC ATTTACCGGG
TTATCAATAG CCGAACTGTA TAGACGATTG GTAAAAGCAG TGAGAGACTA CGGTTCACAA
GTCGTCATAG TTTTAGATGA GATTGATGCT TTCGTTAAAA AGTATAATGA TGATATTCTA
TACAAATTAA GTAGGATTAA TAGTGAGGTG AACAAGAGTA AGATATCTTT TATAGGAATA
ACTAATGATG TTAAGTTTGT AGATCTGTTA GATCCTAGAG TTAAAAGTAG TTTAAGTGAA
GAGGAGATAA TTTTCCCCCC TTATAATGCG GAAGAGTTAG AAGATATTTT GACAAAGAGA
GCACAAATGG CATTCAAGCC TGGAGTTTTA CCAGATAATG TAATTAAATT ATGTGCTGCA
CTAGCTGCAC GAGAGCATGG TGACGCGCGT AGAGCCTTGG ATCTTTTAAG AGTTTCTGGT
GAAATAGCTG AAAGAATGAA AGACACTAAG GTTAAAGAAG AGTATGTGTA TATGGCTAAG
GAAGAAATAG AGAGAGATCG AGTAAGAGAT ATTATATTAA CTCTTCCTTT TCACTCTAAG
TTAGTTCTTA TGGCAGTTGT TTCTATATCC TCCGAAGAAA ATGTAGTTTC AACTACTGGT
GCTGTATATG AGACTTATCT GAACATTTGT AAGAAGTTAG GTGTAGAAGC TGTTACTCAA
AGAAGAGTTA GTGATATTAT AAATGAATTA GATATGGTAG GGATACTAAC AGCCAAGGTT
GTTAACCGGG GTAGATATGG CAAGACTAAG GAGATAGGTT TAGCTGTTGA TAAGAATATA
ATTGTTAGAT CTTTAATAGA AAGCGATAGT AGGTTTGCTG ATCTCTGGAG TTGA
 
Protein sequence
MSDIIDEVIS SFKTSSIFIN REYLLPDYIP DELPHREDQI RKIASILAPL YREEKPNNIF 
IYGLTGTGKT AVVKFVLSKL HKKFLGKFKH VYINTRQIDT PYRVLADLLE SLDVKVPFTG
LSIAELYRRL VKAVRDYGSQ VVIVLDEIDA FVKKYNDDIL YKLSRINSEV NKSKISFIGI
TNDVKFVDLL DPRVKSSLSE EEIIFPPYNA EELEDILTKR AQMAFKPGVL PDNVIKLCAA
LAAREHGDAR RALDLLRVSG EIAERMKDTK VKEEYVYMAK EEIERDRVRD IILTLPFHSK
LVLMAVVSIS SEENVVSTTG AVYETYLNIC KKLGVEAVTQ RRVSDIINEL DMVGILTAKV
VNRGRYGKTK EIGLAVDKNI IVRSLIESDS RFADLWS