Gene Ssol_1234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1234 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1150083 
End bp1151414 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content31% 
IMG OID 
Productdomain of unknown function DUF1743 
Protein accessionACX91472 
Protein GI261601869 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.640745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATATG TAATAGGGAT TGACGACCAC GATTCCTACA AGTTTGGATG CACTACACAT 
TTTTCTGTAA TTCTAACCTC CTATTTATAT AAAAATCATA GTACTATTTT ATTAGACTTA
CCTTACCTAG TTAGATTAAA CCCCAATATA CCTTGGAAGA CAAGAGGCAA TGCGAGCATA
AAACTTATTG TTGAGTTTAA TGGAACAAAA AAAGAACTAG CAGATATAAT TTTTTCATAT
TCTATTGAAT ACGTTAAAGA TGTTTCACTA GCACTAGAGC ATGGAAGAAA ACCTGGAATA
GCAATTATGG AATATGATAA ATATATCACT TTATTCGACA AATTATACGA TTTCTATATT
AAGGGAATAT CAGATATCAT TCCTATTGAC TATGCGAAAA AATTCGCTGA GAAAAACGAT
ATAGAACTTA GAGGAGATAG AGGTATTATT GGAAGTATTG CTGGGCTAGG AATGAGTGGG
GATTACACAT ATGAATTAAT TACTTACAGG AAAAAAGAAA ATTGGCTCAA AAAGAGAATG
ATAAATAAAG ATTCAGTAAA GAGGGTTGAT GAATCAACGT TTCCGTTAAC ATTTGCAAAT
TACGATTACA TAAATGACAC TCCCCTTATA ACTCCACATG GAACTGATCC AATCCTATAT
GGAATTAGAG GAGCCTCCAT ACAACATCTA ATTAAGGCTA TGGAACTAAT AGAGTCAAAT
GAGGATATTG ATTTCTTTGC CATTTTTAAG ACCAATCAAA GTACTGATAT TCACTTCCAA
AAAATCGGTA ACCGTTTCTA CCAGGAAACT AAGAAAGTTG TACAAGTAAA AAATGTAAGG
ATACTTGAAG GTGGAGATGT AATAGTTGAA ACTACTGATA ATGACATATT ATTTGTGTAT
AAAGAAACTG GGGAGTTAAA TAGTGCAGCT AAATTATTAA AAAAGGGTGA CGAAATAGTA
GCTTATGGAG CCGTAAAACC ATCCATAGCT TATGGAAAGA TCATAGAGCT GGAGAGGTTT
GAAATATTAA AATTATACGA TTTAGAGTTA GTCAATCCCA AATGTCCCAG ATGTGGCGGA
TCTACAAACT CTCTAGGAAA AAATAAGGGA TATAGATGTA AAAAGTGTAA ATATATTATA
AATACAACTA ATAAGAGTAC AAAAAATATA ATGAGAAACT TATCATTAGG AATGTACCAA
ACTAGATCTT ACAGACATCT TACTAAGCCT ATATTCTTAG AACTAGAAAA CAATAAACCA
AGTTTTTATG AGGAGAGAAA GTTCCTAGAG ATGTATAGAT CAACATTATA TAAGCTTGAT
TATCATCTAT AG
 
Protein sequence
MKYVIGIDDH DSYKFGCTTH FSVILTSYLY KNHSTILLDL PYLVRLNPNI PWKTRGNASI 
KLIVEFNGTK KELADIIFSY SIEYVKDVSL ALEHGRKPGI AIMEYDKYIT LFDKLYDFYI
KGISDIIPID YAKKFAEKND IELRGDRGII GSIAGLGMSG DYTYELITYR KKENWLKKRM
INKDSVKRVD ESTFPLTFAN YDYINDTPLI TPHGTDPILY GIRGASIQHL IKAMELIESN
EDIDFFAIFK TNQSTDIHFQ KIGNRFYQET KKVVQVKNVR ILEGGDVIVE TTDNDILFVY
KETGELNSAA KLLKKGDEIV AYGAVKPSIA YGKIIELERF EILKLYDLEL VNPKCPRCGG
STNSLGKNKG YRCKKCKYII NTTNKSTKNI MRNLSLGMYQ TRSYRHLTKP IFLELENNKP
SFYEERKFLE MYRSTLYKLD YHL