Gene Ssol_1224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1224 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1141619 
End bp1142836 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content34% 
IMG OID 
Productthreonine dehydratase 
Protein accessionACX91462 
Protein GI261601859 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTACT TAGAATATTT TGATAGAATT AGACTAGCAA AAGAGAAAAT AGAAAAATAT 
GTGCATATTA CTCCAATAGA TTATTCTACA ACGTTTTCCA GAATTATAAA CGCAAAAGTT
TATCTTAAGT TGGAAAATCT ACAGAAAACT GGATCATTCA AAGTTAGAGG TGCCTTTAAT
AAGTTATTAT CTTTAAAGGA GGAAGAAAAA AAGAATGGCG TTATTGCAGT TTCAGCAGGT
AATCATGCTC AAGGAGTTGC TTATGCAGCC TCCACGTTAA ATATCAAATC GACTATAGTG
ATGCCAGAAA CAGCTCCAGC TTCCAAGTAT TTAGCTACAA AATCCTATGG GGCAGAAGTA
GTTCTTTATG GTAAGTACTT GCATGAGAGT ATGAAGAAAG CGGAAGAATT GATTCAAAAT
ACTGGTTTAA TATTTGTTCA TCCTTATAGT GATTTAGATG TGATAACGGG TCAAGGTACC
ATAGGATTAG AATTGTATGA TATCGAACCA GATTACGTAA TTATTCCAAT AGGGGGTGGA
GGATTGATTT CTGGTATAAG TATAGCTTTA AAGTATAGAT TCCCAAACGT CAAGATAATA
GGCGTTCAGT CTTCTTCTTC TCCTTCAATG AAGGTTTCTA AGGATCTTGG GAGGCTTGTA
GAAATAGAGC CTAGTTATTC CATAGCTGAT GGCATATTGG TTAAGTCTCC TTCTGAATTA
ACCTTTAGTA TAATTAATGA GTTAGTAGAT GATATAGTAT TAGTGGATGA TGAAGAAATA
GCTGAGGCAA TAGTTTTACT ACTTGAAAGA AGTAAAACGC TAGCAGAAGG AGCAGGAGCT
GCAGCATTAG CGTCACTAAT TTCAGGGAAG GTTAAAGTAA ATGGAATAGA CAAAAAAGTA
ATTTCATTAG TAAGTGGGGG AAATATTGAC TTATCATTAT TGTCTACTCT AACAGAGAAG
TTTTTATATA GACAAAAAAG GGTAGTCAAA GTGAGGGTAA TAGTTCCAGA TAAGCCAGGA
CAGTTAAATA AAGTATTAAG CTATGTAGTT AAGATCAGAG GTAATATAAT AGATATTGTT
CATGATAGGC ACAGTAGTGA TGTATTGCCT GGATACACTA AAATATATAT AACTTTCGAG
CTTCAGTCTT CAGAGGCTAT TACCTTACTT CTGACAAATC TGGCAAACGA GGGAATAGAC
GTGAAAATTG TAGAATAG
 
Protein sequence
MNYLEYFDRI RLAKEKIEKY VHITPIDYST TFSRIINAKV YLKLENLQKT GSFKVRGAFN 
KLLSLKEEEK KNGVIAVSAG NHAQGVAYAA STLNIKSTIV MPETAPASKY LATKSYGAEV
VLYGKYLHES MKKAEELIQN TGLIFVHPYS DLDVITGQGT IGLELYDIEP DYVIIPIGGG
GLISGISIAL KYRFPNVKII GVQSSSSPSM KVSKDLGRLV EIEPSYSIAD GILVKSPSEL
TFSIINELVD DIVLVDDEEI AEAIVLLLER SKTLAEGAGA AALASLISGK VKVNGIDKKV
ISLVSGGNID LSLLSTLTEK FLYRQKRVVK VRVIVPDKPG QLNKVLSYVV KIRGNIIDIV
HDRHSSDVLP GYTKIYITFE LQSSEAITLL LTNLANEGID VKIVE