Gene Ssol_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_0531 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp476498 
End bp477703 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content38% 
IMG OID 
Productamidase, hydantoinase/carbamoylase family 
Protein accessionACX90810 
Protein GI261601207 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCAG AAAGGTTTTT GACAACTTTC CATTCGTTAA CTAATATAGG TTGGACTGAG 
GACGGAGTAC TGAGGCTTGC TTTAAATGAA TATGATATAA AAGTAAGAGA GGAACTAATA
AAAATTCTAT CGAGTATAGG TGTTCACATA ATGGTCGATG ATGCCGGAAA TATAATTGGA
GAATTAGGTG GTAAACTAAG TGATGCTATT GCGATTGGAT CACATATGGA TTCCGTGCCT
TATGGAGGAA AATACGACGG TTTTTATGGC GTTATGGCGG GACTTGAAGT ATTACGAAGT
ATTAAAGAGA GAGGCATATC TAATCATTCT ATTAAACTTA TAGATTTTAC GAATGAAGAG
GGTTCTAGAT TTCAACCCTC ACTTCTAGGC TCGGGATTAA CCACAGGTAT CTTCGATAAA
AACTACGTCT ACTCAAGGAG AGATAAGGAT AATATAAGTT TTGAGGAAGC GTTAAGGGTT
TCCGGATTTA TGGGAGATGA AAGCAATAGA CTAATGCATA TGAAGCCTAA CTACTATCTA
GAGCTTCACA TAGAACAAGG TCCAATTTTA GAGGAAGAGG GGTATCAAAT TGGAATACCT
TTAGGAATTG CTGGTTTAAG CGTATATGAA TTCACATTTA AGGGTCAGTC TAGTCAAACC
GGACCTACAC CAATGGATAG GAGAAGGGAT GCCCTAGTAG GCGCATCTAA ATTCGTAGTT
AGCGTTAGGG ATCACGCAAA GAAGCAGGAA AACTTAAGGG CCACTGTTGG TATACTTAAT
GTTAAACCAA ATGTATACAA CGCTATACCT AGGGAAGTCA GACTCACTGT TGACGTTAGG
AGTATTGAGA GGAATAGAAT AGATCACACT ATAAATGAAT TTGTTAATAT TGCAAAAAGT
ATTGCCGACG ACGAGAAACT AGAAGTTGAA TATAGGCATC TGTGGACAGC TAATCCTGTG
AGTTTTTCCG ACGAAGTCAT TAGTGTTATA GAAAGAGCGT GTAAAGAGTT AAGCATGAGA
TATAAGTTTA TGTATAGTTG GGCAGGGCAT GATGCACAGT ATATGACGAA GATTTCTAAA
GTCGGCATGA TATTTATTCC ATCTCATTTA GGCATTAGTC ACGCAAAGGA AGAATACTCC
TCAGATGAGG ATATGTTAAA CGGGCTAAGA GTACTAGAGA AAGCTGTAGA ACTTTTAAAC
AGTTGA
 
Protein sequence
MNPERFLTTF HSLTNIGWTE DGVLRLALNE YDIKVREELI KILSSIGVHI MVDDAGNIIG 
ELGGKLSDAI AIGSHMDSVP YGGKYDGFYG VMAGLEVLRS IKERGISNHS IKLIDFTNEE
GSRFQPSLLG SGLTTGIFDK NYVYSRRDKD NISFEEALRV SGFMGDESNR LMHMKPNYYL
ELHIEQGPIL EEEGYQIGIP LGIAGLSVYE FTFKGQSSQT GPTPMDRRRD ALVGASKFVV
SVRDHAKKQE NLRATVGILN VKPNVYNAIP REVRLTVDVR SIERNRIDHT INEFVNIAKS
IADDEKLEVE YRHLWTANPV SFSDEVISVI ERACKELSMR YKFMYSWAGH DAQYMTKISK
VGMIFIPSHL GISHAKEEYS SDEDMLNGLR VLEKAVELLN S