Gene Ssol_1864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1864 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1655453 
End bp1656898 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content35% 
IMG OID 
ProductUbiD family decarboxylase 
Protein accessionACX92076 
Protein GI261602473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTTA AAGATCTCAG AGAATATATC GAATTTATGA AAAAGAAAGG TAAATTAATT 
GAAGTAGATG ATGAAGTAAG CGTTGATTTA GAAATAGCTG AAATAACGAG AAAAGCAACT
TATGCTCATT TACCCCCTCT TTTATTCAAG AGAGTTAAAA ATTATGAGAA TTGGAAAATA
GTTTCCAATA TTTTTTACTC AATTGAAAGC TTGTACGAGA TTTTTGGAAC GAATAAACTA
GAATCAATAT CCGAAGGATT TTTATCAAAT TTATCCAATA TGCCTATCAC ATTTTTTGAT
AAAATAAAAT CACTTAGAGA AATTTTGGGA TTAGGAAAAG TAATGCCTAA AGCTAAGTCA
CCTAGTTTTA AAGAGGAAAA GAACTTAGAT CTGACTAAGA TTCCTGCAAT AAAAACTTGG
CCTAAAGATG CCGGAAGATA CCTTACTTTT TCCATAACAA TAACAAAGGA CCCAGAGACA
GATGTACATA ATCTCAGCGT TTATAGGGTT CAAATTCTAA ACGAGAAGGA GGCAATAATT
CATTGGCAAG CATTTAAAAG GGGTGCGCTT ACTGCTAAAA AATATTTAGA AAAAGGTATT
AGTAAGATAC CTATTGCCGT AGTAACCGGA GTAGATCCTG CCATAGCATT TACAGCGGCT
TCTCCAGTCC CTCATGGAAT CGATAAGTAT ATGTTCGCGG GAATCTTGAG AGGTGAGGGT
ATTGACGTAG CTGAATTAGA TAATCAATTA CTAGTACCAA GCCATTCAGA AGTAGTTTTA
ACTGGTTATG TTGACTTGAA TGATATGCGT CTAGAGGGTC CCTTTGGAGA TCATATGGGT
TATTATACAC CTGCAGATTA CTATCCAGTT TTCAAATTGG AAAGAGTATA TATTAGAGAA
GATCCTATAT TTCATGTAAC ATCAGTTGGC AAACCACCAC TTGAGGATGC TTGGATAGGT
AAGGCTGTAG AAAGAATATT CTTACCTTTT GCCAAGATGT TAGTTCCAGA ACTTATTGAC
ATGAACCTAC CAGAATATGG ATTGTTTACC GGGATTGGGA TCTTCTCTAT AAAGAAGTAT
TACCCCGGCC AGGCCAAGAG AGTTATGATG GCTTTATGGG GTACCGGCCA ACTAAGCCTT
TTAAAAATAA TAATAGTTGT TGATCAAGAT ATAGATGTTC ATGATATTAA TCAAGTTATT
TATGCTATTG CAGCTAATGT AGATCCTAAA CGCGATGTTT GGGTAATAGA AAATGCACTT
ACTGACTCAT TAGACCCTAG TGTTCCATTT CCACCATTAG GTAGCAAACT AGGTATAGAT
GCTACTAGGA AATTTAAAGA AGAAATGGGG AAAGAATGGC CAGAAGAGGT TAGATCAGAT
GAGGTAGTAG CTAAAAAAGC GGACCAAATA TTGAATAAAA TTATAAAGAG ATATCAAACC
TCCTAA
 
Protein sequence
MAFKDLREYI EFMKKKGKLI EVDDEVSVDL EIAEITRKAT YAHLPPLLFK RVKNYENWKI 
VSNIFYSIES LYEIFGTNKL ESISEGFLSN LSNMPITFFD KIKSLREILG LGKVMPKAKS
PSFKEEKNLD LTKIPAIKTW PKDAGRYLTF SITITKDPET DVHNLSVYRV QILNEKEAII
HWQAFKRGAL TAKKYLEKGI SKIPIAVVTG VDPAIAFTAA SPVPHGIDKY MFAGILRGEG
IDVAELDNQL LVPSHSEVVL TGYVDLNDMR LEGPFGDHMG YYTPADYYPV FKLERVYIRE
DPIFHVTSVG KPPLEDAWIG KAVERIFLPF AKMLVPELID MNLPEYGLFT GIGIFSIKKY
YPGQAKRVMM ALWGTGQLSL LKIIIVVDQD IDVHDINQVI YAIAANVDPK RDVWVIENAL
TDSLDPSVPF PPLGSKLGID ATRKFKEEMG KEWPEEVRSD EVVAKKADQI LNKIIKRYQT
S