Gene Ssol_1056 details


Gene Information

Locus tag: Ssol_1056
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 986064
End bp: 987362
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 34%
IMG OID:
Product: AIR synthase related protein
Protein accession: ACX91299
Protein GI: 261601696
COG category:
COG ID:
TIGRFAM ID:
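
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (the record does not state its convention) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(986064, 987362)
print(length)                  # 1299
print(protein_length(length))  # 432
```

Under these assumptions the computed values agree with the listed gene length (1299 bp) and protein length (432 aa).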


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0908596
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTAT ATTTTGCGTT AATGGATCTA GAGGGAATAG TAAGAAGACT TTATCCGGAC 
ATTGAAAAAG CAAAAACTAA ACTAATAGAG GAAATAAGGT TCTATAAGAG TGATAGGTAT
AATGCAGAAG AAATCTCTAA CGTTATTTTG ACTGAAGTTA TTAATTCCAT GAAGGCCGAT
GATAAATTTG GTTTTCCTAA AACTAACATA AGAGCTGGGG AAGCCGGATT AGGATCTAGA
GGTATAGGTG ATAATTTAAT ACATACTAAG TTATTTGAGT TAGCTGGTAA GCAAATCGAA
GAATATGATG ATGCTGGAAT TAGGGAAAAC GTAGTCGTCT CCATAGATGG TATCCATTCT
AGGCTATCTT ATTTTCCGTT CCTAGCCGGC TTTTATGCTA CTAGAGCAAC ATTGCGGGAT
ATTATGGTTA AGGGGGCATA CCCATTAGGT CTAATTGTGG ACATACACCT TTCTGATGAT
AGCGATATAG GAATGTTGAT AGACTTTGAA GCAGGTGTAA CTACTATTAG TAAAGCTTTA
GATATCCCAA TACTGGCTGG TAGCACATTA AGAATAGGTG GCGATTTAGT TTTAGGAGAT
AGGATAAGTG GTGCTGTAGG TTCAATAGGA ATATTAAAGT CTAAGGATTT CTTAAGGAAA
AGAGTGAAGA AGGGGCTAAA AATACTCATG ACTGAAGGAA ATGGCGGTGG AACTATTGCA
ACTACAGCAA TATATAATGG TATGCCTGAT GTAATTTCGG AGACGCTTAA CATAAAAGAC
TTACTTACTT GTATTATAGT TAGAGATTAT TTGTCAAGCG ACGTTTATTC TATGACAGAT
GTTACTAACG GAGGGATACG TGCTGATGCT CTGGAGGTTT CGAAGATAAC TAACTTAAGT
TTTGTAATAG ATGAGGAGAA GTTTCTTTCA TTAATAAATC CAAAGGTTAG AAAGATGCTG
AATGAGATTA ACATAGATCC TTTTGGTATA TCAATTGATT CAATACTTAT ATTTACTGAT
AATCCAGATT TAGTTAAAAA GAGGTTGGCT CAATATAATA TTAGAAGCGA AATTGTAGGT
TATATAGATG ATTTCAGACA ATATCCTATA ATTACATATG ATGGGAAAGA ACTTAAGCCA
CAATTTAGAG AAAGTCCTTA TACTCCTATA AAGAAATATT TTGGGAATTA TTCACCTTAT
AGCATAGAAT ATATATCAAA CCGTTTAGAT TATGCTATAG CTGAGGCTAA AACAAAAATG
GATAACGTAT TGAAAAACTT AAAAACTAGT TTTACATAG
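
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any analysis. As a rough sketch (not part of the original record), the Python below strips the whitespace and computes the percentage of G and C bases, which is how a figure such as the GC content field could be reproduced. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGGCATTAT ATTTTGCGTT AATGGATCTA GAGGGAATAG TAAGAAGACT TTATCCGGAC"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence text to `gc_percent` gives the value to compare against the listed GC content.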
 
Protein sequence
MALYFALMDL EGIVRRLYPD IEKAKTKLIE EIRFYKSDRY NAEEISNVIL TEVINSMKAD 
DKFGFPKTNI RAGEAGLGSR GIGDNLIHTK LFELAGKQIE EYDDAGIREN VVVSIDGIHS
RLSYFPFLAG FYATRATLRD IMVKGAYPLG LIVDIHLSDD SDIGMLIDFE AGVTTISKAL
DIPILAGSTL RIGGDLVLGD RISGAVGSIG ILKSKDFLRK RVKKGLKILM TEGNGGGTIA
TTAIYNGMPD VISETLNIKD LLTCIIVRDY LSSDVYSMTD VTNGGIRADA LEVSKITNLS
FVIDEEKFLS LINPKVRKML NEINIDPFGI SIDSILIFTD NPDLVKKRLA QYNIRSEIVG
YIDDFRQYPI ITYDGKELKP QFRESPYTPI KKYFGNYSPY SIEYISNRLD YAIAEAKTKM
DNVLKNLKTS FT
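
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is an illustration rather than the database's own method. It builds the standard codon table in NCBI ordering, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs in the set of permitted start codons, which this sketch ignores), and translates up to the first stop codon. All names are hypothetical, and bases other than A, C, G and T would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied as printed.
print(translate("ATGGCATTAT ATTTTGCGTT AATGGATCTA"))  # MALYFALMDL
```

The output matches the first ten residues of the protein sequence listed above.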