Gene Ssol_2754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2754 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2520254 
End bp2522239 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content38% 
IMG OID 
Productprotein of unknown function DUF608 
Protein accessionACX92838 
Protein GI261603235 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.127114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAT ACACTGACAA GGATAGGATA GTGGCTGGAG TTCCACTGGG AGGAATTGGC 
ACTGGAAAAC TGGAAATCGA CAACAAGGTG AGAATAATCA ACGTTACGAT AAGGAATAAT
TGGGGCAATC CCATTAAGCT TTTAAGAGGA TTCCACGTAT TCATAAAACC TAAAGACAAG
AGGGGATTCA TCTTCCAAAA GGACGGTGGA ATATATAAGG TAAGTGAGTT TCCAGGGGAA
ATTATTTATG AGGGAAAATA TCCAGTTGTC AAAGTAGTGG GAAAGAGTAA TGAAGTTGAA
GTGGAATTGG AAGCGTTTTC GCCAATAATC CCCAATGACT TGAAGAACTC ATCCTTACCT
GCAATAGGAA TTAGTTTAAA AGTTAAGGGT ATTAAGGAGG GAAAGGTAGC AATATCCTTC
CCAAACATTG TTGGTAGTGT GAGTATTGGG AGGGTAAATG AAAGTATAAG GAATGGCGTT
ATTCATAAGA ACTTAAAGGC AAACGATTAC GATCCGGCAA AGGGAAATAC TACGTTAATA
TCAAACAACG TGAGTGATAT TATAACGCAA TACAACCTTA AGAGGAAACC AAGTGAGACG
ATGAACTTTT GGCTATCACA ATACGAAAAC GAGGAGCCTT GGGTTAAATT GAATAGGGGA
GAGGAAATTG AGGACAATCC TCACGAGGTA ACTGGTCATA GGGATGATCC AGCAAGTATT
ATAATATCAG AAGGGGAGGA AATGAGATAT GTGTTTGCAT GGTATTTCAA TGGTAAACAC
GTCTTTTACC CCTATGGGCA TTACTATGAG AACTTCTTTA AGGATTCCTC AGAAATTGCG
AAATATTTCT TGGACAACTT TGATCACTTG AGAAAGGATA TATTTCACAA TATTGTAAAT
GTGAAGGAGG AGTGGTTAAG GGATGCAATA ATAAATAGTT CATACATTCT ATCCTCCAAT
ACTTGGTTAG ATGAGAAGGG TAGATTTGCA ATTTATGAAG CCCCGCAGAA TTGTCCATAT
TTAGGTACAA TTGGTACCTG TTATGAGTTT GGCTCCCTAC CAGTGATTTT AATGTTCCCA
GAGTTGGAGA AGTTGTTTTT AAAGCTATTG ATTAGTTACG TAAGGAATGA TGGTTATGTT
CCCCACGATC TGGGTTTTCA CTCCTTGGAT TCTCCCATTG ATGGCACTAC TTCTCCTCCT
AAGTGGAAGG ATATGAATCC TAGCTTAATA TTATTGGTTT ATAGGTACTT TAAGTTCACT
AACGATATCG ACTTCTTGAA AGAGGTTTAC CCAACTATAG TTAAAGTCAT GGATTGGGAG
TTAAGACAGT GTAGAGACGG CTTGCCTTTC ATGGAAGGAG AAATGGATAA CGCATTTGAC
GCTACCATAA TTAAGGGCCA TGATAGTTAC ACCTCTTCAC TTTTCATAGC TTCTTTAATT
GCAATGAGAG AAATTGCAAA GTTAGTTGGC GACAGCAATT ATGTTGGTTT TATTAATGAA
AAGTTAAATG TTGCTAGAGA AGCGTTTAGG AAAATGTTCA ACGGTAAGTA TTTCAAGGCA
TGGGACGGTG TTGATAAGGC TTCATTTCTT GCCCAACTAT ACGGTGAGTG GTTTACTACT
TTATTGGAAT TGGAAAATAT TGTCGATGAA AATATGATAA AGAGTGCTTT AGAAAGTATT
ATAAGACTTA ATGGTAATGC CTCTCCCTAT TGTGTTCCCA ATCTAGTTGA TGAAAACGGT
AAGATTGTTA ACTTGAGTGT TCAAACTTAC TCCTCTTGGC CTAGACTAGT ATTTGCCATT
TGCTGGTTAG CTTATAAGAA GGGTGTCGGT GATCTAAGCT TTTGCAAGAA GGAATGGGAT
AACTTGGTGA GAAACGGTAT GGTATGGGAT CAGCCGTCCA GGATAAATTG TTATACTGGG
AAGCCTGAAA TTAACTATCT AGACCATTAT GTAGGCAGTC CTAGTCTTTG GAGCTTCCTA
TTTTAG
 
Protein sequence
MVKYTDKDRI VAGVPLGGIG TGKLEIDNKV RIINVTIRNN WGNPIKLLRG FHVFIKPKDK 
RGFIFQKDGG IYKVSEFPGE IIYEGKYPVV KVVGKSNEVE VELEAFSPII PNDLKNSSLP
AIGISLKVKG IKEGKVAISF PNIVGSVSIG RVNESIRNGV IHKNLKANDY DPAKGNTTLI
SNNVSDIITQ YNLKRKPSET MNFWLSQYEN EEPWVKLNRG EEIEDNPHEV TGHRDDPASI
IISEGEEMRY VFAWYFNGKH VFYPYGHYYE NFFKDSSEIA KYFLDNFDHL RKDIFHNIVN
VKEEWLRDAI INSSYILSSN TWLDEKGRFA IYEAPQNCPY LGTIGTCYEF GSLPVILMFP
ELEKLFLKLL ISYVRNDGYV PHDLGFHSLD SPIDGTTSPP KWKDMNPSLI LLVYRYFKFT
NDIDFLKEVY PTIVKVMDWE LRQCRDGLPF MEGEMDNAFD ATIIKGHDSY TSSLFIASLI
AMREIAKLVG DSNYVGFINE KLNVAREAFR KMFNGKYFKA WDGVDKASFL AQLYGEWFTT
LLELENIVDE NMIKSALESI IRLNGNASPY CVPNLVDENG KIVNLSVQTY SSWPRLVFAI
CWLAYKKGVG DLSFCKKEWD NLVRNGMVWD QPSRINCYTG KPEINYLDHY VGSPSLWSFL
F