Gene Ssol_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_1999 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp1793671 
End bp1795644 
Gene Length1974 bp 
Protein Length657 aa 
Translation table11 
GC content38% 
IMG OID 
Productformate dehydrogenase, alpha subunit 
Protein accessionACX92209 
Protein GI261602606 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGTTA GGAAAACCAT ATGCCCTTTC TGTGGTGTAG GTTGTGGTCT AGATTTTTAT 
GTAGAGAATA ACTTTATATT TAGGGTATCT CCATCACAAG AGCACATTGT TAGCAGAGGA
CACGTTTGCG GTAAGGGCGC TGTTGCTTCA GAAGTAATTT ACGCCTGGGA CCGTTTAACT
TATCCTTTAA AGAGAGTTAA GGACACTTTC GTTAGGACTA CTTGGGATGA AGCAATTAGC
GATATTGCTA GCAAGTTAAA GGAAATTAGG AGTAAGTATG GCTCAGAGGC TATTGCCTTT
TACGGGGGTT GTCAGAATAC GTTGGAGGAA GGATATTCGT TCATGAAGTT GGCTAGAGCT
TTGGGTACTA ATAATGTTGA CTCGTGTGCT AGGGTTTGTC ATGAGCCCTC TGCAATGGCT
TTAAAGGAGT TAGTCGGTAT TGGAGCTTCT TCTGTTACCG TTTCTGAAAT TTTGAATGCT
AGAAATATTG TAATTTCTGG AGAATCTGTG ACTGATAGCC ATCCCGTTTT GTCTCAATAT
TTAGTTGAGG CTAAGAGAAA GGGTGTGAAA ATTGTAGTTA TTGATCCTAG GATGACTGGT
ACTGCTAGGA TTGCTGATTT GTTTTTACAG ATTAGTAGTG GTACTGATAT TTATCTATAT
AACGCTGTTG CGAATTATTT GATAAGTAAC GGATTATATG ATAGTAAGTT CGTTAAGGAA
CGTGTTGAGA ACTTTGATGA GTTTAGGGAG ATTGTAAAGT CTTATACTAT TGAGGAAGCC
GAGAGGATTA CTAGTGTTAG TAAGGATAAG ATTATTGAGT TCGCTAGGAT TATTGCAAAT
AAGCCTACAA TACTCTCATG GGGTTTGGGT TTAACTCAGT CAAGTGGCGT TAATGCCGTA
AAGGCTTACA TTAACCTAGC ATTGCTTACT GGGAATGTTG GTATTAATGG TGGGGGACTA
TTAGTTTATA GGGGTCAAAC AAATGTTCAA GGTTCTGGTG ATTTAATAAA GCCAGATGTT
TTCCCAAATG GTCCAATGAA TGAGGAAAAC GCTAAGGAGT TGAGTAAGAT TTGGGGTTTT
GTTCCTCCAA TCAAGAAGGG TTTGAGCATA ACTGAGGCTT TTCTAAGGGA TAGTAATGTT
AAGGCACTTT TTCTAATGAA TTACAATCCA GCTTTTAGTT TGCCAAATAG ATACAAGGTT
ATTAAGTTCT TGAAGTCGTT GGAATTGTTA GTAGTAATGG ATCCGTTTAT GACTGAGACT
GCTAAATATG CACATTACGT TTTGCCTACT CCTTTATGGG CTGAAAAGGA GGGTTCAGTT
ACTAATTTAG ATCGTACTGT TAAGTGGAGG TTTAAGGTTG TTGATCCTCC TGGGGAGGTT
AGGAGTGAGT TGTGGATAAT TAAGAGGATT GCTGAGAAGT TAGGTCTTAC TGGTTTTCAT
GACGATCCTA AATTGGTGTT TAAAGAGATA AAGGAGGTTG CAAAGTTGTA CTCTAATTTG
ACCCTTGATG AATTGATGGA TTACTCTGTA GACTCCAGGT ATCCTGATCA CGAGTCTAGT
CTGTACAAGG ATAGGTTTAT GACTCCTAGC GGTAAGGCTA AGTTTGGATT AGTTAGGTAT
AACGAAATAT CTGGTGATAG TTACATTTTA ATTACTGGTA GAGTTGTGAC TAGGTATAAT
TCTGATGAGT TGATCAAGAG AGTCCCAGGA TATAGGAATT TCAGTTCTGA TTTGTTGATA
AACCCAGAGG ATGCTACAAA GCTGAATATC AAAGACGGGG ATATGGTTAA GGTTGTTTCT
AAATGCGGCA TGGCTGTGAT GAAAGCCAAG CTAACTAACG AGGTTAAGGT TGGTCATACG
TTTGCTTATA TGCATGATTA CTATGTAAAC AACGTTGTCT GTGACGATTT AGATGATATT
TCTAAGACTC CAAGATATAA GATAACCTTT GTCAAGATTG AAAAATTGGG ATAA
 
Protein sequence
MEVRKTICPF CGVGCGLDFY VENNFIFRVS PSQEHIVSRG HVCGKGAVAS EVIYAWDRLT 
YPLKRVKDTF VRTTWDEAIS DIASKLKEIR SKYGSEAIAF YGGCQNTLEE GYSFMKLARA
LGTNNVDSCA RVCHEPSAMA LKELVGIGAS SVTVSEILNA RNIVISGESV TDSHPVLSQY
LVEAKRKGVK IVVIDPRMTG TARIADLFLQ ISSGTDIYLY NAVANYLISN GLYDSKFVKE
RVENFDEFRE IVKSYTIEEA ERITSVSKDK IIEFARIIAN KPTILSWGLG LTQSSGVNAV
KAYINLALLT GNVGINGGGL LVYRGQTNVQ GSGDLIKPDV FPNGPMNEEN AKELSKIWGF
VPPIKKGLSI TEAFLRDSNV KALFLMNYNP AFSLPNRYKV IKFLKSLELL VVMDPFMTET
AKYAHYVLPT PLWAEKEGSV TNLDRTVKWR FKVVDPPGEV RSELWIIKRI AEKLGLTGFH
DDPKLVFKEI KEVAKLYSNL TLDELMDYSV DSRYPDHESS LYKDRFMTPS GKAKFGLVRY
NEISGDSYIL ITGRVVTRYN SDELIKRVPG YRNFSSDLLI NPEDATKLNI KDGDMVKVVS
KCGMAVMKAK LTNEVKVGHT FAYMHDYYVN NVVCDDLDDI SKTPRYKITF VKIEKLG