Gene Ssol_1870 details


Gene Information

Locus tag: Ssol_1870
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 1662127
End bp: 1663404
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: pyridoxal-phosphate dependent TrpB-like enzyme
Protein accession: ACX92082
Protein GI: 261602479
COG category:
COG ID:
TIGRFAM ID:
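
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon (the page does not state either convention); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1662127
end_bp = 1663404

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1278
print(protein_length)  # 425
```

Under these assumptions the computed values agree with the reported gene length (1278 bp) and protein length (425 aa).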


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAAAAG AAGACGAGAT TTTGCCTAAA TATTGGTACA ATATAATCCC TGATCTACCT 
AAACCCTTGC CTCCACCAAG GGATCCACAA GGTGCCTATT TCTCGAGAAT CGATTTATTA
AGAAGTATAC TACCCAAGGA GGTATTAAGA CAACAATTCA CAATAGAAAG GTATATAAAG
ATCCCTGAGG AAGTAAGAGA TAGATATTTA TCGATAGGAA GACCAACTCC ATTATTTAGG
GCTAAAAGGT TAGAAGAGTA CTTAAAGACA CCAGCAAGAA TTTACTTTAA ATATGAAGGT
GCTACACCTA CTGGATCTCA TAAGATAAAT ACAGCAATTC CTCAAGCGTA TTTTGCAAAA
GAAGAGGGAA TTGAACACGT AGTTACTGAA ACTGGAGCTG GTCAATGGGG AACTGCAGTC
GCACTTGCAG CTAGTATGTA TAATATGAAA AGTACTATAT TCATGGTAAA GGTAAGTTAT
GAACAAAAAC CGATGAGAAG GAGTATAATG CAATTATATG GGGCTAATGT TTACGCAAGC
CCCACAAACT TAACTGAATA CGGTAGGAAG ATATTAGAGA CAAACCCACA GCATCCAGGA
TCATTAGGTA TAGCAATGAG CGAGGCAATA GAGTATGCTC TTAAGAACGA ATTTAGATAT
TTAGTAGGTA GCGTTTTAGA TGTAGTACTT TTGCATCAGA GTGTTATTGG TCAAGAGACT
ATTACTCAAT TGGATTTGTT AGGAGAAGAC GCTGATATCC TAATTGGATG TGTAGGAGGT
GGGAGCAATT TTGGCGGTTT CACATACCCC TTTATCGGAA ATAAGAAAGG CAAGCGTTAT
ATTGCAGTAA GTTCTGCAGA AATTCCAAAG TTTAGTAAAG GTGAATATAA ATACGATTTT
CCAGACTCTG CTGGATTATT ACCTTTAGTG AAAATGATAA CTTTAGGTAA AGATTACGTT
CCGCCACCAA TATACGCAGG CGGGTTAAGA TATCATGGTG TAGCACCAAC ATTAAGTTTG
TTAACAAAGG AGGGTATTGT GGAATGGAGA GAATACAATG AAAGGGAGAT TTTCGAAGCT
GCTAAGATAT TTATCGAGAA CCAAGGTATT GTACCAGCCC CAGAATCAGC TCATGCAATA
AGGGCAGTAG TTGATGAAGC TATAGAGGCA AGAAAGAATA ATGAGCGAAA GGTCATCGTC
TTTAATCTAA GTGGACATGG ATTGTTAGAT CTGTCAAATT ACGAATCCAT GATGAAAAGG
TTGAATGGAA ATGGGTAA
 
Protein sequence
MVKEDEILPK YWYNIIPDLP KPLPPPRDPQ GAYFSRIDLL RSILPKEVLR QQFTIERYIK 
IPEEVRDRYL SIGRPTPLFR AKRLEEYLKT PARIYFKYEG ATPTGSHKIN TAIPQAYFAK
EEGIEHVVTE TGAGQWGTAV ALAASMYNMK STIFMVKVSY EQKPMRRSIM QLYGANVYAS
PTNLTEYGRK ILETNPQHPG SLGIAMSEAI EYALKNEFRY LVGSVLDVVL LHQSVIGQET
ITQLDLLGED ADILIGCVGG GSNFGGFTYP FIGNKKGKRY IAVSSAEIPK FSKGEYKYDF
PDSAGLLPLV KMITLGKDYV PPPIYAGGLR YHGVAPTLSL LTKEGIVEWR EYNEREIFEA
AKIFIENQGI VPAPESAHAI RAVVDEAIEA RKNNERKVIV FNLSGHGLLD LSNYESMMKR
LNGNG
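
The relationship between the gene sequence and the protein sequence can be checked by translating the nucleotide rows. The sketch below is one way to do this in Python, not the method used to generate this page. It assumes the standard codon assignments, which translation table 11 shares with the standard code for amino acid assignments, and it strips the spaces that separate the 10-base groups. The helper names `clean` and `translate` are hypothetical. Only the first and last rows of the gene sequence are used, to keep the example short.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last rows of the gene sequence listed above.
first_row = "ATGGTAAAAG AAGACGAGAT TTTGCCTAAA TATTGGTACA ATATAATCCC TGATCTACCT"
last_row = "TTGAATGGAA ATGGGTAA"

print(translate(first_row))  # MVKEDEILPKYWYNIIPDLP
print(translate(last_row))   # LNGNG
```

The first row translates to the first 20 residues of the protein sequence, and the last row translates to its final five residues followed by the TAA stop codon. Translating the last row on its own works here because the rows before it total 1260 bases, a multiple of three, so the reading frame is preserved.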