Gene Ssol_2447 details

Gene Information

Locus tag: Ssol_2447
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sulfolobus solfataricus 98/2
Kingdom: Archaea
Replicon accession: CP001800
Strand:
Start bp: 2250779
End bp: 2252830
Gene Length: 2052 bp
Protein Length: 683 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: 5-oxoprolinase (ATP-hydrolyzing)
Protein accession: ACX92596
Protein GI: 261602993
COG category:
COG ID:
TIGRFAM ID:
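
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2250779
END_BP = 2252830
GENE_LENGTH_BP = 2052
PROTEIN_LENGTH_AA = 683


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the values listed in this record.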


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAGAG TGGGCATAGA TATAGGAGGC GCTTTCACTG ACGTAGTGGT CTATAACGAA 
GAAAACGGAG AAATAAGTTG GGCAAAGGTA GAAACCACGC CAGATGACCC GTCAAACGGC
GTTCTGGAGG CAATAGATGA AGCCAAAGTA AATTTAGGTT ATGTAAATAC TATAATTCAC
GGTCAGACAT TAGCCATAAA CACAATAGTA GAAAGGAAAG GAGCAAAGGT TGGTCTAATC
ACAACTAAGG GCTTTAGAGA TATCCTAGAA ATCCAAAGGG CTAATAGAAG AGATATGTAC
AATTTCAGAT ATAAGAAACC TACTCCCTTT GTTCCGAGGT ATTTAAGACT AGAGGTCACC
GAGAGGATAA AGAGTAATGG CGATATTTTG ACGTCTCTAA ACGAAAACGA GGTTGTTGAG
GCGATTAAAA AACTAAAAGC GGAAAACGTA GAAGCCATTG CTGTAAGTTT TATCAATTCT
TACGTTAATC CCATACATGA ACTTAAGGTA GGAGAGATTA TTAAGAGAGT TGATCCTAAT
ATTATAGTAA CGTTATCTCA TGAGGTCACT AGAGAGTGGA GAGAGTACGA GAGAACTAGT
ACCGCTGTAC TTAACGCATA CGTTATGCCA AAAATGAGTA AATACTTAAG TAAACTTGAA
AATGAATTTA AAAATAGAGG TTTTAAAGGG AATTATTTCG CTATGCTTTC CAACGGAGGT
ATGGCCACAT TCGACTACGC TAAAAGATTT CCAATATATA CCTTAGAATC TGGTCCAGTA
GCAGGAGTTA TTGGGGCAAT TAAGATAAGC GACATATTAG GAGAAAAAAA CATTATAGCA
ATGGATGGTG GAAGTACAAC AACTAAGGCT AGTCTAGTGA GAAATCTGGA ACCCAATATA
AATACTGATT ACTATGTTGG AAGAGATAAG TACAACCCAG GGTATCCAGT AAAAGTTCCA
ACATTGGATA TAGTAGAAAT AGGTAATGGT GGAACGAGTA TTGCTCGGAT TGATGAAACA
AGTAACTTAA AAGTAGGACC CAGAGCAGCT GGTGCTTACC CAGGTCCAGT AGCTTATGGG
AAAGGAGGTA AAGATGTTAC TGTGACAGAT GCTTACATAG TATGTGGATT TCTAAATCAA
GAGGAACTAC TTGGAGGGAA AATAAAGGTT AATAAAAGAC TTGCTGAGGA GGCTATTTCA
AATATTGCTA AATACTATAA CATGTCTATT GAAGAAGTCT CCTATGGTAT AGTTAAAATT
GCTAATGATA ATGCTGTAAA TGCAGTTAGG TTAATATCAG TTCAAAGAGG ATATGATCCA
AGAGAGTTTA CGCTAGTAGC ATACGGTGGC TCTGGCCCTA TGTTTGCTCC ATTTGTTGCA
GAAGAATTAG ACATAAAGAA GATAATAGTA CCATTTCTCC CAGCAGGAGT ATTCTCCGCA
TGGGGTATGC TTGTTTCAGA CATTAGGCAT GACCTTGTTT TATCCTATCC TTTGAGGATT
GATAAGGAAA GTAGTGTAGA TTTGATAAAT GAAAAATTTA ATGAATTAGA GAGCAAAATA
AGGTCCATAT TAATATCGGA GGGATTCAAA GAGAAAGATA TCATAATGCT AAGATACGCT
GAGATGAGAT ATTATGGCCA AGAACATACT GTAAAAGTGA GTGTAATGCC AGGGGAAATT
GGGAATAGGG AATTGGAGGA GATAGAAAGA AGGTTCCATG AAGCTCACGA AATCGCATAT
GCGTTTACCT TAGATAGTCC AATCGAAATA GTAAACTTTC ATGTGAGTGG TATAGTAAAA
GCTAAGACTA TTGTATTAAA GAGGATAGAG AGGGATAATT CAAGTATTGA TAAGGCGTTG
GTCGGAAAAA GAAAGGTATT CTATGATGGA AAATATGAGG AGTGGAATGT GTATAATAAA
GAATATTTAC CTATTAATTA TCAAATAGTT GGTCCAGCAA TTATAGAAGA TCCTACTTCT
ACATCGTTAG TATTAGAAGG GCAAACGGGA ATGTTAGATA GTTATGGCAA TCTAATTATT
GAGAGGGATT AA
 
Protein sequence
MIRVGIDIGG AFTDVVVYNE ENGEISWAKV ETTPDDPSNG VLEAIDEAKV NLGYVNTIIH 
GQTLAINTIV ERKGAKVGLI TTKGFRDILE IQRANRRDMY NFRYKKPTPF VPRYLRLEVT
ERIKSNGDIL TSLNENEVVE AIKKLKAENV EAIAVSFINS YVNPIHELKV GEIIKRVDPN
IIVTLSHEVT REWREYERTS TAVLNAYVMP KMSKYLSKLE NEFKNRGFKG NYFAMLSNGG
MATFDYAKRF PIYTLESGPV AGVIGAIKIS DILGEKNIIA MDGGSTTTKA SLVRNLEPNI
NTDYYVGRDK YNPGYPVKVP TLDIVEIGNG GTSIARIDET SNLKVGPRAA GAYPGPVAYG
KGGKDVTVTD AYIVCGFLNQ EELLGGKIKV NKRLAEEAIS NIAKYYNMSI EEVSYGIVKI
ANDNAVNAVR LISVQRGYDP REFTLVAYGG SGPMFAPFVA EELDIKKIIV PFLPAGVFSA
WGMLVSDIRH DLVLSYPLRI DKESSVDLIN EKFNELESKI RSILISEGFK EKDIIMLRYA
EMRYYGQEHT VKVSVMPGEI GNRELEEIER RFHEAHEIAY AFTLDSPIEI VNFHVSGIVK
AKTIVLKRIE RDNSSIDKAL VGKRKVFYDG KYEEWNVYNK EYLPINYQIV GPAIIEDPTS
TSLVLEGQTG MLDSYGNLII ERD
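
The sequences above are printed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. The following is a minimal sketch, not part of the original record, of stripping that layout and computing a GC fraction that could be compared with the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = (
    "ATGATAAGAG TGGGCATAGA TATAGGAGGC GCTTTCACTG ACGTAGTGGT CTATAACGAA"
)

print(len(normalize_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

To evaluate the whole gene, pass the full gene sequence text to `gc_fraction`; the result for a single line is not expected to match the gene-wide GC content.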