Gene Ssol_2452 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsol_2452 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSulfolobus solfataricus 98/2 
KingdomArchaea 
Replicon accessionCP001800 
Strand
Start bp2256498 
End bp2258039 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content38% 
IMG OID 
ProductHydantoinase/oxoprolinase 
Protein accessionACX92601 
Protein GI261602998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.79439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAATTA GAATAGGTAT TGATATTGGT AGTACTCATA CAGATGCAGT AGCATTAGAA 
GGTAAAGAGC TAATAGTAGC TGACAAAGTA ATGACTACAC CAGACCTAAC TACTGGACTT
TTAAATGCCA TAAGTAGAGT GATGAAAAAG CTTGGAGAAA GGAAAAACGA AGTAGATACG
CTAATGATAG GAACTACTCA CGGTCTGAAC GCCTTACACC AGGGTAAAGG CTTAAATAGA
GTAGCGACCA TTAGAATTGG CTTACCTGCA GGAGAGGGAG TTCCTCCAGT ATTTGACTGG
CCAGAGCAGT TATCAAACTT TGTCACCTAT AGATATATGG TAAGAGGAGG CCATGAATAT
ACCGGGGAAG AAATAGTGGA GTTAGATGAG GGCAAAATAA AGGAGATTGC TGAAGCCATA
AATGGTAAAG TTGATGCCAT AGCTATTAGT TCAATATTTT CAGTTGTAAA TTCGTCACAT
GAGATTAGAG CGAGGGAGAT TTTAAGAGAG AAAGGAATTA ATGTGCCTAT AGTACTTTCT
CACGAAATTG GTGGAATAGG ACTGTTAGAG AGGGAGAACT CAGCGATCCT AAATGCGTTA
ATACTTAAAA TCTTCGATAA CTTAATAAGC AAAATCAAAC AGTTACTTTC TTCTTTAGGT
ATAGAAGATG TGAGACTATT CTTTGCACAG AATGATGGGA CTGTGGCCTC TGAAGATTTC
ATCAAAAGCT ATCCAATATT CACTGTAGCT GGACCAGTTT CAAATAGTAT TAGAGGAGCG
CATTTACTGA CTGGGATAAA AGATGCAATA GTAATGGATG TAGGAGGGAC TACAACAAAT
GTGGGTGTTC TCCATGAGGG ATATCCTAGA GAATCCTCAT CTGTAGTAGA AATAGCCAAA
ATAAGGACTA ATTTTAGAAT GCCCGACATT TATACGATGG CATTGGGAGG AGGCACCATA
GTTAATAAGG AGAAAATAGG ACCAGAGAGT GTGGGTTACG CACTGATAAA TAAGGGAATA
TCATGGGGAG GTGATACTTT AACCGCAACA GATGTAGCTA TGATAGTGAA AGGAATAACA
ATAGATGGTA CAAATCCGAA GCTAGTAAAC AACAAATTCC CTATGGAGTA CTTATTTAGC
GCATACACTA AAATGGTGGA AATGTGGGAA GACGCCATAG ACTTAATGAA AACTTCAAAG
GATGACGTAA CGGTAATTGT TGTGGGTGGG GGAAGTATAA TGGTCCCAGA GAAGCTAAAA
GGTGCGATGG AAGTTATAAG GCCACGAAAT GCCCAATACG CTAATGCCAT AGGTGCGACA
TTAACTAAAG TTGGTGCAAC GATAGAAAGG ACATTCTCTT ATGATCAAAT AACTAGGGAA
AATGCAATAA AGAGTCTAAT TAATGAGGCT AAAAGTTTAG CCATAAGAGC TGGGGCCTTA
AATACAACGA TAGAAGTTAG AGAAATAGAA GAAATACAAA TACCTTATCT ACCTGGAAAT
TCAGTGAAAG TAAAAGTTAA GGTAGTTGGT GAATTTTCTT AA
 
Protein sequence
MRIRIGIDIG STHTDAVALE GKELIVADKV MTTPDLTTGL LNAISRVMKK LGERKNEVDT 
LMIGTTHGLN ALHQGKGLNR VATIRIGLPA GEGVPPVFDW PEQLSNFVTY RYMVRGGHEY
TGEEIVELDE GKIKEIAEAI NGKVDAIAIS SIFSVVNSSH EIRAREILRE KGINVPIVLS
HEIGGIGLLE RENSAILNAL ILKIFDNLIS KIKQLLSSLG IEDVRLFFAQ NDGTVASEDF
IKSYPIFTVA GPVSNSIRGA HLLTGIKDAI VMDVGGTTTN VGVLHEGYPR ESSSVVEIAK
IRTNFRMPDI YTMALGGGTI VNKEKIGPES VGYALINKGI SWGGDTLTAT DVAMIVKGIT
IDGTNPKLVN NKFPMEYLFS AYTKMVEMWE DAIDLMKTSK DDVTVIVVGG GSIMVPEKLK
GAMEVIRPRN AQYANAIGAT LTKVGATIER TFSYDQITRE NAIKSLINEA KSLAIRAGAL
NTTIEVREIE EIQIPYLPGN SVKVKVKVVG EFS