Gene Information |
Locus tag | Pars_0524 |
Symbol | |
ID | 5056231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 472508 |
End bp | 473893 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468086 |
Product | selenium-binding protein |
Protein accession | YP_001152771 |
Protein GI | 145590769 |
COG category | |
COG ID | |
TIGRFAM ID | |
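
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 472508
end_bp = 473893
reported_gene_length = 1386     # bp
reported_protein_length = 461   # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both computed values match the reported 1386 bp and 461 aa.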
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.377432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGTC TAACCCCGGA CCCGACGCTT TATCCCACGG TTCGAGACGC TATTAAGAGC CCGCCGGAGG ATGTGGCGTA TGTGGCCGCG CTGTATGTGG GCACCGGCGT GGAGGAGCCC GATTTCCTCG CGGTGGTCGA CGTGGACCCC AACTCGGCAA CATACGGCAA GATCGTGTAC AAGCTACCGA TGCCGGATAT AGGCGACGAG TTGCACCACT TCGGCTGGAA CGCCTGCTCC TCCGCCTACT GCCCCAACGC GAGGCCTTTC CTCGAGAGGC GGTACCTCAT AGTGCCTGGG CTTAGGTCCT CCAGGATATA TATCGTGGAT ACGAAGCCGG ACAAGAGGAA GCCCTCTATA CATAAAATAA TCGAGCCCAA AGTCGCCGTG GAGAAGACCG GCTACACGAA GTACCACACG GTCCACTGCG GCCCAGACGC CATATACATA TCGGCGCTGG GCGGGCCAGA CGGGAGGGGA GGCCCCGGCG GAGTCCTGTT GCTAGACCAC AACACCTTCG AGCCGCTGGG CCGGTGGGAG GTCCACAGAG GGCCGCAGTT CTACGCCTAC GACTTCTGGT GGAACCTCCC CTCCGGCGTC ATGATCTCCA GCGAGTGGAC GACGCCGGAG TGCTTCGAAA ACGGCTTCAG CCCCGAGTGC CTCGCCGCCG GTAAATACGG CCACAAGCTC CACGTCTGGG ACCTCGGCAG ACGTAGGCAC CTCTACTCCA TAGACCTGGG GGAGGAGCAC CGCATGGTGC TTGAGGTGAG GCCCCTCCAC GACCCCACCA AGCTGATGGG CTTCGTCAAC GTGGTCCTCA ACACGAAGGA CCTCTCCAGC TCCATATGGC TCTGGTTCCC AGAAGACGGC AGGTGGCACG CCGAGAAGGT GATAGAGATA GAGGCGCAGC CCTCCGAGGG CCCCCTCCCG CCGCCGCTGA AGGACCTGAA GACCGTGCCT CCGCTGGTCA CCGACATAGA CGTCTCTCTG GACGACAGAT TCCTCTACGT CTCCCTCTGG GGCCTCGGGG AGCTGAGGCA GTACGACATC ACGAACCCCC ACCAGCCGCG GCTGGCGGGG CGGGTCAAGA TAGGCGGCAT ATTCCACAGA GAGCCCCACC CCTCCGGCGC AGAGGCGACC GGCGCCCCGC AGATGATATC AGTCAGCAGA GACGGGAGGA GGGTCTACAT AACCAACAGC CTCTACAGTA GCTGGGATAA CCAGTTCTAC CCCGGCCTCA GGGGCTGGAT GGCGAAGATC AACGTAAATC CCGAGGGCGG TTTGGAGCTC GACAAGTCTT TCTTCGTGGA CTTCGGCCAG GCGAGGACCC ACCAAGTACG CCTCTGGGGC GGCGACGCCT CCACAGACAG CTTCTGCTTC CCGTAA
|
Protein sequence | MARLTPDPTL YPTVRDAIKS PPEDVAYVAA LYVGTGVEEP DFLAVVDVDP NSATYGKIVY KLPMPDIGDE LHHFGWNACS SAYCPNARPF LERRYLIVPG LRSSRIYIVD TKPDKRKPSI HKIIEPKVAV EKTGYTKYHT VHCGPDAIYI SALGGPDGRG GPGGVLLLDH NTFEPLGRWE VHRGPQFYAY DFWWNLPSGV MISSEWTTPE CFENGFSPEC LAAGKYGHKL HVWDLGRRRH LYSIDLGEEH RMVLEVRPLH DPTKLMGFVN VVLNTKDLSS SIWLWFPEDG RWHAEKVIEI EAQPSEGPLP PPLKDLKTVP PLVTDIDVSL DDRFLYVSLW GLGELRQYDI TNPHQPRLAG RVKIGGIFHR EPHPSGAEAT GAPQMISVSR DGRRVYITNS LYSSWDNQFY PGLRGWMAKI NVNPEGGLEL DKSFFVDFGQ ARTHQVRLWG GDASTDSFCF P
|
| |
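
The gene sequence above starts with ATG and ends with TAA, so it appears to be listed in coding orientation even though the gene lies on the minus strand. The relationship between the two sequences can be spot-checked by translating the first three 10-nt blocks of the gene sequence. The following is a minimal Python sketch, not part of the original record: the `translate` helper and `CODON_TABLE` name are hypothetical, and the amino acid assignments are those shared by NCBI translation tables 1 and 11 (the tables differ only in permitted start codons, which this sketch ignores).

```python
from itertools import product

# Amino acid assignments of NCBI translation table 11 (identical to table 1),
# listed in the conventional TCAG codon order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-nt block spacing used in the record) is ignored.
    Codons containing anything other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
gene_start = "ATGGCGCGTC TAACCCCGGA CCCGACGCTT"
# First block of the protein sequence, copied from the record.
protein_start = "MARLTPDPTL"

print(translate(gene_start) == protein_start)
```

The same helper can be applied to the full gene sequence by pasting it in place of `gene_start`; the block spacing does not need to be removed first.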