Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0097 |
Symbol | |
ID | 5707067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 108238 |
End bp | 110076 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641269623 |
Product | alkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen |
Protein accession | YP_001535023 |
Protein GI | 159035770 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0526] Thiol-disulfide isomerase and thioredoxins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.328632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000931017 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTACCG CGCCTCGCGT CCGAGCACCC GAACTGAAGG GCCGCCGCTG GCTGAACACC GGCGGACGAC ACCTGACCCT GCCGGACCTT CGAGGCCGCA TCACCGTCCT CGACTTCTGG ACCTTCTGCT GCATCAACTG CCTTCACGTG CTCGACGAGC TACGCCCCAT CGAGCAGAAG TACGCGGACG TGCTCGTGGT CATCGGCGTC CACTCGCCGA AGTTCGAGCA CGAAAAGGAC CCGGACGCCC TGGCCGACGC CGTCGAACGG TACGGCGTGC ACCACCCGGT GCTCGACGAC CCCGAACTGA ACATGTGGCA GCAGTACGCC GCCCGGGCTT GGCCGACCCT GGCCGTGATC GACCCCGAGG GTTACGTGGT GGCCACCATG GCCGGCGAGG GACACGCCGA GGGCCTGGTC CGGCTGGTGG ACGACCTGAT CGCCACCCAC GAGGCCAAGG GCACCCTGCA CCGGGGCGAC GGCCCGTACG TGCCACCCGC CGAACCGGAG ACCACGCTGC GCTTTCCCGG CAAAGCTGTC GTACTCGGCA ACGGGAACCT GCTGGTGTCG GACTCGGCCC GGCACTCCAT CGCGGAGCTG GCACCCGACG GCGAGAGGGT GGTCCGCCGG ATCGGCACCG GCGCGCGCGG CCGGGCCGAC GGGCCCGCCA CGGCGGCCAC CTTCGCCGAG CCGCAGGGGC TCTGCCTGCT CCCGGCCCAC GTCGCCCGGC TGGTCGACTA CGACCTGGTC GTCGCCGACA CCGTCAACCA CCTGCTGCGC GGCGTCCGCC TCGCCACCGG CGAGGTGGTC ACCGTCGCCG GCACCGGCCG ACAGTGGCGT TCCACCGTGG ACGACCACGC CCACGACGCG CTCTCCGTCG ACCTCTCCTC CCCCTGGGAC CTGGCCTGGT ACGACGGCCG GCTCGTGATC GCCATGGCCG GCATCCACCA GCTCTGGTGG TTCGACCCGG TGAAGCGCAC CGCCGGCATG TACGCGGGCA GCACCGTCGA GGCCCTCAAG GACGGCCCGC TGGCCGAAGC GTGGCTGGCC CAGCCCTCCG GTCTGTCGGT CTCCGCCGAC GGCAGCCGGC TCTGGGTCGC CGACAGCGAA ACCAGCGCGA TCCGGTACGT CCAGGACGGT GTCCTGAACA CTGCGGTCGG CCAGGGGCTC TTCGAATTCG GGCATGTCGA CGGGCCAGCG GCACAGGCGC TGCTCCAGCA CCCGCTGGGG GTCTGTGCAC TGCCGGACGG CTCGGTGCTG ATCGCCGACA CGTACAACGG GGCGGTCCGC CGCTACGACC CGGAGTCGGA CTCGGTGGGC ACCGTCGCCG ACGGACTTGC CGAACCGAGC GACCTCGTTC TCACCCCGGA CGGCGGGGTA CTGGTCGTGG AGTCCGCCGC CCACCGACTG ACCCAGCTCG CGCCGGGCAC GCTCACCGCC GCCGGGGCCA GCACGGTCAA CGGCCCACGG CACCGTACCG AGCGGAAGCC GACCGAACTG CGAGCCGGCG AGGTGACCCT GGAGGTCATC TTCACCCCGG CCCCCGGCCA GAAGCTCGAC GACACCTACG GCCCGTCGAC CCGGCTGGTG GTCTCGGCGT CCCCGCCGGA GCTGCTACTG GCTGGGGCGG GCACCAACAC CGAGCTGACC CGCCGGCTGG TGCTCAACGG TGCGGTCTCC GAAGGCGTGC TCCAGGTGAC CGCGCAGGCG GCCACCTGCG ACGCCGACGT GGAGCATGCC GCGTGCCACC TGACCCGGCA GGACTGGGGT GTACCGATCC GGGTGACTGA CGAGGCCGCC GACCGTCTCC CGCTGGTGCT GCGCGGCATG GACGCCTGA
|
Protein sequence | MATAPRVRAP ELKGRRWLNT GGRHLTLPDL RGRITVLDFW TFCCINCLHV LDELRPIEQK YADVLVVIGV HSPKFEHEKD PDALADAVER YGVHHPVLDD PELNMWQQYA ARAWPTLAVI DPEGYVVATM AGEGHAEGLV RLVDDLIATH EAKGTLHRGD GPYVPPAEPE TTLRFPGKAV VLGNGNLLVS DSARHSIAEL APDGERVVRR IGTGARGRAD GPATAATFAE PQGLCLLPAH VARLVDYDLV VADTVNHLLR GVRLATGEVV TVAGTGRQWR STVDDHAHDA LSVDLSSPWD LAWYDGRLVI AMAGIHQLWW FDPVKRTAGM YAGSTVEALK DGPLAEAWLA QPSGLSVSAD GSRLWVADSE TSAIRYVQDG VLNTAVGQGL FEFGHVDGPA AQALLQHPLG VCALPDGSVL IADTYNGAVR RYDPESDSVG TVADGLAEPS DLVLTPDGGV LVVESAAHRL TQLAPGTLTA AGASTVNGPR HRTERKPTEL RAGEVTLEVI FTPAPGQKLD DTYGPSTRLV VSASPPELLL AGAGTNTELT RRLVLNGAVS EGVLQVTAQA ATCDADVEHA ACHLTRQDWG VPIRVTDEAA DRLPLVLRGM DA
|
| |