Gene Sare_0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0097 
Symbol 
ID5707067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp108238 
End bp110076 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content72% 
IMG OID641269623 
Productalkyl hydroperoxide reductase/ Thiol specific antioxidant/ Mal allergen 
Protein accessionYP_001535023 
Protein GI159035770 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0526] Thiol-disulfide isomerase and thioredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.328632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000931017 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCTACCG CGCCTCGCGT CCGAGCACCC GAACTGAAGG GCCGCCGCTG GCTGAACACC 
GGCGGACGAC ACCTGACCCT GCCGGACCTT CGAGGCCGCA TCACCGTCCT CGACTTCTGG
ACCTTCTGCT GCATCAACTG CCTTCACGTG CTCGACGAGC TACGCCCCAT CGAGCAGAAG
TACGCGGACG TGCTCGTGGT CATCGGCGTC CACTCGCCGA AGTTCGAGCA CGAAAAGGAC
CCGGACGCCC TGGCCGACGC CGTCGAACGG TACGGCGTGC ACCACCCGGT GCTCGACGAC
CCCGAACTGA ACATGTGGCA GCAGTACGCC GCCCGGGCTT GGCCGACCCT GGCCGTGATC
GACCCCGAGG GTTACGTGGT GGCCACCATG GCCGGCGAGG GACACGCCGA GGGCCTGGTC
CGGCTGGTGG ACGACCTGAT CGCCACCCAC GAGGCCAAGG GCACCCTGCA CCGGGGCGAC
GGCCCGTACG TGCCACCCGC CGAACCGGAG ACCACGCTGC GCTTTCCCGG CAAAGCTGTC
GTACTCGGCA ACGGGAACCT GCTGGTGTCG GACTCGGCCC GGCACTCCAT CGCGGAGCTG
GCACCCGACG GCGAGAGGGT GGTCCGCCGG ATCGGCACCG GCGCGCGCGG CCGGGCCGAC
GGGCCCGCCA CGGCGGCCAC CTTCGCCGAG CCGCAGGGGC TCTGCCTGCT CCCGGCCCAC
GTCGCCCGGC TGGTCGACTA CGACCTGGTC GTCGCCGACA CCGTCAACCA CCTGCTGCGC
GGCGTCCGCC TCGCCACCGG CGAGGTGGTC ACCGTCGCCG GCACCGGCCG ACAGTGGCGT
TCCACCGTGG ACGACCACGC CCACGACGCG CTCTCCGTCG ACCTCTCCTC CCCCTGGGAC
CTGGCCTGGT ACGACGGCCG GCTCGTGATC GCCATGGCCG GCATCCACCA GCTCTGGTGG
TTCGACCCGG TGAAGCGCAC CGCCGGCATG TACGCGGGCA GCACCGTCGA GGCCCTCAAG
GACGGCCCGC TGGCCGAAGC GTGGCTGGCC CAGCCCTCCG GTCTGTCGGT CTCCGCCGAC
GGCAGCCGGC TCTGGGTCGC CGACAGCGAA ACCAGCGCGA TCCGGTACGT CCAGGACGGT
GTCCTGAACA CTGCGGTCGG CCAGGGGCTC TTCGAATTCG GGCATGTCGA CGGGCCAGCG
GCACAGGCGC TGCTCCAGCA CCCGCTGGGG GTCTGTGCAC TGCCGGACGG CTCGGTGCTG
ATCGCCGACA CGTACAACGG GGCGGTCCGC CGCTACGACC CGGAGTCGGA CTCGGTGGGC
ACCGTCGCCG ACGGACTTGC CGAACCGAGC GACCTCGTTC TCACCCCGGA CGGCGGGGTA
CTGGTCGTGG AGTCCGCCGC CCACCGACTG ACCCAGCTCG CGCCGGGCAC GCTCACCGCC
GCCGGGGCCA GCACGGTCAA CGGCCCACGG CACCGTACCG AGCGGAAGCC GACCGAACTG
CGAGCCGGCG AGGTGACCCT GGAGGTCATC TTCACCCCGG CCCCCGGCCA GAAGCTCGAC
GACACCTACG GCCCGTCGAC CCGGCTGGTG GTCTCGGCGT CCCCGCCGGA GCTGCTACTG
GCTGGGGCGG GCACCAACAC CGAGCTGACC CGCCGGCTGG TGCTCAACGG TGCGGTCTCC
GAAGGCGTGC TCCAGGTGAC CGCGCAGGCG GCCACCTGCG ACGCCGACGT GGAGCATGCC
GCGTGCCACC TGACCCGGCA GGACTGGGGT GTACCGATCC GGGTGACTGA CGAGGCCGCC
GACCGTCTCC CGCTGGTGCT GCGCGGCATG GACGCCTGA
 
Protein sequence
MATAPRVRAP ELKGRRWLNT GGRHLTLPDL RGRITVLDFW TFCCINCLHV LDELRPIEQK 
YADVLVVIGV HSPKFEHEKD PDALADAVER YGVHHPVLDD PELNMWQQYA ARAWPTLAVI
DPEGYVVATM AGEGHAEGLV RLVDDLIATH EAKGTLHRGD GPYVPPAEPE TTLRFPGKAV
VLGNGNLLVS DSARHSIAEL APDGERVVRR IGTGARGRAD GPATAATFAE PQGLCLLPAH
VARLVDYDLV VADTVNHLLR GVRLATGEVV TVAGTGRQWR STVDDHAHDA LSVDLSSPWD
LAWYDGRLVI AMAGIHQLWW FDPVKRTAGM YAGSTVEALK DGPLAEAWLA QPSGLSVSAD
GSRLWVADSE TSAIRYVQDG VLNTAVGQGL FEFGHVDGPA AQALLQHPLG VCALPDGSVL
IADTYNGAVR RYDPESDSVG TVADGLAEPS DLVLTPDGGV LVVESAAHRL TQLAPGTLTA
AGASTVNGPR HRTERKPTEL RAGEVTLEVI FTPAPGQKLD DTYGPSTRLV VSASPPELLL
AGAGTNTELT RRLVLNGAVS EGVLQVTAQA ATCDADVEHA ACHLTRQDWG VPIRVTDEAA
DRLPLVLRGM DA