Gene Information |
Locus tag | Sare_2788 |
Symbol | |
ID | 5707867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3169183 |
End bp | 3170403 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272244 |
Product | Dyp-type peroxidase family protein |
Protein accession | YP_001537614 |
Protein GI | 159038361 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR01412] Tat-translocated enzyme; [TIGR01413] Dyp-type peroxidase family |
|
|
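The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon does not contribute an amino acid.
    return codons - 1 if has_stop else codons


length = gene_length(3169183, 3170403)
print(length, protein_length(length))  # 1221 406
```

Both values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields of the record.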
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00338158 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000783217 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGGGA CCAACTCGCG CGCGGTGAGC AGGCGCGGCC TGCTCACCGG CGGCGCCCTG GCCGCCGGTG GCGCGCTCGG CGGCGCCGTC GCCGTCACCG CCGTCGGTTC CGACGATCCT TCGACACACC CCGCCGAGCC GGTAGCGGAG GTCGGGCTCG CCGTCGAACC GTTCCACGGC ACTCGCCAGT CCGGGGTCGC CACCGAACCG CAGGCCCACG GGGCGTTCGT CGCGCTCGTC CTCCGGCCCG ACACGGACCG GGCGGCGCTG GGGCGGATGC TGCGACTGCT CTCGGACGAT GCCGCCCGCC TCACCCAGGG CCATCCCGCC CTGGCCGACA CCGAACCGGA GCTCGGGCTG CTACCAGCCC GGCTGACCGT GACCTTCGGC TTCGGTCCCG GCCTCTATCA GGCGGCCAAC CTCGACGACC GACGGCCACC GTCGGTGGCC GCCCTGCCGG AGTTCCGGAT CGACAAACTC CAGCCCCGCT GGTCCGGCGG GGATCTACTG CTGCAGATCT GCGCCGACGA CCCGCTGACC GTCGCGCACG CCCAGCGGGT GCTGGTGAAG GACAGCCGAC CCTTCGCCAC CGTCGCGTGG GTGCAGCAGG GCTTCCGGCG CGCGGCCGGC GCCGAGCCGG GGCGTACCCA GCGCAACCTG TTCGGCCAAC TCGACGGCAC CGCCAACCCG AAGCCGGGCG CTCCCCTGGA GACCGCGGTG TGGGTGCCGG ACGGGCCGGC GTGGCTGCGC GACAGCACCA CCCTGGTCGT CCGGCGGATC AGCATGAACC TGGAGACGTG GGACCTGCTC GGCCGCACCG ACCGGGAGTT GTCCGTCGGC CGCCGGCTCG ACACCGGCGC ACCGCTGACC GGCACCGACG AACACGACGA GCCCGACCTC ACCGCCCTCG GGCCGGACGG GCTCACCGTC ATTCCGGACT TCTCGCATCT GACCCGCTCC CACGTCGACG AGGATCGGCT GCGGATCCTG CGCCGCCCGT ATAATTACGA CGGCGTGCCC AGCGCCGACG GCACCGCGGA CAGCGGGTTG ATCTTTGCTT CCTACCAGGC CGACATCACC CGCCAGTTCC TGCCCATCCA ACGACGCCTG GCCGAACGGG ACCTACTCAA CGAGTGGACC ACCCCAATCG GTTCTGCCGT GTTCGCGATC CCACCCGGCT GCCCCGAAGG TGGCTGGATC GGCCAGCAAC TGCTTGGCTG A
|
Protein sequence | MTGTNSRAVS RRGLLTGGAL AAGGALGGAV AVTAVGSDDP STHPAEPVAE VGLAVEPFHG TRQSGVATEP QAHGAFVALV LRPDTDRAAL GRMLRLLSDD AARLTQGHPA LADTEPELGL LPARLTVTFG FGPGLYQAAN LDDRRPPSVA ALPEFRIDKL QPRWSGGDLL LQICADDPLT VAHAQRVLVK DSRPFATVAW VQQGFRRAAG AEPGRTQRNL FGQLDGTANP KPGAPLETAV WVPDGPAWLR DSTTLVVRRI SMNLETWDLL GRTDRELSVG RRLDTGAPLT GTDEHDEPDL TALGPDGLTV IPDFSHLTRS HVDEDRLRIL RRPYNYDGVP SADGTADSGL IFASYQADIT RQFLPIQRRL AERDLLNEWT TPIGSAVFAI PPGCPEGGWI GQQLLG
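The gene sequence begins with GTG, yet the protein begins with M. This is expected under translation table 11, where GTG is one of the permitted start codons and an initiating codon is read as methionine. A minimal sketch of this translation and of a GC-content calculation follows, using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the sequence is assumed to be pasted in the space-separated 10-base blocks shown above.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11 (NCBI base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove whitespace (the 10-base block separators) and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("GTGACCGGGA CCAACTCGCG CGCGGTGAGC"))  # MTGTNSRAVS
```

Passing the full gene sequence to `translate` and `gc_percent` allows the protein sequence and the GC content field to be compared against the record.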