Gene Sare_2788 details

Gene Information

Locus tag: Sare_2788
Symbol:
ID: 5707867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 3169183
End bp: 3170403
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 72%
IMG OID: 641272244
Product: Dyp-type peroxidase family protein
Protein accession: YP_001537614
Protein GI: 159038361
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01412] Tat-translocated enzyme
            [TIGR01413] Dyp-type peroxidase family
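
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3169183
END_BP = 3170403
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1221
    print(expected_protein_length(GENE_LENGTH_BP))  # 406
```

Under these assumptions the computed values agree with the reported gene length of 1221 bp and protein length of 406 aa.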


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00338158
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000783217
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCGGGA CCAACTCGCG CGCGGTGAGC AGGCGCGGCC TGCTCACCGG CGGCGCCCTG 
GCCGCCGGTG GCGCGCTCGG CGGCGCCGTC GCCGTCACCG CCGTCGGTTC CGACGATCCT
TCGACACACC CCGCCGAGCC GGTAGCGGAG GTCGGGCTCG CCGTCGAACC GTTCCACGGC
ACTCGCCAGT CCGGGGTCGC CACCGAACCG CAGGCCCACG GGGCGTTCGT CGCGCTCGTC
CTCCGGCCCG ACACGGACCG GGCGGCGCTG GGGCGGATGC TGCGACTGCT CTCGGACGAT
GCCGCCCGCC TCACCCAGGG CCATCCCGCC CTGGCCGACA CCGAACCGGA GCTCGGGCTG
CTACCAGCCC GGCTGACCGT GACCTTCGGC TTCGGTCCCG GCCTCTATCA GGCGGCCAAC
CTCGACGACC GACGGCCACC GTCGGTGGCC GCCCTGCCGG AGTTCCGGAT CGACAAACTC
CAGCCCCGCT GGTCCGGCGG GGATCTACTG CTGCAGATCT GCGCCGACGA CCCGCTGACC
GTCGCGCACG CCCAGCGGGT GCTGGTGAAG GACAGCCGAC CCTTCGCCAC CGTCGCGTGG
GTGCAGCAGG GCTTCCGGCG CGCGGCCGGC GCCGAGCCGG GGCGTACCCA GCGCAACCTG
TTCGGCCAAC TCGACGGCAC CGCCAACCCG AAGCCGGGCG CTCCCCTGGA GACCGCGGTG
TGGGTGCCGG ACGGGCCGGC GTGGCTGCGC GACAGCACCA CCCTGGTCGT CCGGCGGATC
AGCATGAACC TGGAGACGTG GGACCTGCTC GGCCGCACCG ACCGGGAGTT GTCCGTCGGC
CGCCGGCTCG ACACCGGCGC ACCGCTGACC GGCACCGACG AACACGACGA GCCCGACCTC
ACCGCCCTCG GGCCGGACGG GCTCACCGTC ATTCCGGACT TCTCGCATCT GACCCGCTCC
CACGTCGACG AGGATCGGCT GCGGATCCTG CGCCGCCCGT ATAATTACGA CGGCGTGCCC
AGCGCCGACG GCACCGCGGA CAGCGGGTTG ATCTTTGCTT CCTACCAGGC CGACATCACC
CGCCAGTTCC TGCCCATCCA ACGACGCCTG GCCGAACGGG ACCTACTCAA CGAGTGGACC
ACCCCAATCG GTTCTGCCGT GTTCGCGATC CCACCCGGCT GCCCCGAAGG TGGCTGGATC
GGCCAGCAAC TGCTTGGCTG A
 
Protein sequence
MTGTNSRAVS RRGLLTGGAL AAGGALGGAV AVTAVGSDDP STHPAEPVAE VGLAVEPFHG 
TRQSGVATEP QAHGAFVALV LRPDTDRAAL GRMLRLLSDD AARLTQGHPA LADTEPELGL
LPARLTVTFG FGPGLYQAAN LDDRRPPSVA ALPEFRIDKL QPRWSGGDLL LQICADDPLT
VAHAQRVLVK DSRPFATVAW VQQGFRRAAG AEPGRTQRNL FGQLDGTANP KPGAPLETAV
WVPDGPAWLR DSTTLVVRRI SMNLETWDLL GRTDRELSVG RRLDTGAPLT GTDEHDEPDL
TALGPDGLTV IPDFSHLTRS HVDEDRLRIL RRPYNYDGVP SADGTADSGL IFASYQADIT
RQFLPIQRRL AERDLLNEWT TPIGSAVFAI PPGCPEGGWI GQQLLG
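
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method. It assumes that translation table 11 uses the standard codon assignments and that an alternative start codon such as GTG is read as methionine at the first position. `translate_cds` and the constant names are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Start codons listed for NCBI translation table 11; read as Met when first.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is decoded as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGACCGGGA CCAACTCGCG CGCGGTGAGC"))  # MTGTNSRAVS
```

The first ten residues produced this way, MTGTNSRAVS, match the start of the protein sequence listed in this record, with the leading GTG read as M.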