Gene Sare_1475 details

Gene Information

Locus tag: Sare_1475
Symbol:
ID: 5706068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1705084
End bp: 1706493
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 75%
IMG OID: 641270983
Product: protoporphyrinogen oxidase
Protein accession: YP_001536364
Protein GI: 159037111
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1232] Protoporphyrinogen oxidase
TIGRFAM ID: [TIGR00562] protoporphyrinogen oxidase


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0850035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000274844
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGACAC CATGGCGGAT CGCGGTGGTC GGCGGCGGGA TAGCCGGCCT CGCCGCCGCC 
GTCCAACTGA GGGACCACGC TCCCGAAGGC ACCGAGGTGA CGGTGTACGA GCGAAGCGGC
GCGCTCGGCG GCAAGCTGCA CACCGGTGAG CTGGCCGGCG GGCCGGTGGA GTTCGGCGCG
GAAGCGTTCC TGATGCGGGA CGCCAGCGGC GGTGAGTCCG CTGTGGTGTC CCTGATCCGT
CGGCTCGGGC TGGCCGACGA CATCGTCCAC CCCACCGTCG GGCAGGCGGC GCTCCTCGCC
GGCGGCGAGC TGCATCCGGT GCCCCGCGGC ACGCTGGTCG GCGTACCCGG GGACCTCGCG
GCGGCGGCGG CGGTGGCCCG CCCGACCGCG GAGGCCGACG TGGACACCGG TAGGCCGTTG
CTCGCCCCCG GTGCCGACGT CACCGTCGGG GAGTTGGTCC GCGGCCGGCT GGGCGACGAG
GTCGTCGACC GGTTGGTCGA TCCGATGCTC GGTGGCGTCT ACGCCGGCCG CGCTGACGAC
CTCTCCCTGG CCGCGACCAT GCCCGCGCTG GCCCGGCAGG CCCGGGTCGA GCACACCCTC
GTCGGCGCGG TCCGCGCGGC GCAGGCCGCG GCACCGCGGG CACCGGGCAC GCCGTTCTTC
GGCACCCTGG CCGGTGGTCT GAGCACCCTG GTCGAGGCCG CGGCCGCGGC CAGCGGCGCC
ACGATCCAGC GGAACGCGAC GGTTCGTGCG CTGACCCCGG CGGAGGCCGG CTGGCGACTG
ACCATCGGGC CGAACGGCGA CGCGGACCAC GTCCAGGCCG ACGCCGTGGT GCTGGCCGTG
CCGGCCAGCC CGGCGGCGCG GCTTCTCGAC GACGTCGCCC CGGCCGTGGC GGAGAACATC
GGCGCCCTGG CCTACGCCAG CGTCGCGCTG GTCACTCTCG CGTTGCCGGA GGCGACGCTG
CCCGCGCTCT CCGGCTTCCT GGTGTCGACC GGCGAGGGGC TGGTGATGAA GGCCTCCACC
TTCTTCACCA CGAAGTGGGG GCACCTGCGT CGGCCGGACG GGCTGGCCCT GGTCCGTGCC
TCGGTCGGGC GGCTCGGGGA CGAGGCGCAG CTCCAGCGCC CCGACGCGGA CCTGGTCGCC
ACGACGCACC GAGAGTTGTC GACAGTGCTC GGTGACGCGC TTCCCACGCC GGTCGCCACG
CACGTGCAGC GCTGGGGCGG GTCGCTGCCG CAGTACGCGC CGGGCCACCT CGACCGGGTG
GGGTCGGCGC GGGCGGTGCT GCGGGCGGAG CGGCCGACCC TGGCGTTGGC GGGTGCCGGC
TACGACGGCG TCGGCATTCC GGTCTGTGTC CGTTCCGGCA TGGCGGCGGC TGACGAGATC
ATCACTGCAC TGAAGGGTTC CGGGGAATGA
 
Protein sequence
MATPWRIAVV GGGIAGLAAA VQLRDHAPEG TEVTVYERSG ALGGKLHTGE LAGGPVEFGA 
EAFLMRDASG GESAVVSLIR RLGLADDIVH PTVGQAALLA GGELHPVPRG TLVGVPGDLA
AAAAVARPTA EADVDTGRPL LAPGADVTVG ELVRGRLGDE VVDRLVDPML GGVYAGRADD
LSLAATMPAL ARQARVEHTL VGAVRAAQAA APRAPGTPFF GTLAGGLSTL VEAAAAASGA
TIQRNATVRA LTPAEAGWRL TIGPNGDADH VQADAVVLAV PASPAARLLD DVAPAVAENI
GALAYASVAL VTLALPEATL PALSGFLVST GEGLVMKAST FFTTKWGHLR RPDGLALVRA
SVGRLGDEAQ LQRPDADLVA TTHRELSTVL GDALPTPVAT HVQRWGGSLP QYAPGHLDRV
GSARAVLRAE RPTLALAGAG YDGVGIPVCV RSGMAAADEI ITALKGSGE
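
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of such a check, not part of the original record. The function names (clean_sequence, gc_fraction, lengths_consistent) are hypothetical helpers introduced only for illustration, and the length check assumes a complete CDS that ends in exactly one stop codon.

```python
def clean_sequence(raw: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """Check that the CDS has three bases per residue plus one stop codon.

    Assumption: the CDS is complete and unspliced, with a single stop codon.
    """
    return len(dna) == 3 * (len(protein) + 1)


if __name__ == "__main__":
    # Paste the full "Gene sequence" and "Protein sequence" blocks here.
    # Only the first blocks of each are shown, so the check below is
    # expected to fail until the full text is supplied.
    gene_raw = "ATGGCGACAC CATGGCGGAT"
    protein_raw = "MATPWRIAVV GGGIAGLAAA"

    gene = clean_sequence(gene_raw)
    protein = clean_sequence(protein_raw)
    print("gene length:", len(gene))
    print("GC fraction:", round(gc_fraction(gene), 3))
    print("lengths consistent:", lengths_consistent(gene, protein))
```

For the values reported in this record, the arithmetic agrees: 3 x (469 + 1) = 1410, which matches the stated gene length of 1410 bp and the coordinate span 1706493 - 1705084 + 1.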