Gene Sare_0428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0428 
Symbol 
ID5708405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp488900 
End bp490480 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content72% 
IMG OID641269953 
Producturoporphyrinogen III synthase HEM4 
Protein accessionYP_001535348 
Protein GI159036095 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1587] Uroporphyrinogen-III synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.256746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000733658 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCGCA GCCGTAAGCC CGTAGGCCGC ATCGCGTTCG TCGGGGCCGG CCCCGGCGAC 
CCGGGCCTGC TGACCCGCCG GGGGTACGAC GCCCTGGTCA ATGCCGACCA GGTGGTATAT
GACCGGGGAG TTCCCGAGTC GCTGCTCGAC GTCGTTCGCG CCCAGGCGAA ACAGGACGCC
CAGCTCACCC TGGCTGAGGG CGGGTCCGGC GACGTGGCGA AGGTGCTGAT CTCGGCGGCC
CGTTCCGGGC TGAACGCGGT GCACCTGGTC GCCGGTGACC CGTTCGGCCA CGGGTCGGTG
GTCAAGGAGG TGCAGGCGGT CGCGCGGACC GCCGGGCACT TCGAGGTGGT CCCGGGCGTC
GGCCAGGCAG AGGGGGTGGC GACCTACGCG GGTGTGCCGC TACCCGGAGT CCGCACAGCG
GCTGACGTCG AGGACGTCAA CACGCTGGAC TTCGAGGCGC TGGCCGCCGC CGTCACCCGG
GGGCCGCTGG CACTCGCGGC GGACGCCGGG GACCTTGCCG CGATCCGGGA CGGGTTGCTC
GCCGCCGGGG TCGACGACAC AACGGCCGTG GGGGTGACCG GTGACGGCAC CGGTGAGACC
CAGTACACGA CCACGTCGAC CGTGGACTCC TTCGTCGCGG CGGCGCTCGG CTTCACCGGC
CGGGTCGTAC TCACCCTTGG CGACGGGGTT GGCCAGCGGG ACAAGCTCAG CTGGTGGGAG
AACCGCCCGC TGTACGGCTG GAAGGTGCTC GTACCGCGGA CGAAGGAACA GGCCGGCGTG
ATGAGTGCCC GGCTGCGCGC GTACGGCGCG ATTCCCTGTG AGGTACCGAC CATCGCGGTC
GAGCCGCCGC GTACTCCCGC GCAGATGGAG CGGGCGGTCA AGGGGCTGGT CGACGGCCGG
TACGCCTGGG TGATCTTCAC GTCGGTGAAC GCGGTCCGGG CGGTCTGGGA GAAGTTCGCC
GAGCACGGCC TCGACGCCCG ACACTTCGGC GGCGTGAAGA TCGCCTGTAT CGGTGACGCG
ACCGCCGACG CGGTCCGCGC CTTCGGGATC AGGCCGGAGC TGGTCCCCGC CGGTGAGCAG
TCGTCGGAGG GCCTGTTGGC CGAGTTCTCG CCGCACGACG AGGTACTCGA CCCGGTTGGT
CGGGTGCTGC TGCCGCGCGC CGACATCGCT ACCGAGACGT TGGCCGCCGG GCTCACCGAG
CGAGGCTGGG AGGTTGACGA CGTCACCGCG TACCGGACGG TCCGGGCGGC GCCGCCGCCG
GCCGAGATCC GCGACGCGAT CAAGTCGGGT GGGTTCGACG CGGTGCTCTT CACCTCGTCG
TCCACGGTCC GGAACTTGGT CGGCATCGCC GGGAAGCCGC ACGCGCGTAC CGTTGTTTCG
GTTATCGGGC CCAAGACGGC GGAGACCGCC ACCGAGTTCG GCCTTCGGGT CGACGTGCAG
CCTCCGCACG CCTCGGTCCC TGACTTGGTG GAGGCGCTGG CCGGCTACGC CGTCGAGCTG
CGCGAGAAGC TCGCCGCTAT GCCGGCGAAG CAGCGTCGCG GCTCGAAGGT GCAGGGGCCG
ACCGCCCTCA GGTTCCGCTG A
 
Protein sequence
MTRSRKPVGR IAFVGAGPGD PGLLTRRGYD ALVNADQVVY DRGVPESLLD VVRAQAKQDA 
QLTLAEGGSG DVAKVLISAA RSGLNAVHLV AGDPFGHGSV VKEVQAVART AGHFEVVPGV
GQAEGVATYA GVPLPGVRTA ADVEDVNTLD FEALAAAVTR GPLALAADAG DLAAIRDGLL
AAGVDDTTAV GVTGDGTGET QYTTTSTVDS FVAAALGFTG RVVLTLGDGV GQRDKLSWWE
NRPLYGWKVL VPRTKEQAGV MSARLRAYGA IPCEVPTIAV EPPRTPAQME RAVKGLVDGR
YAWVIFTSVN AVRAVWEKFA EHGLDARHFG GVKIACIGDA TADAVRAFGI RPELVPAGEQ
SSEGLLAEFS PHDEVLDPVG RVLLPRADIA TETLAAGLTE RGWEVDDVTA YRTVRAAPPP
AEIRDAIKSG GFDAVLFTSS STVRNLVGIA GKPHARTVVS VIGPKTAETA TEFGLRVDVQ
PPHASVPDLV EALAGYAVEL REKLAAMPAK QRRGSKVQGP TALRFR