Gene Strop_0358 details


Gene Information

Locus tag: Strop_0358
Symbol:
ID: 5056796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 409482
End bp: 411062
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 72%
IMG OID: 640472630
Product: uroporphyrinogen III synthase HEM4
Protein accession: YP_001157221
Protein GI: 145592924
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase
TIGRFAM ID:
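
The coordinate and length fields in this record can be cross-checked against one another. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 409482
end_bp = 411062
gene_length_bp = 1581
protein_length_aa = 526

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp)                  # 1581
print(gene_length_bp % 3)       # 0, the length is a whole number of codons
print(expected_protein_length)  # 526
```

Under these assumptions the three fields agree: the coordinate span equals the stated gene length, and 527 codons minus one stop codon gives the stated 526 aa.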


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCGCA CCCGTAAGCC CGTAGGCCGC ATCGCGTTCG TCGGGGCCGG TCCCGGCGAT 
CCGGGCCTGC TGACCCGCCG GGGATACGAC GCCCTGGTCA GTGCCGACCA GGTGGTATAT
GACCGGGGAG TCCCTGAGGC GCTGCTCGAT GTCGTCCGCG CCCAGGCGAA GCAGGAGGCC
CAGCTCACCC TGGCCGAGGG CGGATCGGGC GACGTGGCGA AGGTTCTGAT CTCGGCGGCC
CGCTCCGGGC TGAACGCGGT GCACCTGGTC GCCGGTGACC CGTTCGGCCA CGAGGCGGTG
GTCCGGGAGG TGCAGGCGGT CGCGCGGACC GCCGGACAGT TCGAGGTGGT ACCGGGCGTC
GGCCAGGCCG AGGGGGTGGC GACCTACGCG GGTGTGCCGC TGCCCGGTGT TCGTACGGCG
GCCGATGTCG AGGATGTCAC CACGCTCGAC TTCGAGGCAC TGGCCGCCGC AGTCGCCCGG
GGACCGCTGG CCCTCGCGGT GGAGGCCGGA GATCTCGCCG CGATCCGGGA CGGGTTGCTC
GCCGCCGGAG TTGACGATGC GACCGCGGTC GGCGTGACCG GCGACGGCAC CGGCGAGACC
CAGTACACGA CGACGTCGAC GGTGGACTCC TTCGTCGCGG CGGCGCTCGG GTTCACCGGC
CGGGTGGTGC TCACCCTCGG CGAGGGCGTC GGCCAGCGGG ACAAGCTCAG CTGGTGGGAG
AACCGTCCGC TGTACGGCTG GAAGGTGCTC GTCCCGCGGA CCAAGGAGCA GGCGGGGGTG
ATGAGCGCCC GACTGCGCGC GTACGGCGCG ATCCCCTGTG AGGTGCCGAC CATCGCGGTC
GAGCCGCCGC GTACCCCCGC GCAGATGGAG CGGGCGGTCA AGGGGCTGGT CGACGGCCGG
TACGCTTGGG TGATCTTCAC TTCGGTGAAT GCGGTCCGGG CGGTCTGGGA GAAGTTCGCC
GAGCATGGCC TCGACGCCCG CCACTTCGGC GGTGTCAAGA TCGCCTGCAT CGGCGACGCG
ACCGCGGACG CGGTCCGCGC CTTCGGGATC CGGCCGGAGC TGGTCCCCGC CGGGGAGCAG
TCCTCGGAGG GGCTGCTGGC CGAGTTCTCG CCGCACGACG AGGTGCTCGA CCCGGTCGGT
CGGGTGCTGC TGCCGCGCGC CGACATCGCC ACCGAGACGC TCGCCGCCGG GCTCACCGAG
CGCGGCTGGG AGGTCGACGA CGTGACCGCC TACCGGACGG TTCGGGCGGC ACCGCCGCCG
GCCGAGATCC GCGACGCGAT CAAGTCGGGT GGGTTCGACG CGGTGCTCTT CACCTCATCG
TCCACGGTCC GGAATCTGGT CGGCATCGCC GGGAAGCCAC ACGCGCGTAC CGTTGTTGCG
GTTATCGGGC CCAAGACGGC GGAGACCGCC ACCGAGTTCG GCCTTCGGGT TGACGTGCAG
CCACCGCACG CCTCGGTCCC CGACCTGGTG GAGGCGCTGG CCGGCTACGC CGTCGAGCTG
CGCGAGAAGC TCGCCGCGAT GCCGGCGAAG CAGCGGCGCG GCTCGAAGGT ACAGGGGCCG
ACCGCCCTCC GGTTCCGCTG A
 
Protein sequence
MTRTRKPVGR IAFVGAGPGD PGLLTRRGYD ALVSADQVVY DRGVPEALLD VVRAQAKQEA 
QLTLAEGGSG DVAKVLISAA RSGLNAVHLV AGDPFGHEAV VREVQAVART AGQFEVVPGV
GQAEGVATYA GVPLPGVRTA ADVEDVTTLD FEALAAAVAR GPLALAVEAG DLAAIRDGLL
AAGVDDATAV GVTGDGTGET QYTTTSTVDS FVAAALGFTG RVVLTLGEGV GQRDKLSWWE
NRPLYGWKVL VPRTKEQAGV MSARLRAYGA IPCEVPTIAV EPPRTPAQME RAVKGLVDGR
YAWVIFTSVN AVRAVWEKFA EHGLDARHFG GVKIACIGDA TADAVRAFGI RPELVPAGEQ
SSEGLLAEFS PHDEVLDPVG RVLLPRADIA TETLAAGLTE RGWEVDDVTA YRTVRAAPPP
AEIRDAIKSG GFDAVLFTSS STVRNLVGIA GKPHARTVVA VIGPKTAETA TEFGLRVDVQ
PPHASVPDLV EALAGYAVEL REKLAAMPAK QRRGSKVQGP TALRFR
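
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, assuming only the display format seen in this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence in this record.
first_line = "ATGACCCGCA CCCGTAAGCC CGTAGGCCGC ATCGCGTTCG TCGGGGCCGG TCCCGGCGAT"
last_line = "ACCGCCCTCC GGTTCCGCTG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head))               # 60, six blocks of ten bases
print(head.startswith("ATG"))  # True, the sequence opens with a start codon
print(tail[-3:])               # TGA, the sequence closes with a stop codon
print(round(gc_fraction(head), 2))  # GC fraction of the first line only
```

Applying `gc_fraction` to the full cleaned gene sequence, rather than to one line, would give a value comparable to the GC content field in the record.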