Gene Information |
Locus tag | Sare_0428 |
Symbol | |
ID | 5708405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 488900 |
End bp | 490480 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641269953 |
Product | uroporphyrinogen III synthase HEM4 |
Protein accession | YP_001535348 |
Protein GI | 159036095 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase; [COG1587] Uroporphyrinogen-III synthase |
TIGRFAM ID | |
|
|
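The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes one terminal stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 488900
END_BP = 490480
GENE_LENGTH_BP = 1581
PROTEIN_LENGTH_AA = 526


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 490480 - 488900 + 1 gives 1581 bp, and 1581 / 3 - 1 gives 526 aa, which agrees with the values listed in the table.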
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.256746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000733658 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCGCA GCCGTAAGCC CGTAGGCCGC ATCGCGTTCG TCGGGGCCGG CCCCGGCGAC CCGGGCCTGC TGACCCGCCG GGGGTACGAC GCCCTGGTCA ATGCCGACCA GGTGGTATAT GACCGGGGAG TTCCCGAGTC GCTGCTCGAC GTCGTTCGCG CCCAGGCGAA ACAGGACGCC CAGCTCACCC TGGCTGAGGG CGGGTCCGGC GACGTGGCGA AGGTGCTGAT CTCGGCGGCC CGTTCCGGGC TGAACGCGGT GCACCTGGTC GCCGGTGACC CGTTCGGCCA CGGGTCGGTG GTCAAGGAGG TGCAGGCGGT CGCGCGGACC GCCGGGCACT TCGAGGTGGT CCCGGGCGTC GGCCAGGCAG AGGGGGTGGC GACCTACGCG GGTGTGCCGC TACCCGGAGT CCGCACAGCG GCTGACGTCG AGGACGTCAA CACGCTGGAC TTCGAGGCGC TGGCCGCCGC CGTCACCCGG GGGCCGCTGG CACTCGCGGC GGACGCCGGG GACCTTGCCG CGATCCGGGA CGGGTTGCTC GCCGCCGGGG TCGACGACAC AACGGCCGTG GGGGTGACCG GTGACGGCAC CGGTGAGACC CAGTACACGA CCACGTCGAC CGTGGACTCC TTCGTCGCGG CGGCGCTCGG CTTCACCGGC CGGGTCGTAC TCACCCTTGG CGACGGGGTT GGCCAGCGGG ACAAGCTCAG CTGGTGGGAG AACCGCCCGC TGTACGGCTG GAAGGTGCTC GTACCGCGGA CGAAGGAACA GGCCGGCGTG ATGAGTGCCC GGCTGCGCGC GTACGGCGCG ATTCCCTGTG AGGTACCGAC CATCGCGGTC GAGCCGCCGC GTACTCCCGC GCAGATGGAG CGGGCGGTCA AGGGGCTGGT CGACGGCCGG TACGCCTGGG TGATCTTCAC GTCGGTGAAC GCGGTCCGGG CGGTCTGGGA GAAGTTCGCC GAGCACGGCC TCGACGCCCG ACACTTCGGC GGCGTGAAGA TCGCCTGTAT CGGTGACGCG ACCGCCGACG CGGTCCGCGC CTTCGGGATC AGGCCGGAGC TGGTCCCCGC CGGTGAGCAG TCGTCGGAGG GCCTGTTGGC CGAGTTCTCG CCGCACGACG AGGTACTCGA CCCGGTTGGT CGGGTGCTGC TGCCGCGCGC CGACATCGCT ACCGAGACGT TGGCCGCCGG GCTCACCGAG CGAGGCTGGG AGGTTGACGA CGTCACCGCG TACCGGACGG TCCGGGCGGC GCCGCCGCCG GCCGAGATCC GCGACGCGAT CAAGTCGGGT GGGTTCGACG CGGTGCTCTT CACCTCGTCG TCCACGGTCC GGAACTTGGT CGGCATCGCC GGGAAGCCGC ACGCGCGTAC CGTTGTTTCG GTTATCGGGC CCAAGACGGC GGAGACCGCC ACCGAGTTCG GCCTTCGGGT CGACGTGCAG CCTCCGCACG CCTCGGTCCC TGACTTGGTG GAGGCGCTGG CCGGCTACGC CGTCGAGCTG CGCGAGAAGC TCGCCGCTAT GCCGGCGAAG CAGCGTCGCG GCTCGAAGGT GCAGGGGCCG ACCGCCCTCA GGTTCCGCTG A
|
Protein sequence | MTRSRKPVGR IAFVGAGPGD PGLLTRRGYD ALVNADQVVY DRGVPESLLD VVRAQAKQDA QLTLAEGGSG DVAKVLISAA RSGLNAVHLV AGDPFGHGSV VKEVQAVART AGHFEVVPGV GQAEGVATYA GVPLPGVRTA ADVEDVNTLD FEALAAAVTR GPLALAADAG DLAAIRDGLL AAGVDDTTAV GVTGDGTGET QYTTTSTVDS FVAAALGFTG RVVLTLGDGV GQRDKLSWWE NRPLYGWKVL VPRTKEQAGV MSARLRAYGA IPCEVPTIAV EPPRTPAQME RAVKGLVDGR YAWVIFTSVN AVRAVWEKFA EHGLDARHFG GVKIACIGDA TADAVRAFGI RPELVPAGEQ SSEGLLAEFS PHDEVLDPVG RVLLPRADIA TETLAAGLTE RGWEVDDVTA YRTVRAAPPP AEIRDAIKSG GFDAVLFTSS STVRNLVGIA GKPHARTVVS VIGPKTAETA TEFGLRVDVQ PPHASVPDLV EALAGYAVEL REKLAAMPAK QRRGSKVQGP TALRFR
|
| |
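The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene begins with ATG, start-codon special-casing is omitted here. The helper name `translate` is hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in T, C, A, G order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are NOT converted to M in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence above.
    print(translate("ATGACCCGCA GCCGTAAGCC CGTAGGCCGC"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the protein sequence (MTRSRKPVGR). Passing the full gene sequence should reproduce the listed protein if the record is consistent.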