Gene Sare_3920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3920 
Symbol 
ID5703771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4462384 
End bp4463643 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content75% 
IMG OID641273345 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_001538702 
Protein GI159039449 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.797973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.18472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCCA ACCCGTACCC GCTCGGGCTG CGCCTGGCCG GCCGTAGGGT GGTCGTGGTC 
GGTGGGGGAG CGGTCGCCAC CCGCCGAGTG CCGGCCCTGC TCGACGCGGG CGCGGACGTC
CTCCTGGTCG CACCGGAGCT GACCCCGGCG CTGCGCGCCC ACGCCGACGC TGGTCGGTTG
CACTGGGCGC GGCGACGGTT CGCGGTGGAC GACCTCGATG GTGCCTGGCT GGTGCAGGTG
GCGGTGAACG ACCCGATCGC CGCCGCCGCG GTCAGCGCGG CGGCCGGCGA GCGGCGGATC
TTCTGTGTCC GCGCCGATGA TCGCGCCGCC GCCACTGCCT GGACCCCGGC GGTCACCCGG
CAGGGTCCGG TGACGGTGGC GGTCCTCGGC GGCGGCGACC CCCGGCGCGC GATGGCCGTC
CGGGATGCCG TCCGGGACCT GCTCGCTGCC GGAGCCGGAC CGCTGGCCCC GCCCTCGACA
ACCGGTGACG GGACCGCTGG TGCTCCGGGG CGCGCACCCT CGACCACCGC CCGCGGCGGG
CGCGTCGCCC TGGTCGGCGC TGGACCGGGC GACCCGGAGC TGATCACGGT CAAGGGGCGA
CGGCTGCTCA CCGAGGCGGA CGTGGTGGTT GCCGACCGGC TGGTGCCAGG CCTCCTTCTG
GACGAGTTGC GCCCCGAGGT CGAACTGGTC GACGCGGCCA AGATTCCCTA CGGCCCGGCC
CGTGCCCAGG AGGAGATCAA CCGTGTCCTG GTCGACCGGG CTCTGGCCGG CAAGGCCGTG
GTCCGGCTCA AGGGCGGCGA CCCATACGTC TTCGGTCGTG GGGGCGAGGA ACTGCTGGCC
TGCGCCGCGG CGGGCGTACC GGTGACGGTG GTGTCCGGGG TGACCAGCGC GATTGCTGCG
CCAGCGGGCG CCGGTGTCCC GGTCACCCAC CGGGCGGTGG CGCACGAGTT CACCGTGGTG
TCCGGGCACG TTCCGCCGGA CTCGCCGGCC TCGATGGTGC GCTGGGAGCA CCTCGCCGGG
CTGCGCGGCA CGCTGGCGAT CATGATGGGG TTGAAGAATC TGGGGGCGAT CTCCGCGACG
TTGGTCACCC ACGGCCGCCC CGCGGACACC CCGGCGGTGG TCGTGCAAGA GGGCACGACC
GGCGATCAGC GTACGGTCCG CTCGACGCTC GGCGGGGTGG CCGTCGATGT GGCCGCGGCG
GGCCTCCGTC CCCCGGCGGT CGTGCTGATC GGCGACGTGG TCGGGGTCCT GGACACCTGA
 
Protein sequence
MSANPYPLGL RLAGRRVVVV GGGAVATRRV PALLDAGADV LLVAPELTPA LRAHADAGRL 
HWARRRFAVD DLDGAWLVQV AVNDPIAAAA VSAAAGERRI FCVRADDRAA ATAWTPAVTR
QGPVTVAVLG GGDPRRAMAV RDAVRDLLAA GAGPLAPPST TGDGTAGAPG RAPSTTARGG
RVALVGAGPG DPELITVKGR RLLTEADVVV ADRLVPGLLL DELRPEVELV DAAKIPYGPA
RAQEEINRVL VDRALAGKAV VRLKGGDPYV FGRGGEELLA CAAAGVPVTV VSGVTSAIAA
PAGAGVPVTH RAVAHEFTVV SGHVPPDSPA SMVRWEHLAG LRGTLAIMMG LKNLGAISAT
LVTHGRPADT PAVVVQEGTT GDQRTVRSTL GGVAVDVAAA GLRPPAVVLI GDVVGVLDT