Gene SeD_A3846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3846 
SymbolcysG 
ID6873825 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3668196 
End bp3669569 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content58% 
IMG OID642786811 
Productsiroheme synthase 
Protein accessionYP_002217439 
Protein GI198244791 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.130759 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCATT TGCCTATATT TTGTCAATTA CGCGACCGCG ACTGTCTGAT CGTCGGCGGT 
GGCGATGTCG CAGAACGCAA AGCACGGTTA CTGCTGGAAG CAGGCGCACG TTTAACGGTC
AATGCGCTAA CCTTCATTCC ACAGTTCACC GTATGGGCAA ATGAAGGCAT GTTGACTCTG
GTTAAGGGTC CGTTCGACGA AACGCTTCTC GACTCGTGTT GGCTGGCGAT CGCGGCCACT
GATGACGATA CCGTCAACCA GCGCGTCAGC GACGCGGCGG AGTCACGCCG TATCTTCTGC
AACGTGGTGG ATGCGCCGAA AGCCGCCAGC TTTATCATGC CCTCCATTAT TGACCGCTCG
CCGCTGATGG TCGCCGTCTC CTCGGGCGGC ACTTCCCCGG TACTGGCGCG TCTGCTGCGC
GAGAAACTGG AATCGCTGCT GCCGCAGCAT CTGGGGCAGG TCGCGCGCTA TGCCGGGCAA
CTCCGCGCCC GGGTGAAAAA GCAGTTTGCC ACGATGGGCG AGCGTCGTCG CTTCTGGGAA
AAATTTTTCG TCAATGACCG GCTGGCGCAG TCGCTGGCGA ATGCCGATGA GAAAGCGGTT
AACGCGACAA CCGAACGCCT GTTTAGCGAA CCGCTGGATC ACCGTGGCGA AGTCGTGCTG
GTCGGCGCCG GGCCGGGCGA TGCCGGACTG CTGACGCTGA AAGGGTTACA ACAAATCCAA
CAGGCGGATA TCGTGGTTTA CGATCGCCTC GTCTCCGACG ACATTATGAA CCTGGTACGC
CGCGATGCCG ATCGGGTCTT TGTGGGGAAA CGCGCGGGTT ACCACTGCGT CCCACAGGAA
GAAATCAACC AGATCCTGCT GCGTGAAGCG CAAAAAGGTA AACGCGTGGT ACGCCTGAAA
GGCGGCGATC CCTTTATCTT TGGTCGCGGC GGCGAAGAGC TGGAAACGCT GTGTCATGCC
GGTATTCCTT TCTCGGTAGT GCCAGGGATT ACCGCGGCTT CCGGCTGCTC CGCCTACTCC
GGTATTCCGC TAACCCATCG CGATTACGCC CAGAGCGTAC GTCTGGTCAC CGGTCACCTG
AAAACCGGCG GCGAGCTGGA CTGGGAAAAC CTGGCGGCAG AAAAACAGAC GCTGGTGTTC
TACATGGGGC TGAATCAGGC AGCGACTATC CAGGAAAAAC TGATCGCATT CGGTATGCAG
GCCGATATGC CGGTTGCGCT GGTAGAAAAC GGTACCTCCG TGAAGCAACG CGTCGTCCAC
GGTGTGCTGA CGCAGCTTGG TGAATTAGCG CAACAGGTTG AAAGCCCGGC GCTGATTATC
GTTGGCCGGG TCGTTGGCTT ACGCGATAAA TTAAATTGGT TCTCTAATTA TTAA
 
Protein sequence
MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLEAGARLTV NALTFIPQFT VWANEGMLTL 
VKGPFDETLL DSCWLAIAAT DDDTVNQRVS DAAESRRIFC NVVDAPKAAS FIMPSIIDRS
PLMVAVSSGG TSPVLARLLR EKLESLLPQH LGQVARYAGQ LRARVKKQFA TMGERRRFWE
KFFVNDRLAQ SLANADEKAV NATTERLFSE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ
QADIVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK
GGDPFIFGRG GEELETLCHA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLVTGHL
KTGGELDWEN LAAEKQTLVF YMGLNQAATI QEKLIAFGMQ ADMPVALVEN GTSVKQRVVH
GVLTQLGELA QQVESPALII VGRVVGLRDK LNWFSNY