Gene ECH74115_4679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4679 
SymbolcysG 
ID6967315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4320814 
End bp4322187 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content55% 
IMG OID643388383 
Productsiroheme synthase 
Protein accessionYP_002272811 
Protein GI209397180 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.839222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCATT TGCCTATATT TTGCCAATTA CGCGATCGCG ACTGTCTGAT TGTCGGCGGT 
GGTGATGTCG CGGAACGCAA AGCAAGGTTG CTGTTAGACG CAGGCGCTCG CTTAACGGTG
AATGCATTAG CGTTTATTCC ACAGTTCACC GCATGGGCAG ATGCAGGCAT GTTAACCCTC
GTCGAAGGGC CATTTGATGA AAGCCTTCTC GACACCTGCT GGCTGGCGAT TGCAGCGACG
GATGATGACG CGCTTAACCA GCGCGTCAGC GAAGCCGCTG AAGCTCGTCA CATCTTCTGT
AACGTGGTCG ATGCGCCGAA AGCTGCCAGC TTCATTATGC CGTCGATTAT TGACCGCTCA
CCGCTGATGG TGGCGGTTTC CTCTGGCGGT ACTTCTCCGG TTCTGGCGCG CCTGTTGCGC
GAAAAACTCG AATCACTGCT GCCGCTGCAT CTGGGCCAGG TAGCGAAATA CGCCGGGCAG
TTACGCGGTC GGGTAAAACA ACAGTTCGCC ACGATGGGTG AACGCCGCCG CTTCTGGGAA
AAACTGTTCG TTAACGACCG TCTGGCGCAG TCGCTGGCGA ACAACGATCA GAAAGCCATT
ACTGAAACCA CCGAACAGTT AATCAACGAA CCGCTCGACC ATCGCGGTGA AGTGGTGCTG
GTTGGTGCAG GTCCGGGCGA TGCCGGGCTG CTGACACTGA AAGGTCTGCA ACAAATTCAG
CAGGCAGATG TGGTGGTCTA CGACCGTCTG GTTTCTGACG ATATTATGAA TCTGGTACGC
CGCGATGCGG ATCGTGTTTT CGTCGGCAAA CGCGCGGGAT ACCACTGCGT ACCGCAGGAG
GAGATTAACC AGATCCTGCT GCGGGAAGCG CAAAAAGGCA AACGCGTGGT GCGGCTGAAA
GGCGGCGATC CGTTTATTTT TGGCCGTGGT GGCGAAGAGC TGGAAACACT GTGCAACGCG
GGTATTCCGT TCTCGGTGGT TCCGGGTATT ACCGCAGCTT CTGGTTGCTC AGCCTATTCG
GGTATTCCGC TCACGCATCG CGATTATGCC CAGAGCGTGC GCTTAATTAC CGGACACTTA
AAAACCGGTG GCGAGCTGGA CTGGGAAAAC CTGGCGGCAG AAAAACAGAC GCTGGTGTTC
TATATGGGGC TGAATCAGGC TGCGACTATT CAGCAAAAGC TGATTGAACA CGGTATGCCT
GGCGAAATGC CGGTAGCAAT TGTCGAAAAC GGAACAGCAG TCACGCAGCG CGTGATTGAC
GGTACGCTCA CACAACTGGG CGAACTGGCG CAGCAAATGA ATAGTCCATC GCTAATTATT
ATTGGTCGGG TTGTTGGCCT GCGCGATAAA CTGAACTGGT TCTCCAACCA TTAA
 
Protein sequence
MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLDAGARLTV NALAFIPQFT AWADAGMLTL 
VEGPFDESLL DTCWLAIAAT DDDALNQRVS EAAEARHIFC NVVDAPKAAS FIMPSIIDRS
PLMVAVSSGG TSPVLARLLR EKLESLLPLH LGQVAKYAGQ LRGRVKQQFA TMGERRRFWE
KLFVNDRLAQ SLANNDQKAI TETTEQLINE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ
QADVVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK
GGDPFIFGRG GEELETLCNA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLITGHL
KTGGELDWEN LAAEKQTLVF YMGLNQAATI QQKLIEHGMP GEMPVAIVEN GTAVTQRVID
GTLTQLGELA QQMNSPSLII IGRVVGLRDK LNWFSNH