Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3650 |
Symbol | cysG |
ID | 6143292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3709956 |
End bp | 3711329 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618477 |
Product | siroheme synthase |
Protein accession | YP_001745617 |
Protein GI | 170680692 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase [TIGR01470] siroheme synthase, N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0000355794 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATCATT TGCCTATATT TTGCCAATTA CGCGATCGCG ACTGTCTGAT TGTCGGCGGT GGTGATGTCG CGGAACGCAA AGCAAGGTTG CTGTTAGACG CAGGCGCTCG CTTAACGGTG AATGCATTAG CGTTTATTCC ACAGTTCACC GCATGGGCAG ATGCAGGCAT GTTAACCCTC GTCGAAGGGC CATTTGATGA AAGCCTTCTC GACACCTGCT GGCTGGCGAT TGCAGCGACA GATGATGACG CGCTTAACCA GCGCGTCAGT GAAGCCGCTG AAGCTCGTCG CATCTTCTGT AACGTGGTCG ATGCGCCGAA AGCCGCCAGC TTTATTATGC CGTCAATTAT TGACCGCTCA CCGCTCATGG TCGCGGTCTC CTCTGGCGGC ACCTCTCCGG TTCTGGCACG CCTGTTGCGT GAAAAACTTG AATCACTGCT GCCGTTGCAT CTGGGCCAGG TAGCGAAATA CGCCGGGCAA TTACGCGGTC GAGTGAAACA ACAGTTCGCC ACAATGGGTG AACGTCGCCG TTTCTGGGAA AAACTGTTCG TTAATGACCG TCTGGCGCAG TCGCTGGCAA ACAACGATCA GAAAGCCATT ACTGAAACGA CCGAACAATT AATCAACGAA CCGCTCGACC ATCGCGGTGA AGTGGTGCTG GTTGGCGCAG GTCCGGGCGA TGCCGGGCTG CTGACACTGA AAGGACTGCA ACAAATTCAG CAGGCAGATG TGGTGGTCTA CGACCGTCTG GTTTCTGACG ATATTATGAA TCTGGTACGC CGCGATGCTG ATCGCGTTTT CGTCGGCAAA CGCGCGGGAT ACCACTGCGT ACCGCAGGAA GAGATTAACC AGATCCTGCT GCGGGAAGCG CAAAAAGGCA AACGCGTGGT GCGACTGAAA GGCGGCGATC CGTTTATTTT TGGCCGTGGT GGCGAAGAGC TGGAAACACT GTGCAATGCA GGCATTCCGT TCTCGGTGGT TCCGGGTATT ACCGCAGCTT CTGGTTGCTC TGCCTATTCG GGTATTCCGC TCACGCATCG CGATTATGCC CAGAGCGTAC GCTTAATTAC CGGACACTTA AAAACCGGTG GCGAACTGGA CTGGGAAAAC CTGGCGGCAG AAAAACAGAC GCTGGTGTTC TATATGGGGT TGAATCAGGC CGCGACTATT CAGCAAAAGC TGATTGAACA CGGTATGCCT GGCGAAATGC CGGTGGCAAT TGTCGAAAAC GGAACGGCAG TCACGCAGCG CGTGATTGAC GGTACGCTCA CGCAGCTGGG CGAACTGGCG CAGCAAATGA ACAGTCCATC GCTAATTATT ATTGGTCGGG TTGTTGGCCT GCGCGATAAA TTGAACTGGT TCTCTAACCA TTAA
|
Protein sequence | MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLDAGARLTV NALAFIPQFT AWADAGMLTL VEGPFDESLL DTCWLAIAAT DDDALNQRVS EAAEARRIFC NVVDAPKAAS FIMPSIIDRS PLMVAVSSGG TSPVLARLLR EKLESLLPLH LGQVAKYAGQ LRGRVKQQFA TMGERRRFWE KLFVNDRLAQ SLANNDQKAI TETTEQLINE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ QADVVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK GGDPFIFGRG GEELETLCNA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLITGHL KTGGELDWEN LAAEKQTLVF YMGLNQAATI QQKLIEHGMP GEMPVAIVEN GTAVTQRVID GTLTQLGELA QQMNSPSLII IGRVVGLRDK LNWFSNH
|
| |