Gene Information |
Locus tag | B21_03171 |
Symbol | cysG |
ID | 8112986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3356640 |
End bp | 3358013 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644849352 |
Product | hypothetical protein |
Protein accession | YP_003000925 |
Protein GI | 251786621 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase [TIGR01470] siroheme synthase, N-terminal domain |
|
|
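The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Gene Information table of this record.
length = gene_length(3356640, 3358013)
print(length, expected_protein_length(length))  # prints: 1374 457
```

Both results agree with the Gene Length and Protein Length rows.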
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATCATT TGCCTATATT TTGCCAATTA CGCGATCGCG ACTGTCTGAT TGTCGGCGGT GGTGATGTCG CGGAACGCAA AGCAAGGTTG CTGTTAGACG CAGGCGCTCG CTTAACGGTG AATGCATTAG CGTTTATTCC ACAGTTCACC GCATGGGCAG ATGCAGGCAT GTTAACCCTC GTCGAAGGGC CATTTGATGA AAGCCTTCTC GACACCTGCT GGCTGGCGAT TGCAGCGACG GATGATGACG CGCTTAACCA GCGCGTCAGC GAAGCCGCTG AAGCTCGTCG CATCTTCTGT AACGTGGTCG ATGCGCCGAA AGCCGCCAGC TTTATTATGC CGTCGATTAT TGACCGCTCA CCGCTCATGG TAGCGGTCTC CTCTGGCGGC ACCTCTCCGG TTCTGGCACG CCTGTTGCGC GAAAAACTTG AATCACTGCT GCCGTTACAT CTGGGCCAGG TAGCGAAATA CGCCGGGCAA TTACGCGGGC GAGTGAAACA ACAGTTCGCC ACGATGGGTG AGCGTCGCCG TTTCTGGGAG AAATTGTTCG TTAACGACCG CCTGGCGCAG TCGCTGGCAA ACAACGATCA GAAAGCCATT ACTGAAACGA CCGAACAGTT AATCAACGAA CCGCTCGACC ATCGCGGTGA AGTGGTGCTG GTTGGTGCAG GTCCGGGCGA TGCCGGGCTG CTGACACTGA AAGGACTGCA ACAAATTCAG CAGGCAGATG TGGTGGTCTA CGACCGTCTG GTTTCTGACG ATATTATGAA TCTGGTACGC CGCGATGCGG ACCGTGTTTT CGTCGGCAAA CGCGCGGGAT ACCACTGCGT ACCCCAGGAA GAGATTAACC AGATCCTGCT GCGGGAAGCG CAAAAAGGCA AACGCGTGGT GCGGCTGAAA GGTGGCGATC CGTTTATTTT TGGCCGTGGT GGCGAAGAGC TGGAAACACT GTGCAACGCG GGTATTCCGT TCTCGGTGGT TCCGGGTATT ACCGCAGCTT CTGGTTGCTC TGCCTATTCG GGTATTCCAC TCACGCATCG CGATTATGCC CAGAGCGTAC GCTTAATTAC CGGACACTTA AAAACCGGTG GCGAGCTGGA CTGGGAAAAC CTGGCGGCAG AAAAACAGAC GCTGGTGTTC TATATGGGGT TGAATCAGGC CGCGACTATT CAGCAAAAGC TGATTGAACA CGGAATGCCA GGCGAAATGC CGGTGGCAAT TGTCGAAAAC GGTACGGCAG TCACGCAGCG CGTGATTGAC GGTACGCTCA CACAGCTGGG AGAACTGGCG CAGCAAATGA ACAGTCCATC GCTAATTATT ATTGGTCGGG TTGTTGGCCT GCGCGATAAA CTGAACTGGT TCTCCAACCA TTAA
|
Protein sequence | MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLDAGARLTV NALAFIPQFT AWADAGMLTL VEGPFDESLL DTCWLAIAAT DDDALNQRVS EAAEARRIFC NVVDAPKAAS FIMPSIIDRS PLMVAVSSGG TSPVLARLLR EKLESLLPLH LGQVAKYAGQ LRGRVKQQFA TMGERRRFWE KLFVNDRLAQ SLANNDQKAI TETTEQLINE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ QADVVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK GGDPFIFGRG GEELETLCNA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLITGHL KTGGELDWEN LAAEKQTLVF YMGLNQAATI QQKLIEHGMP GEMPVAIVEN GTAVTQRVID GTLTQLGELA QQMNSPSLII IGRVVGLRDK LNWFSNH
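The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code), in which an alternative initiator such as GTG is read as methionine when it opens the reading frame. The following is a minimal standard-library sketch of that relationship together with a GC fraction helper; it is not the pipeline that produced this record. The helper names are hypothetical, and the start codon set is the one listed for NCBI table 11.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the 10-letter block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiator codon is read as methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
prefix = "GTGGATCATT TGCCTATATT TTGCCAATTA"
print(translate(prefix))  # prints: MDHLPIFCQL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the Protein sequence and GC content rows.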