Gene EcE24377A_3838 details

Gene Information

Locus tag: EcE24377A_3838
Symbol: cysG
ID: 5585910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3812536
End bp: 3813909
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 55%
IMG OID: 640927462
Product: siroheme synthase
Protein accession: YP_001464823
Protein GI: 157157083
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0007] Uroporphyrinogen-III methylase
        [COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain)
TIGRFAM ID: [TIGR01469] uroporphyrin-III C-methyltransferase
            [TIGR01470] siroheme synthase, N-terminal domain
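
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3812536
end_bp = 3813909
reported_gene_length = 1374      # bp
reported_protein_length = 457    # aa

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print("gene length (bp):", gene_length)
print("codons:", codon_count, "remainder:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.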


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGATCATT TGCCTATATT TTGCCAATTA CGCGATCGCG ACTGTCTGAT TGTCGGCGGT 
GGTGATGTCG CGGAACGCAA AGCAAGGTTG CTGTTAGACG CAGGCGCTCG CTTAACGGTG
AATGCATTAG CGTTTATTCC ACAGTTCACC GCATGGGCAG ATGCAGGCAT GTTAACCCTC
GTCGAAGGGC CATTTGATGA AAGCCTTCTC GACACCTGCT GGCTGGCGAT TGCAGCGACG
GATGATGACG CGCTTAACCA GCGCGTCAGC GAAGCCGCTG AAGCTCGTCG CATCTTCTGT
AACGTGGTCG ATGCGCCGAA AGCCGCCAGC TTTATTATGC CGTCGATTAT TGACCGCTCA
CCGCTCATGG TCGCGGTCTC CTCTGGCGGC ACCTCTCCGG TTCTGGCGCG CCTGTTGCGC
GAAAAACTCG AATCACTGCT GCCGTTGCAT CTGGGCCAGG TGGCGAAATA TGCCGGGCAA
TTACGCGGGC GAGTGAAACA ACAGTTCGCC ACGATGAGTG AGCGTCGCCG TTTCTGGGAG
AAATTGTTCG TTAACGACCG CCTGGCGCAG TCGCTGGCAA ACAACGATCA GAAAGCCATT
ACTGAAACGA CCGAACAGTT AATCAACGAA CCGCTCGACC ATCGCGGTGA AGTGGTGCTG
GTTGGTGCAG GTCCGGGCGA TGCCGGGCTG CTGACGCTGA AAGGACTGCA ACAAATTCAG
CAGGCAGATG TGGTGGTCTA CGACCGTCTG GTTTCTGACG ATATTATGAA TCTGGTACGC
CGCGATGCTG ATCGCGTTTT CGTCGGCAAA CGCGCGGGAT ACCACTGTGT ACCGCAGGAA
GAGATTAACC AGATCCTGCT GCGGGAAGCG CAAAAAGGCA AACGCGTGGT GCGGCTGAAA
GGTGGCGATC CGTTTATTTT TGGCCGTGGT GGCGAAGAGC TGGAAACACT GTGCAACGCG
GGTATTCCGT TCTCGGTGGT TCCGGGTATT ACCGCAGCTT CTGGTTGCTC TGCCTATTCG
GGTATTCCGC TCACGCATCG CGATTATGCC CAGAGCGTGC GCTTAATTAC CGGACACTTA
AAAACCGGTG GCGAGCTGGA CTGGGAAAAC CTGGCGGCAG AAAAACAGAC GCTGGTGTTC
TATATGGGGC TGAATCAGGC TGCGACTATT CAGCAAAAGC TGATTGAACA CGGTATGCCT
GGCGAAATGC CGGTGGCAAT TGTCGAAAAC GGAACAGCAG TCACGCAGCG CGTGATTGAC
GGTACGCTCA CGCAACTCGG TGAACTTGCT CAGCAAATGA ACAGTCCATC GCTAATTATT
ATTGGTCGGG TTGTTGGCCT GCGCGATAAA CTGAACTGGT TCTCCAACCA TTAA
 
Protein sequence
MDHLPIFCQL RDRDCLIVGG GDVAERKARL LLDAGARLTV NALAFIPQFT AWADAGMLTL 
VEGPFDESLL DTCWLAIAAT DDDALNQRVS EAAEARRIFC NVVDAPKAAS FIMPSIIDRS
PLMVAVSSGG TSPVLARLLR EKLESLLPLH LGQVAKYAGQ LRGRVKQQFA TMSERRRFWE
KLFVNDRLAQ SLANNDQKAI TETTEQLINE PLDHRGEVVL VGAGPGDAGL LTLKGLQQIQ
QADVVVYDRL VSDDIMNLVR RDADRVFVGK RAGYHCVPQE EINQILLREA QKGKRVVRLK
GGDPFIFGRG GEELETLCNA GIPFSVVPGI TAASGCSAYS GIPLTHRDYA QSVRLITGHL
KTGGELDWEN LAAEKQTLVF YMGLNQAATI QQKLIEHGMP GEMPVAIVEN GTAVTQRVID
GTLTQLGELA QQMNSPSLII IGRVVGLRDK LNWFSNH
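
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, which accepts GTG as an alternative initiation codon read as methionine. The following is a minimal sketch, not the database's own pipeline, of how the grouped sequence blocks above could be cleaned, checked for GC fraction, and translated. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical; the codon table and start codon set follow the NCBI definition of table 11.

```python
from itertools import product

# NCBI translation table 11: same amino acid assignments as the
# standard code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11 (all read as Met at the start).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in an already cleaned sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading a valid first codon as M and
    dropping a single trailing stop codon if present."""
    seq = clean(seq)
    if len(seq) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First line of the gene sequence from this record (60 bp, 20 codons).
first_line = ("GTGGATCATT TGCCTATATT TTGCCAATTA "
              "CGCGATCGCG ACTGTCTGAT TGTCGGCGGT")
print(translate_cds(first_line))  # MDHLPIFCQLRDRDCLIVGG
```

The printed peptide matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction(clean(...))` would give a value to compare against the reported GC content.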