Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3301 |
Symbol | |
ID | 6968448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3033719 |
End bp | 3034657 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387113 |
Product | indigoidine synthase A like protein |
Protein accession | YP_002271577 |
Protein GI | 209399243 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2313] Uncharacterized enzyme involved in pigment biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.025754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.0279599 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAT TAAAAATTTC CCCTGAATTA TTACAAATTT CCCCGGAAGT GCAGGACGCT TTAAAAAACA AAAAACCGGT TGTGGCGCTG GAATCGACCA TTATTTCTCA CGGGATGCCG TTCCCACAAA ATGCCCAGAC CGCAATTGAA GTAGAAGAAA CTATTCGTAA ACAGGGCGCA GTACCTGCCA CTATCGCCAT TATTGGCGGC GTGATGAAAG TGGGTTTAAG CAAAGAAGAA ATTGAATTAC TGGGTCGTGA AGGGCATAAC GTGACTAAAG TTAGTCGTCG CGATTTACCT TTTGTCGTTG CCGCAGGAAA AAATGGTGCA ACCACCGTGG CTTCAACGAT GATTATTGCG GCGCTTGCCG GAATTAAAGT ATTTGCCACC GGCGGAATTG GTGGTGTTCA TCGAGGGGCG GAACATACCT TCGATATTTC TGCCGATTTG CAAGAACTGG CAAATACTAA TGTCACCGTT GTTTGTGCCG GGGCGAAATC TATTCTCGAT TTAGGATTAA CCACTGAGTA TTTAGAAACC TTCGGTGTGC CGTTAATTGG CTATCAGACT AAAGCGCTGC CTGCGTTTTT CTGCCGTACC AGCCCGTTTG ACGTCAGCAT TCGTCTCGAC AGCGCCAGTG AAATTGCCCG TGCAATGGCG GTGAAATGGC AAAGCGGGCT GAACGGTGGC CTCGTGGTAG CGAACCCGAT CCCGGAACAG TTTGCGATGC CAGAACACAC TATCAATGCG GTGATCGATC AGGCGGTAGC TGAAGCTGAA GCGCAGGGTG TTATTGGTAA AGAAAGTACG CCATTCCTGC TGGCGCGCGT TGCTGAGCTG ACCGGCGGTG ACAGCCTGAA ATCCAACATC CAGCTTGTGT TCAACAACGC CATTCTGGCG AGCGAAATTG CCAAAGAATA TCAGCGTCTC GCGGGTTAA
|
Protein sequence | MSELKISPEL LQISPEVQDA LKNKKPVVAL ESTIISHGMP FPQNAQTAIE VEETIRKQGA VPATIAIIGG VMKVGLSKEE IELLGREGHN VTKVSRRDLP FVVAAGKNGA TTVASTMIIA ALAGIKVFAT GGIGGVHRGA EHTFDISADL QELANTNVTV VCAGAKSILD LGLTTEYLET FGVPLIGYQT KALPAFFCRT SPFDVSIRLD SASEIARAMA VKWQSGLNGG LVVANPIPEQ FAMPEHTINA VIDQAVAEAE AQGVIGKEST PFLLARVAEL TGGDSLKSNI QLVFNNAILA SEIAKEYQRL AG
|
| |