Gene ECH74115_3301 details


Gene Information

Locus tag: ECH74115_3301
Symbol:
ID: 6968448
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3033719
End bp: 3034657
Gene Length: 939 bp
Protein Length: 312 aa
Translation table: 11
GC content: 49%
IMG OID: 643387113
Product: indigoidine synthase A like protein
Protein accession: YP_002271577
Protein GI: 209399243
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2313] Uncharacterized enzyme involved in pigment biosynthesis
TIGRFAM ID:
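
The coordinates and lengths listed above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and assuming that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the last codon is an untranslated stop."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3033719, 3034657)
print(length)                     # 939
print(protein_length_aa(length))  # 312
```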


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.025754
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0279599
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAT TAAAAATTTC CCCTGAATTA TTACAAATTT CCCCGGAAGT GCAGGACGCT 
TTAAAAAACA AAAAACCGGT TGTGGCGCTG GAATCGACCA TTATTTCTCA CGGGATGCCG
TTCCCACAAA ATGCCCAGAC CGCAATTGAA GTAGAAGAAA CTATTCGTAA ACAGGGCGCA
GTACCTGCCA CTATCGCCAT TATTGGCGGC GTGATGAAAG TGGGTTTAAG CAAAGAAGAA
ATTGAATTAC TGGGTCGTGA AGGGCATAAC GTGACTAAAG TTAGTCGTCG CGATTTACCT
TTTGTCGTTG CCGCAGGAAA AAATGGTGCA ACCACCGTGG CTTCAACGAT GATTATTGCG
GCGCTTGCCG GAATTAAAGT ATTTGCCACC GGCGGAATTG GTGGTGTTCA TCGAGGGGCG
GAACATACCT TCGATATTTC TGCCGATTTG CAAGAACTGG CAAATACTAA TGTCACCGTT
GTTTGTGCCG GGGCGAAATC TATTCTCGAT TTAGGATTAA CCACTGAGTA TTTAGAAACC
TTCGGTGTGC CGTTAATTGG CTATCAGACT AAAGCGCTGC CTGCGTTTTT CTGCCGTACC
AGCCCGTTTG ACGTCAGCAT TCGTCTCGAC AGCGCCAGTG AAATTGCCCG TGCAATGGCG
GTGAAATGGC AAAGCGGGCT GAACGGTGGC CTCGTGGTAG CGAACCCGAT CCCGGAACAG
TTTGCGATGC CAGAACACAC TATCAATGCG GTGATCGATC AGGCGGTAGC TGAAGCTGAA
GCGCAGGGTG TTATTGGTAA AGAAAGTACG CCATTCCTGC TGGCGCGCGT TGCTGAGCTG
ACCGGCGGTG ACAGCCTGAA ATCCAACATC CAGCTTGTGT TCAACAACGC CATTCTGGCG
AGCGAAATTG CCAAAGAATA TCAGCGTCTC GCGGGTTAA
 
Protein sequence
MSELKISPEL LQISPEVQDA LKNKKPVVAL ESTIISHGMP FPQNAQTAIE VEETIRKQGA 
VPATIAIIGG VMKVGLSKEE IELLGREGHN VTKVSRRDLP FVVAAGKNGA TTVASTMIIA
ALAGIKVFAT GGIGGVHRGA EHTFDISADL QELANTNVTV VCAGAKSILD LGLTTEYLET
FGVPLIGYQT KALPAFFCRT SPFDVSIRLD SASEIARAMA VKWQSGLNGG LVVANPIPEQ
FAMPEHTINA VIDQAVAEAE AQGVIGKEST PFLLARVAEL TGGDSLKSNI QLVFNNAILA
SEIAKEYQRL AG
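
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the sense-codon assignments of translation table 11 (which match the standard genetic code) and assuming the sequence is read in frame from its first base. The helper names (`clean`, `translate`, `gc_percent`) are illustrative. Only the first and last rows of the gene sequence are included here to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
# Assumption: translation table 11 uses the same amino-acid assignments
# as the standard code for sense and stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence above. The last row starts at
# base 901, so it is still in frame.
FIRST_ROW = "ATGTCTGAAT TAAAAATTTC CCCTGAATTA TTACAAATTT CCCCGGAAGT GCAGGACGCT"
LAST_ROW = "AGCGAAATTG CCAAAGAATA TCAGCGTCTC GCGGGTTAA"

print(translate(FIRST_ROW))  # MSELKISPELLQISPEVQDA
print(translate(LAST_ROW))   # SEIAKEYQRLAG
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the listed protein sequence and GC content to be checked against the record.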