Gene ECH74115_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1249 
Symbol 
ID6972129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1255961 
End bp1257052 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content55% 
IMG OID643385241 
Productputative monooxygenase rutA 
Protein accessionYP_002269736 
Protein GI209398634 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03612] pyrimidine utilization protein A 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTG GCGTATTCGT ACCTATTGGC AACAACGGCT GGCTCATTTC GACCCACGCG 
CCGCAGTACA TGCCGACCTT TGAACTGAAT AAAGCCATCG TGCAAAAAGC GGAGCACTAC
CATTTCGATT TCGCCCTGTC GATGATCAAA CTGCGTGGCT TTGGCGGCAA AACTGAGTTC
TGGGATCACA ACCTTGAGTC GTTCACCTTG ATGGCGGGGC TGGCGGCCGT GACCTCGCGC
ATTCAGATTT ACGCCACCGC CGCCACCTTA ACGTTACCTC CGGCAATCGT CGCCCGTATG
GCCGCAACCA TCGACTCCAT CTCTGGCGGG CGTTTTGGCG TCAACCTCGT GACTGGCTGG
CAAAAGCCCG AGTATGAGCA GATGGGTATC TGGCCTGGCG ATGACTATTT CTCCCGTCGT
TACGACTATC TCACCGAGTA TGTTCAGGTG CTGCGCGACC TGTGGGGCAC GGGAAAAAGC
GATTTTAAAG GCGATTTTTT CACCATGAAT GATTGTCGCG TCAGTCCGCA ACCGAGTGTC
CCCATGAAAG TGATCTGCGC CGGGCAAAGC GACGCTGGCA TGGCGTTCTC CGCTCAGTAT
GCCGATTTCA ACTTCTGTTT CGGCAAAGGC GTAAATACAC CCACGGCTTT CGCCCCGACC
GCTGCGCGGA TGAAACAGGC CGCAGAGCAA ACCGGGCGCG ACGTTGGCTC TTATGTATTG
TTTATGGTGA TTGCCGATGA AACCGACGAT GCCGCTCGCG CCAAATGGGA ACACTACAAA
GCGGGCGCGG ATGAAGAGGC GTTAAGCTGG CTAACCGAAC AAAGTCAGAA AGATACCCGC
TCCGGTACTG ACACCAACGT CCGTCAGATG GCCGATCCCA CTTCGGCGGT AAACATCAAT
ATGGGGACGT TAGTCGGTTC TTACGCCAGT GTCGCGCGCA TGTTAGATGA AGTCGCAAGC
GTGCCTGGTG CCGAAGGCGT GCTGTTAACC TTCGACGATT TTCTGTCGGG AATCGAAACC
TTCGGCGAGC GCATTCAACC ACTGATGCAG TGCCGCGCCC ATCTCCCTGT GCTGACTCAG
GAGGTGGCAT GA
 
Protein sequence
MKIGVFVPIG NNGWLISTHA PQYMPTFELN KAIVQKAEHY HFDFALSMIK LRGFGGKTEF 
WDHNLESFTL MAGLAAVTSR IQIYATAATL TLPPAIVARM AATIDSISGG RFGVNLVTGW
QKPEYEQMGI WPGDDYFSRR YDYLTEYVQV LRDLWGTGKS DFKGDFFTMN DCRVSPQPSV
PMKVICAGQS DAGMAFSAQY ADFNFCFGKG VNTPTAFAPT AARMKQAAEQ TGRDVGSYVL
FMVIADETDD AARAKWEHYK AGADEEALSW LTEQSQKDTR SGTDTNVRQM ADPTSAVNIN
MGTLVGSYAS VARMLDEVAS VPGAEGVLLT FDDFLSGIET FGERIQPLMQ CRAHLPVLTQ
EVA