Gene ECH74115_1438 details

Gene Information

Locus tag: ECH74115_1438
Symbol: solA
ID: 6971653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1422195
End bp: 1423313
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 52%
IMG OID: 643385411
Product: N-methyltryptophan oxidase
Protein accession: YP_002269905
Protein GI: 209396898
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01377] sarcosine oxidase, monomeric form
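
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption, though one the listed gene length is consistent with) and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1422195
end_bp = 1423313
gene_length_bp = 1119
protein_length_aa = 372

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. The last codon is the stop
# codon, which does not contribute a residue to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 1119 373 372
print(span_bp == gene_length_bp, expected_protein_aa == protein_length_aa)
```

Under those assumptions, the computed span and protein length agree with the Gene Length and Protein Length fields.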


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.478275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.0445868
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT 
GCAACCCGCG CCGGTTTAAA CGTGCTTATG ACCGACGCCC ATATGCCACC ACATCAACAC
GGCAGCCACC ACGGCGATAC GCGATTAATT CGACATGCTT ATGGTGAAGG CGAAAAGTAT
GTCCCGCTGG TCCTCCGCGC ACAAACGCTG TGGGATGACC TCTCCCGCCA CAACGAAGAT
GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTTGGCC CGGCTGACTC CGCATTTCTC
GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TTGAAAAGCT CGATGCGCAA
GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG
ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG
GAAGCGGGCT GTGCGCAACT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT
GGCGTAACTA TTGAAACGGC TGACGGTGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG
GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCCGTACG TAAAGTATTT
GCCTGGTATC AGGCCGATGG CCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT
GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT
GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA
GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT
TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC
GATACCCTAC CCGGCCACGA TAATACACTG CTCATTACCG GCCTGAGTGG GCACGGTTTT
AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT
TTTGATTTGA CGCCATTCAG CCTTTCCCGC TTCCAATAA
 
Protein sequence
MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY 
VPLVLRAQTL WDDLSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ
GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD
GVTIETADGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG
ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG
CCLYGAACTY DNSPDEDFII DTLPGHDNTL LITGLSGHGF KFASVLGEIA ADFAQDKKSD
FDLTPFSLSR FQ
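
As a rough consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal, dependency-free illustration, not the database's own pipeline. It builds the codon table from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons are permitted). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last rows of the gene sequence are used; each row holds 60 bases, so every row begins in frame.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed above.
first_row = "ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT"
last_row = "TTTGATTTGA CGCCATTCAG CCTTTCCCGC TTCCAATAA"

print(translate(first_row))  # MKYDLIIIGSGSVGAAAGYY
print(translate(last_row))   # FDLTPFSLSRFQ
```

Both translated fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.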