Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1438 |
Symbol | solA |
ID | 6971653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1422195 |
End bp | 1423313 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385411 |
Product | N-methyltryptophan oxidase |
Protein accession | YP_002269905 |
Protein GI | 209396898 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01377] sarcosine oxidase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.478275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.0445868 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATACG ATCTCATCAT TATTGGCAGC GGTTCCGTAG GCGCTGCCGC CGGGTATTAT GCAACCCGCG CCGGTTTAAA CGTGCTTATG ACCGACGCCC ATATGCCACC ACATCAACAC GGCAGCCACC ACGGCGATAC GCGATTAATT CGACATGCTT ATGGTGAAGG CGAAAAGTAT GTCCCGCTGG TCCTCCGCGC ACAAACGCTG TGGGATGACC TCTCCCGCCA CAACGAAGAT GATCCCATTT TTGTACGCTC TGGTGTCATT AACCTTGGCC CGGCTGACTC CGCATTTCTC GCCAACGTCG CCCACAGCGC CGAACAGTGG CAACTCAACG TTGAAAAGCT CGATGCGCAA GGGATTATGG CCCGCTGGCC AGAAATACGC GTCCCGGACA ACTACATCGG CTTATTTGAG ACTGATTCCG GTTTTTTGCG CAGCGAACTG GCGATTAAAA CCTGGATCCA ACTGGCGAAG GAAGCGGGCT GTGCGCAACT GTTCAACTGC CCGGTCACCG CAATTCGTCA TGACGATGAT GGCGTAACTA TTGAAACGGC TGACGGTGAG TATCAGGCGA AAAAAGCGAT TGTCTGCGCG GGAACATGGG TAAAAGACCT GCTCCCGGAG CTGCCTGTCC AGCCCGTACG TAAAGTATTT GCCTGGTATC AGGCCGATGG CCGCTATAGC GTGAAGAATA AATTCCCGGC GTTTACCGGT GAACTGCCCA ATGGCGATCA ATATTATGGT TTTCCGGCAG AAAACGACGC GTTGAAGATT GGCAAACATA ACGGAGGCCA GGTTATCCAT TCAGCGGATG AACGTGTTCC GTTTGCGGAA GTGGTCAGCG ATGGTTCGGA AGCCTTCCCG TTCTTGCGCA ATGTATTGCC GGGTATCGGT TGCTGCCTGT ACGGCGCTGC CTGCACCTAT GATAATTCGC CTGACGAAGA TTTTATTATC GATACCCTAC CCGGCCACGA TAATACACTG CTCATTACCG GCCTGAGTGG GCACGGTTTT AAATTTGCGT CAGTTTTAGG GGAAATAGCT GCCGATTTTG CGCAAGACAA AAAAAGCGAT TTTGATTTGA CGCCATTCAG CCTTTCCCGC TTCCAATAA
|
Protein sequence | MKYDLIIIGS GSVGAAAGYY ATRAGLNVLM TDAHMPPHQH GSHHGDTRLI RHAYGEGEKY VPLVLRAQTL WDDLSRHNED DPIFVRSGVI NLGPADSAFL ANVAHSAEQW QLNVEKLDAQ GIMARWPEIR VPDNYIGLFE TDSGFLRSEL AIKTWIQLAK EAGCAQLFNC PVTAIRHDDD GVTIETADGE YQAKKAIVCA GTWVKDLLPE LPVQPVRKVF AWYQADGRYS VKNKFPAFTG ELPNGDQYYG FPAENDALKI GKHNGGQVIH SADERVPFAE VVSDGSEAFP FLRNVLPGIG CCLYGAACTY DNSPDEDFII DTLPGHDNTL LITGLSGHGF KFASVLGEIA ADFAQDKKSD FDLTPFSLSR FQ
|
| |