Gene Information |
Locus tag | ECH74115_4161 |
Symbol | |
ID | 6969989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3850238 |
End bp | 3851434 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643387907 |
Product | diaminopropionate ammonia-lyase |
Protein accession | YP_002272346 |
Protein GI | 209398501 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01747] diaminopropionate ammonia-lyase family; [TIGR03528] diaminopropionate ammonia-lyase |
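The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon; both assumptions are consistent with the listed values but are not stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3850238
end_bp = 3851434
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# One codon is three bases. Assumption: the final codon is a stop
# codon and therefore does not contribute an amino acid.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(gene_length_bp)                      # 1197
print(coding_codons)                       # 398
print(coding_codons == protein_length_aa)  # True
```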
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTTT TCTCATTGAA GATTGATATC GCCGATAACA AATTTTTCAA CGGCGAAACA TCACCGCTCT TTTCGCAAAG CCAGGCCAAA CTGGCGCGCC AGTTCCACCA GAAAATAGCT GGTTATCGCC CAACACCGCT TTGTGCGCTG GACGATCTCG CAAACCTTTT TGGGGTGAAG AAAATTCTTG TCAAAGACGA ATCAAAACGA TTCGGTCTGA ACGCCTTCAA AATGCTCGGC GGTGCGTACG CTATCGCTCA ATTATTGTGT GAAAAATATC ATCTTGATAT CGAAACGCTG TCATTTGAGC ACCTGAAAAA TGCCATCGGC GAAAAAATGA CTTTCGCGAC CACCACCGAC GGCAACCACG GGCGCGGTGT GGCGTGGGCA GCGCAGCAAC TCGGACAGAA TGCGGTGATT TACATGCCGA AAGGTTCTGC TCAGGAACGC GTTGACGCCA TTCTGAACCT CGGTGCCGAG TGCATCGTCA CAGATATGAA CTATGACGAT ACCGTTCGCC TGACCATGCA ACACGCGCAG CAGCACGGCT GGGAAGTGGT ACAGGACACG GCATGGGAAG GTTACACCAA AATCCCAACC TGGATCATGC AAGGCTACGC AACCCTGGCA GATGAAGCCG TCGAGCAAAT GCGTGAAATG GGCGTAACCC CGACGCACGT TCTGCTGCAA GCCGGTGTCG GAGCAATGGC CGGTGGTGTG CTGGGTTATC TGGTCGACGT CTATAGCCCG CAAAATCTGC ACAGCATTAT TGTTGAACCT GACAAAGCTG ACTGTATTTA TCGCTCCGGC GTCAAAGGCG ACATCGTCAA CGTTGGCGGT GATATGGCCA CCATCATGGC AGGCCTGGCC TGTGGCGAAC CTAACCCGCT GGGCTGGGAA ATCCTACGTA ACTGCGCCAC CCAATTCATC TCCTGCCAGG ACAGCGTTGC CGCATTAGGT ATGCGCGTGC TGGGTAATCC GTACGGCAAC GACCCGCGCA TCATCTCCGG TGAATCCGGC GCTGTCGGTT TGGGCGTTCT CGCAGCGGTT CATTATCACC CGCAACGTCA AAGCCTGATG GAAAAACTGG CGCTGAACAA AGATGCCGTA GTGCTGGTTA TCAGCACTGA AGGCGACACC GACGTGAAGC ACTACCGCGA AGTTGTCTGG GAAGGCAAAC ACGCTGTAGC ACCTTAA
Protein sequence | MSVFSLKIDI ADNKFFNGET SPLFSQSQAK LARQFHQKIA GYRPTPLCAL DDLANLFGVK KILVKDESKR FGLNAFKMLG GAYAIAQLLC EKYHLDIETL SFEHLKNAIG EKMTFATTTD GNHGRGVAWA AQQLGQNAVI YMPKGSAQER VDAILNLGAE CIVTDMNYDD TVRLTMQHAQ QHGWEVVQDT AWEGYTKIPT WIMQGYATLA DEAVEQMREM GVTPTHVLLQ AGVGAMAGGV LGYLVDVYSP QNLHSIIVEP DKADCIYRSG VKGDIVNVGG DMATIMAGLA CGEPNPLGWE ILRNCATQFI SCQDSVAALG MRVLGNPYGN DPRIISGESG AVGLGVLAAV HYHPQRQSLM EKLALNKDAV VLVISTEGDT DVKHYREVVW EGKHAVAP
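The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not a tool associated with this record: the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for internal codons (the tables differ in permitted start codons). Only the first three blocks of the gene sequence are used here to keep the example short.

```python
# Standard codon assignments, ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGTCCGTTT TCTCATTGAA GATTGATATC")

print(len(fragment))        # 30
print(translate(fragment))  # MSVFSLKIDI
print(round(gc_fraction(fragment), 2))
```

Passing the full gene sequence to the same helpers should give a translation that can be compared against the protein sequence row, and a GC fraction that can be compared against the GC content field.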