Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0686 |
Symbol | |
ID | 6970102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 718330 |
End bp | 719490 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643384722 |
Product | putative aminotransferase |
Protein accession | YP_002269235 |
Protein GI | 209399170 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAATA ACCCTCTGAT TCCACAAAGC AAACTTCCAC AACTTGGCAC CACTATTTTC ACCCAGATGA GCGCGCTGGC GCAGCAACAC CAGGCGATTA ACCTGTCGCA AGGCTTTCCT GATTTTGATG GTCCGCGCTA TTTACAGGAG CGGCTGGCGT ACCACGTTGC CCAGGGGGCA AACCAATACG CGCCCATGAC CGGCGTGCAG GCCTTGCGCG AGGCGATTGC TCAGAAAACG GAACGTTTGT ATGGCTATCA ACCAGATGCC GATAGCGATA TCACCGTAAC GGCAGGGGCG ACGGAAGCGT TATACGCGGC GATTACCGCA CTGGTGCGCA ATGGCGATGA AGTGATTTGT TTTGATCCCA GCTATGACAG TTACGCCCCC GCCATCGCGC TTTCTGGGGG AATAGTGAAG CGTATGGCAC TGCAACCACC GCATTTTCGC GTTGACTGGC AGGAATTTGC CGCATTGTTA AGCGAGCGCA CCAGACTGGT GATCCTCAAC ACTCCGCATA ATCCCAGTGC AACTGTCTGG CAGCAGGCTG ATTTCGCCGC TTTGTGGCAG GCGATCGCCG GGCACGAGAT TTTTGTCATT AGCGATGAAG TCTACGAGCA CATCAACTTT TCACAACAGG GCCATGCCAG TGTGCTGGCG CATCCGCAGC TGCGTGAGCG GGCGGTGGCG GTGTCATCGT TTGGCAAGAC CTATCATATG ACCGGCTGGA AAGTGGGTTA TTGTGTTGCG CCAGCGCCCA TCAGCGCCGA AATTCGCAAG GTACATCAGT ATCTGACCTT TTCGGTGAAT ACCCCGGCAC AGCTGGCAAT TGCGGATATG CTACGTGCAG AACCTGAGCA TTATCTTGCG TTACCGGACT TTTATCGCCA GAAGCGCGAT ATTCTGGTAA ATGCCTTAAA TGAAAGTCGG CTGGAGATTT TACCGTGCGA AGGCACATAC TTTTTGCTGG TGGATTACAG CGCGGTTTCT ACCCTGGATG ATGTTGAGTT TTGCCAGTGG CTGACGCGGG AGCACGGCGT GGCGGCGATT CCGCTATCGG TGTTTTGCGC CGATCCCTTC CCACATAAAC TGATTCGTCT CTGTTTTGCC AAGAAGGAAT CGACGTTGCT GGCAGCAGCT GAACGACTGC GCCAGCTGTA G
|
Protein sequence | MTNNPLIPQS KLPQLGTTIF TQMSALAQQH QAINLSQGFP DFDGPRYLQE RLAYHVAQGA NQYAPMTGVQ ALREAIAQKT ERLYGYQPDA DSDITVTAGA TEALYAAITA LVRNGDEVIC FDPSYDSYAP AIALSGGIVK RMALQPPHFR VDWQEFAALL SERTRLVILN TPHNPSATVW QQADFAALWQ AIAGHEIFVI SDEVYEHINF SQQGHASVLA HPQLRERAVA VSSFGKTYHM TGWKVGYCVA PAPISAEIRK VHQYLTFSVN TPAQLAIADM LRAEPEHYLA LPDFYRQKRD ILVNALNESR LEILPCEGTY FLLVDYSAVS TLDDVEFCQW LTREHGVAAI PLSVFCADPF PHKLIRLCFA KKESTLLAAA ERLRQL
|
| |