Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0652 |
Symbol | |
ID | 5594783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 669667 |
End bp | 670827 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640919833 |
Product | putative aminotransferase |
Protein accession | YP_001457415 |
Protein GI | 157160097 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 0.841721 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAATA ACCCTCTGAT TCCACAAAGC AAACTTCCAC AACTTGGCAC CACTATTTTC ACCCAGATGA GCGCGCTGGC GCAGCAACAC CAGGCGATTA ACCTGTCGCA AGGCTTTCCT GATTTTGATG GTCCGCGCTA TTTACAGGAG CGGCTGGCGC ACCACGTTGC ACAGGGGGCA AACCAATACG CGCCCATGAC CGGCGTGCAG GCCTTGCGCG AGGCGATTGC TCAGAAAACG GAACGTTTGT ATGGCTATCA ACCAGATGCC GATAGCGATA TCACCGTAAC GGCAGGGGCG ACGGAAGCGT TATACGCGGC GATTACCGCA CTGGTGCGCA ATGGCGATGA AGTGATTTGT TTTGATCCCA GCTATGACAG TTACGCCCCC GCCATCGCGC TTTCTGGGGG AATAGTGAAG CGTATGGCAC TGCAACCACC GCATTTTCGC GTTGACTGGC AGGAATTTGC CGCATTATTA AGCGAGCGCA CCAGACTGGT GATCCTCAAC ACTCCGCATA ACCCCAGTGC AACTGTCTGG CAGCAGGCTG ATTTCGCCGC TTTGTGGCAG GCGATCGCCG GGCACGAGAT TTTTGTCATT AGCGATGAAG TCTACGAGCA CATCAACTTT TCACAACAGG GCCATGCCAG TGTGCTGGCG CATCCGCAGC TGCGTGAGCG GGCAGTGGCG GTTTCTTCAT TTGGCAAGAC CTATCATATG ACCGGCTGGA AAGTGGGTTA TTGTGTTGCG CCAGCGCCCA TCAGCGCCGA AATTCGCAAG GTACATCAGT ATCTGACCTT TTCGGTGAAT ACCCCGGCAC AGCTGGCGCT TGCTGATATG CTACGTGCAG AACCTGAGCA TTATCTTGCG TTACCGGACT TTTATCGCCA GAAGCGCGAT ATTCTGGTGA ATGCTTTAAA TGAAAGCCGG CTGGAGATTT TACCGTGTGA AGGTACATAC TTTTTGCTGG TGGATTACAG CGCGGTTTCT ACCCTGGATG ATGTTGAGTT TTGCCAGTGG CTGACGCAGG AGCACGGCGT AGCGGCGATT CCGCTGTCGG TGTTTTGCGC CGATCCCTTC CCACATAAAC TGATTCGTCT CTGTTTTGCC AAGAAGGAAT CGACGTTGCT GGCAGCAGCT GAACGCCTGC GCCAGCTTTA G
|
Protein sequence | MTNNPLIPQS KLPQLGTTIF TQMSALAQQH QAINLSQGFP DFDGPRYLQE RLAHHVAQGA NQYAPMTGVQ ALREAIAQKT ERLYGYQPDA DSDITVTAGA TEALYAAITA LVRNGDEVIC FDPSYDSYAP AIALSGGIVK RMALQPPHFR VDWQEFAALL SERTRLVILN TPHNPSATVW QQADFAALWQ AIAGHEIFVI SDEVYEHINF SQQGHASVLA HPQLRERAVA VSSFGKTYHM TGWKVGYCVA PAPISAEIRK VHQYLTFSVN TPAQLALADM LRAEPEHYLA LPDFYRQKRD ILVNALNESR LEILPCEGTY FLLVDYSAVS TLDDVEFCQW LTQEHGVAAI PLSVFCADPF PHKLIRLCFA KKESTLLAAA ERLRQL
|
| |