Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1089 |
Symbol | aspC |
ID | 6966649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1117346 |
End bp | 1118536 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385101 |
Product | aromatic amino acid aminotransferase |
Protein accession | YP_002269600 |
Protein GI | 209397827 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1448] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00147404 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.202333 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATCTGCT CGAAAATGAA ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT CCGGGTGGCA CTGGCGCACT ACGCATAGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT AAGCGAGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA GATCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ACCACACCCT TGACTTCGAT GCACTGATTA ACAGCCTGAA CGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC TGCCACAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAGACACT GGCACAACTC TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCGGCTA TGCATAAAGA GCTGATTGTT GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG GTTGCTGCCG ACAGTGAAAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGCG CAAACCGCGA CTTCAGCTTT ATCATCAAAC AGAACGGCAT GTTCTCCTTC AGTGGCCTGA CAAAAGAACA AGTGCTGCGT CTGCGCGAAG AGTTTGGCGT GTATGCTGTT GCTTCTGGTC GCGTAAACGT GGCCGGGATG ACACCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CAGTGCTGTA A
|
Protein sequence | MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRIA ADFLAKNTSV KRVWVSNPSW PNHKSVFNSA DLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL
|
| |