Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2192 |
Symbol | aspC |
ID | 6146560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2201720 |
End bp | 2202910 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617068 |
Product | aromatic amino acid aminotransferase |
Protein accession | YP_001744242 |
Protein GI | 170682145 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1448] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.330977 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATTTGCT CGAAAATGAA ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT CCGGGTGGCA CTGGCGCACT ACGCGTGGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT AAGCGTGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA GGTCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ATCACACCCT TGACTTCGAT GCACTGATTA ACAGCCTGAA CGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC TGCCATAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAGACACT GGCACAACTT TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCTGCTA TGCATAAAGA GCTGATTGTT GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG GTTGCTGCTG ACAGTGAGAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGGG CAAACCGCGA CTTCAGCTTT ATCATCAAAC AGAACGGTAT GTTCTCCTTC AGTGGCCTGA CGAAAGAACA GGTACTGCGT CTGCGTGAAG AGTTTGGCGT GTATGCAGTG GCTTCTGGTC GTGTGAACGT GGCCGGGATG ACGCCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CCGTGCTGTA A
|
Protein sequence | MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRVA ADFLAKNTSV KRVWVSNPSW PNHKSVFNSA GLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL
|
| |