Gene Information |
Locus tag | B21_00939 |
Symbol | aspC |
ID | 8116196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | - |
Start bp | 989020 |
End bp | 990210 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644847201 |
Product | hypothetical protein |
Protein accession | YP_002998774 |
Protein GI | 251784470 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1448] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
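The coordinate and length fields in this record are mutually consistent, and a short check makes the relationship explicit. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(989020, 990210)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Both values match the Gene Length and Protein Length fields of this record.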
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00958049 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAGA ACATTACCGC CGCTCCTGCC GACCCGATTC TGGGCCTGGC CGATCTGTTT CGTGCCGATG AACGTCCCGG CAAAATTAAC CTCGGGATTG GTGTCTATAA AGATGAGACG GGCAAAACCC CGGTACTGAC CAGCGTGAAA AAGGCTGAAC AGTATCTGCT CGAAAATGAA ACCACCAAAA ATTACCTCGG CATTGACGGC ATCCCTGAAT TTGGTCGCTG CACTCAGGAA CTGCTGTTTG GTAAAGGTAG CGCCCTGATC AATGACAAAC GTGCTCGCAC GGCACAGACT CCGGGGGGCA CTGGCGCACT ACGCGTGGCT GCCGATTTCC TGGCAAAAAA TACCAGCGTT AAGCGTGTGT GGGTGAGCAA CCCAAGCTGG CCGAACCATA AGAGCGTCTT TAACTCTGCA GGTCTGGAAG TTCGTGAATA CGCTTATTAT GATGCGGAAA ATCACACTCT TGACTTCGAT GCACTGATTA ACAGCCTGAA TGAAGCTCAG GCTGGCGACG TAGTGCTGTT CCATGGCTGC TGCCATAACC CAACCGGTAT CGACCCTACG CTGGAACAAT GGCAAACACT GGCACAACTC TCCGTTGAGA AAGGCTGGTT ACCGCTGTTT GACTTCGCTT ACCAGGGTTT TGCCCGTGGT CTGGAAGAAG ATGCTGAAGG ACTGCGCGCT TTCGCGGCTA TGCATAAAGA GCTGATTGTT GCCAGTTCCT ACTCTAAAAA CTTTGGCCTG TACAACGAGC GTGTTGGCGC TTGTACTCTG GTTGCTGCCG ACAGTGAAAC CGTTGATCGC GCATTCAGCC AAATGAAAGC GGCGATTCGC GCTAACTACT CTAACCCACC AGCACACGGC GCTTCTGTTG TTGCCACCAT CCTGAGCAAC GATGCGTTAC GTGCGATTTG GGAACAAGAG CTGACTGATA TGCGCCAGCG TATTCAGCGT ATGCGTCAGT TGTTCGTCAA TACGCTGCAG GAAAAAGGCG CAAACCGCGA CTTCAGCTTT ATCATCAAAC AGAACGGCAT GTTCTCCTTC AGTGGCCTGA CAAAAGAACA AGTGCTGCGT CTGCGCGAAG AGTTTGGCGT ATATGCGGTT GCTTCTGGTC GCGTAAATGT GGCCGGGATG ACACCAGATA ACATGGCTCC GCTGTGCGAA GCGATTGTGG CAGTGCTGTA A
|
Protein sequence | MFENITAAPA DPILGLADLF RADERPGKIN LGIGVYKDET GKTPVLTSVK KAEQYLLENE TTKNYLGIDG IPEFGRCTQE LLFGKGSALI NDKRARTAQT PGGTGALRVA ADFLAKNTSV KRVWVSNPSW PNHKSVFNSA GLEVREYAYY DAENHTLDFD ALINSLNEAQ AGDVVLFHGC CHNPTGIDPT LEQWQTLAQL SVEKGWLPLF DFAYQGFARG LEEDAEGLRA FAAMHKELIV ASSYSKNFGL YNERVGACTL VAADSETVDR AFSQMKAAIR ANYSNPPAHG ASVVATILSN DALRAIWEQE LTDMRQRIQR MRQLFVNTLQ EKGANRDFSF IIKQNGMFSF SGLTKEQVLR LREEFGVYAV ASGRVNVAGM TPDNMAPLCE AIVAVL
|
| |
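The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any computation. The sketch below shows one way to do that and to derive simple properties such as GC fraction and the reverse complement. All helper names are hypothetical. The record lists the strand as "-", while the displayed gene sequence starts with ATG and ends with TAA; whether a reverse complement is needed to map it back to the replicon is an assumption the reader should verify.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the whitespace used to display a sequence in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the result."""
    return seq.translate(_COMPLEMENT)[::-1]


# First three blocks of the gene sequence above, used as a short demo input.
demo = clean_sequence("ATGTTTGAGA ACATTACCGC CGCTCCTGCC")
print(len(demo))  # 30
print(round(gc_fraction(demo), 3))
print(reverse_complement(demo))
```

Running `gc_fraction` on the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.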