Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | B21_03375 |
Symbol | avtA |
ID | 8113663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 3599933 |
End bp | 3601186 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 644849548 |
Product | hypothetical protein |
Protein accession | YP_003001121 |
Protein GI | 251786817 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3977] Alanine-alpha-ketoisovalerate (or valine-pyruvate) aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATTCT CCCTTTTTGG TGACAAATTT ACCCGCCACT CCGGCATTAC GCTGTTGATG GAAGATCTGA ACGACGGTTT ACGCACGCCT GGCGCGATTA TGCTCGGCGG CGGTAATCCG GCGCAGATCC CGGAAATGCA GGACTACTTC CAGACGCTAC TGACCGACAT GCTGGAAAGT GGCAAAGCGA CTGATGCACT GTGTAACTAC GACGGTCCAC AGGGGAAAAC GGAGCTACTC ACACTGCTTG CCGGAATGCT GCGCGAGAAG TTGGGTTGGG ATATCGAACC ACAGAATATT GCACTAACAA ACGGCAGCCA GAGCGCGTTT TTCTACTTAT TTAACCTGTT TGCCGGACGC CGTGCCGATG GTCGGGTCAA AAAAGTGCTG TTCCCGCTTG CACCGGAATA CATTGGCTAT GCTGACGCCG GACTGGAAGA AGATCTGTTT GTCTCTGCGC GTCCGAATAT TGAACTGCTG CCGGAAGGCC AGTTTAAATA CCACGTCGAT TTTGAGCATC TGCATATTGG CGAAGAAACC GGGATGATTT GCGTCTCCCG GCCGACGAAT CCAACAGGCA ATGTGATTAC TGACGAAGAG TTGCTGAAGC TTGACGCGCT GGCGAATCAA CACGGCATTC CGCTGGTGAT TGATAACGCT TATGGCGTCC CGTTCCCGGG TATCATCTTC AGTGAAGCGC GCCCGCTATG GAATCCGAAT ATCGTGCTGT GCATGAGTCT TTCCAAGCTG GGTCTACCTG GCTCCCGCTG CGGCATTATC ATCGCCAATG AAAAAATCAT CACCGCCATC ACCAATATGA ACGGCATTAT CAGCCTGGCA CCTGGCGGTA TTGGTCCGGC GATGATGTGT GAAATGATTA AGCGTAACGA TCTGCTGCGC CTGTCTGAAA CAGTCATCAA ACCGTTTTAC TACCAGCGTG TTCAGGAAAC TATCGCCATC ATTCGCCGCT ATTTACCGGA AAATCGCTGC CTGATTCATA AACCGGAAGG AGCCATTTTC CTCTGGCTAT GGTTTAAGGA TTTGCCCATT ACGACCGAGC AGCTCTATCA GCGCCTGAAA GCACGCGGCG TGCTGATGGT GCCGGGGCAC AACTTCTTCC CAGGGCTGGA TAAACCGTGG CCGCATACGC ATCAATGTAT GCGCATGAAC TACGTACCAG AGCCGGAGAA AATTGAGGCG GGGGTGAAGA TTCTGGCGGA AGAGATAGAA AGAGCCTGGG CTGAAAGTCA CTAA
|
Protein sequence | MTFSLFGDKF TRHSGITLLM EDLNDGLRTP GAIMLGGGNP AQIPEMQDYF QTLLTDMLES GKATDALCNY DGPQGKTELL TLLAGMLREK LGWDIEPQNI ALTNGSQSAF FYLFNLFAGR RADGRVKKVL FPLAPEYIGY ADAGLEEDLF VSARPNIELL PEGQFKYHVD FEHLHIGEET GMICVSRPTN PTGNVITDEE LLKLDALANQ HGIPLVIDNA YGVPFPGIIF SEARPLWNPN IVLCMSLSKL GLPGSRCGII IANEKIITAI TNMNGIISLA PGGIGPAMMC EMIKRNDLLR LSETVIKPFY YQRVQETIAI IRRYLPENRC LIHKPEGAIF LWLWFKDLPI TTEQLYQRLK ARGVLMVPGH NFFPGLDKPW PHTHQCMRMN YVPEPEKIEA GVKILAEEIE RAWAESH
|
| |