Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_46640 |
Symbol | trpE |
ID | 7763527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4733112 |
End bp | 4734590 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807508 |
Product | anthranilate synthase component I |
Protein accession | YP_002801744 |
Protein GI | 226946671 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGCG AAGAATTCCT GCGCCTGGCC GCCAAAGGCT ACAACCGCAT TCCGCTCGCC TGCGAAACCC TCGCCGACTT CGACACCCCG CTGTCGATCT ACCTGAAACT CTCCGACGCG CCCAATTCCT ACCTGCTCGA ATCGGTGCAG GGCGGCGAGA AATGGGGCCG CTACTCGATC ATCGGCCTGC CGGCGCGCAC CGTGCTGCGC ATCCATGGCC AGCGGGTGAC GGTGAGCGTG GACGGCGAGG AGGTGGAGCG CCACGACTGC GAAGACCCGC TGGCCTTCGT CGAGCAGTTC AAGGCGCGCT ACCGGGTGCC GGACCTGCCG GGACTGCCGC GCTTCAACGG CGGCCTGGTC GGCTACTTCG GTTACGACAG CGTGCGCTAC GTGGAGAAGA AGCTCGCCCG TTGCCCGAAC CCGGACCCCC TGGGCACCCC GGACATCCTG CTGATGGTTT CCGACGCCGT GGTGGTGTTC GACAACCTGG CCGGCAAGAT GCACCTGATC GTCCTCGCCG ACCCGGCCGA GGCGGACGCC TTCGAGCGGG GCCGGGCGCA CCTGCGCGCG CTATCGGAGG AACTGCGCCA GCCGCTGGCG CCGCGCCAGG GCATCGATCT CGGCGCGCCG GCTGGCGCGG AGCCTGCGTT CCGCTCCAGC TTCGGCCGCG AGGACTTCGA GCGGACGGTG GCGCGCATCA AGGACTACAT CCTCGCCGGC GACTGCATGC AGGTGGTGAT CTCCCAGCGC ATGTCGATCC CCTTCGCCGC CGCGCCCATC GACCTGTACC GGGCGCTGCG CTGCTTCAAC CCGACCCCCT ACATGTACTT CTTCGACTTT GGCGACTTCC ACGTGGTCGG CAGCTCGCCG GAGGTGCTGG TGCGCGTCGA GGACGGTCTG GTCACGGTGC GCCCGATCGC CGGCACCCGC CCGCGCGGCG CCAGCGAGGA GGCCGACCTG GCGCTGGAGC GCGACCTGCT CTCCGACGCC AAGGAGCTGG CCGAGCACCT GATGCTGATC GACCTCGGCC GCAACGACGT CGGCCGGGTC GCCGATACCG GCTCGGTGAA GCTCACCGAG AAGATGGTCA TCGAGCGCTA TTCCAACGTC ATGCACATCG TCTCCAACGT CACCGGCCAT CTGCGCCAGG GGCTGACGGC GATGGACGCG CTGCGCGCCA TCCTGCCGGC CGGCACCCTC TCCGGCGCGC CCAAGGTCCG CGCCATGGAG ATCATCGACG AACTGGAGCC GGTCAAGCGC GGCGTCTACG GCGGCGCCGT CGGCTACCTG GCGTGGAACG GCAACATGGA TACCGCGATC GCCATCCGCA CCGCGGTGAT CAAGGACGGC GAACTGCACG TCCAGGCCGG CGCCGGCATC GTCGCCGATT CGGTGCCGGC GCTGGAGTGG GAGGAAACCC TGAACAAGCG CCGCGCCATG TTCCGCGCCG TGGCCCTGGC CGAGCAGGGC GGGACCTGA
|
Protein sequence | MIREEFLRLA AKGYNRIPLA CETLADFDTP LSIYLKLSDA PNSYLLESVQ GGEKWGRYSI IGLPARTVLR IHGQRVTVSV DGEEVERHDC EDPLAFVEQF KARYRVPDLP GLPRFNGGLV GYFGYDSVRY VEKKLARCPN PDPLGTPDIL LMVSDAVVVF DNLAGKMHLI VLADPAEADA FERGRAHLRA LSEELRQPLA PRQGIDLGAP AGAEPAFRSS FGREDFERTV ARIKDYILAG DCMQVVISQR MSIPFAAAPI DLYRALRCFN PTPYMYFFDF GDFHVVGSSP EVLVRVEDGL VTVRPIAGTR PRGASEEADL ALERDLLSDA KELAEHLMLI DLGRNDVGRV ADTGSVKLTE KMVIERYSNV MHIVSNVTGH LRQGLTAMDA LRAILPAGTL SGAPKVRAME IIDELEPVKR GVYGGAVGYL AWNGNMDTAI AIRTAVIKDG ELHVQAGAGI VADSVPALEW EETLNKRRAM FRAVALAEQG GT
|
| |