Gene Information |
Locus tag | Avi_3259 |
Symbol | trpEG |
ID | 7387432 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Agrobacterium vitis S4 |
Kingdom | Bacteria |
Replicon accession | NC_011989 |
Strand | + |
Start bp | 2694914 |
End bp | 2697103 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643652162 |
Product | anthranilate synthase |
Protein accession | YP_002550346 |
Protein GI | 222149389 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I [COG0512] Anthranilate/para-aminobenzoate synthases component II |
TIGRFAM ID | [TIGR00566] glutamine amidotransferase of anthranilate synthase or aminodeoxychorismate synthase [TIGR01815] anthranilate synthase, alpha proteobacterial clade |
|
|
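The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in Protein Length. The function names are hypothetical and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the Gene Information table.
length_bp = gene_length_bp(2694914, 2697103)
print(length_bp)                              # 2190
print(expected_protein_length_aa(length_bp))  # 729
```

Both results agree with the Gene Length (2190 bp) and Protein Length (729 aa) rows.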
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACCA TTATTCGCGA CGACAATAGC GATGTCTACC AGACCCGGGG CGGGATTACG GTCACGCGCC AGCGTCGTGC CACACCCTAT GCCGATGCAG TGTCCAGCTA TGTCGAAAAG CTGGATGAGC GGCGCGGCGC GGTGTTTTCC TCCAACTACG AATATCCGGG CCGCTATACC CGCTGGGATA CGGCCATTGT CGATCCGCCG CTGGGCATTT CCAGCTTTGG CCGCGCCGTG TGGATCGAGG CCTATAATGG CCGGGGCGAA GTGCTGCTGT CGCTGATCGC CGAAAAGCTG AAGACCGTGC CGGAACTGGT GCTGGGCGCG CTGACCACGC AGCGGCTGGA CTTGACCGTG AAGACCCCCG ACCGGGTATT TACCGAGGAG GAGCGCTCCA AGGCACCTAC GGTATTTACC GTGCTACGCG CCATTACCGA TCTTTTCTAT TCGTCGGCTG ACAGCAGCAT CGGCCTGTTC GGGGCTTTCG GCTATGATCT GGCCTTCCAG TTCGATGCCA TCGATCTGAA ACTGAAGCGG CCGGACGATC AGCGCGACAT GGTGCTGTTC CTGCCCGATG AAATTCTGGT GGTTGACCAT TATTCCGCCA AGGCCTGGAT CGACCGCTAT GATTTCGAAA AGGACGGCGT GACCACGGCG GGCAAGGCCG AGACGATTGC GCCGGAGCCT TTCCGCCACA CCGATACCAT TCCGCCGCGC GGCGATCACC GGCCCGGCGA ATTTGCCGAA CTGGTGGTGA AGGCCAAGGA AAGCTTCCGC AAGGGCGACC TGTTCGAAGT GGTTCCCGGC CAGAAATTCA TGGAACGGTG CGATAGCAAG CCCTCGGATA TTTCCAAGCG GTTGAAGGAC ATCAACCCGT CGCCCTATTC CTTCTTCATC AATCTCGGCA ATCAGGAATA TCTGGTCGGC GCTTCGCCGG AAATGTTCGT GCGCGTCAAC GGACGGCGCA TCGAGACCTG CCCGATTTCT GGCACGATCA AACGCGGCGA CGACCCGATT GCCGATAGCG AACAGATCCT CAAGCTGTTG AATTCCAAGA AGGACGAATC CGAACTGACC ATGTGTTCGG ATGTTGACCG CAACGACAAA AGCCGGGTCT GCGAGCCGGG CTCCGTCAAG GTGATCGGCC GCCGCCAGAT TGAAATGTAT TCCCGGCTGA TCCACACGGT GGACCATATT GAGGGCCGTC TGCGCGACGA TATGGATGCG TTCGACGGCT TCCTCTCCCA TGCCTGGGCC GTCACGGTTA CCGGCGCGCC AAAACTCTGG GCGATGCGTT TTGTCGAAGC CCATGAGAAA AGCGCGCGCG CCTGGTATGG CGGCGCGGTC GGCATGGTCG GCTTTAATGG CGACATGAAT ACCGGGCTGA CGCTGCGCAC TGTGCGCATC AAGGACGGTA TTGCCGAAGT GCGGGCCGGT GCGACGCTGC TGAATGACAG CGATCCGCAG GAAGAAGAAG CTGAAACCGA ATTGAAGGCA TCGGCCATGA TTGCTGCAAT CCGCGACGCC AAATCCGGCC AAAATGCCAA GGCCCAGCGC GGCGTGGCAG CTGTCGGCCA TGGCGTCAAA ATCCTGCTGG TCGATCATGA GGACTCTTTC GTCCACACGC TGGCCAATTA CTTCCGCCAG ACCGGCGCTA CCGTCAGCAC GGTGCGCAGC CCGGTGCCGG AAGAGGTGTT TGATCGCCTG AACCCCGATC TCGTCGTCTT GTCGCCCGGA CCGGGTTCTC CGTCGGATTT TGATTGCACG GCAACAATCA AGAAAGCGCG CGACCGCAAC 
CTGCCGATCT TCGGCGTCTG CCTTGGTCTC CAGGCGCTGA CGGAAGCCTA TGGCGGCGTG CTGCGCCAGT TGGATGTGCC GATGCATGGC AAGCCATCGC GCATCCGCGT GCTGGAACCC GGCATCGTGT TTTCCGGCCT GAACAAGGAA GTCACGGTCG GGCGCTATCA CTCGATCCAT GCCGATCCGG CCAGCTTGCC CAAGGATTTC ATCATCACCG CCGAAAGCGA GGATGGGACG ATCATGGGTA TCGAACATGC CAAGGAACCG GTCGCTGCTG TGCAGTTCCA CCCGGAATCG ATCATGACGC TCGGCAACGA TGCCGGGATG CGAATGATCG AGAATGTGGT GGAAAAGCTG GCCAAGCGGG CGAAGGTGAA GGCGGCGTGA
|
Protein sequence | MATIIRDDNS DVYQTRGGIT VTRQRRATPY ADAVSSYVEK LDERRGAVFS SNYEYPGRYT RWDTAIVDPP LGISSFGRAV WIEAYNGRGE VLLSLIAEKL KTVPELVLGA LTTQRLDLTV KTPDRVFTEE ERSKAPTVFT VLRAITDLFY SSADSSIGLF GAFGYDLAFQ FDAIDLKLKR PDDQRDMVLF LPDEILVVDH YSAKAWIDRY DFEKDGVTTA GKAETIAPEP FRHTDTIPPR GDHRPGEFAE LVVKAKESFR KGDLFEVVPG QKFMERCDSK PSDISKRLKD INPSPYSFFI NLGNQEYLVG ASPEMFVRVN GRRIETCPIS GTIKRGDDPI ADSEQILKLL NSKKDESELT MCSDVDRNDK SRVCEPGSVK VIGRRQIEMY SRLIHTVDHI EGRLRDDMDA FDGFLSHAWA VTVTGAPKLW AMRFVEAHEK SARAWYGGAV GMVGFNGDMN TGLTLRTVRI KDGIAEVRAG ATLLNDSDPQ EEEAETELKA SAMIAAIRDA KSGQNAKAQR GVAAVGHGVK ILLVDHEDSF VHTLANYFRQ TGATVSTVRS PVPEEVFDRL NPDLVVLSPG PGSPSDFDCT ATIKKARDRN LPIFGVCLGL QALTEAYGGV LRQLDVPMHG KPSRIRVLEP GIVFSGLNKE VTVGRYHSIH ADPASLPKDF IITAESEDGT IMGIEHAKEP VAAVQFHPES IMTLGNDAGM RMIENVVEKL AKRAKVKAA
|
| |
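The gene and protein sequences can be cross-checked against each other and against the GC content row. The following is a minimal sketch in plain Python, not the method used by the source database. It assumes the sequence is pasted as shown, with spaces between 10-base blocks, and it builds the codon table from the standard compact layout. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, which does not matter here because the gene starts with ATG. The helper names are hypothetical.

```python
# Codon table in the usual compact layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 12 bases of the gene sequence in this record.
first_30 = "ATGGCAACCA TTATTCGCGA CGACAATAGC"
last_12 = "AAGGCGGCGTGA"
print(translate(first_30))  # MATIIRDDNS
print(translate(last_12))   # KAA
```

The two outputs match the first ten and last three residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content row.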