Gene Information |
Locus tag | BBta_2470 |
Symbol | aroC |
ID | 5152219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2556436 |
End bp | 2557521 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640557386 |
Product | chorismate synthase |
Protein accession | YP_001238541 |
Protein GI | 148253956 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
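The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon which is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2556436
end_bp = 2557521
gene_length_bp = 1086
protein_length_aa = 361


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value with the value listed in the record.
length_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(length_ok, protein_ok)  # True True
```

Under these assumptions the listed values agree: 2557521 - 2556436 + 1 = 1086 bp, and 1086 / 3 - 1 = 361 aa.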
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTTCA ACACCTTCGG CCACATGTTC CGCGTCACCA CCTTCGGCGA GAGCCATGGC GTGGCCATCG GCTGCGTGGT CGACGGCTGT CCGCCGCGCA TTCCGCTGGA GCCTGCGGAG ATCCAGGTCG ATCTCGACCG CCGCCGGCCC GGCCAGTCGC GCTTCACCAC CCAGCGCCAG GAGCCGGACC AGGTGAAGAT CCTGTCCGGC GTGATGCCCG ATCCCGACAC CGGCGCGCAG GTCACCACGG GCACGCCGAT TGCGCTCTTG ATCGAGAACA CCGACCAGCG CTCGAAGGAC TATTCCGAGA TCAAGGACAA GTTCCGCCCG GGCCATGCCG ACTTCACCTA TGAGGCCAAA TACGGCCTGC GCGATTATCG CGGTGGCGGC CGCTCCTCGG CGCGCGAGAC CGCGACGCGC GTCGCGGCGG GCGCGATCGC GCGCAAGATC CTGGCGGGCG TCAAGGTGCG CGGCGCGCTG GTGCAGATGG GGCCGCACAA GATCGACCGC GCCAAGTGGG ACTGGGACGA GATCGCGCGC AACCCGTTCT TCTGTCCCGA CAAGGACAAA GCGGCGTTCT TCGAGGACTA TCTCGACGGC ATCCGCAAAT CCGGCTCCTC GATCGGCGCG GTGCTGGAGA TCACCGCCGA AGGCGTGCCG GCGGGACTCG GCGCGCCGCT CTACGGCAAG CTCGACGCGG ATCTGGCGGC GGCGATGATG AGTATCAACG CCGTCAAGGG CGTCGAGATC GGCGCCGGTT TTGCTGCGGC CGAACTGACC GGCGAGGAGA ACGCCGACGA GATGCGCTCG GCCAATGACG GCACGCGCTT CCTCTCCAAC AATGCCGGCG GCATTCTCGG CGGCATCGCC ACCGGGCAGC CGATCGTGGT GCGCTTCGCC GTCAAGCCGA CCTCGTCGAT CCTGACCCCG CGCCAGACCG TCGATCGCGC CGGCCACGAG ACCGAGATCC TGACCAAGGG CCGTCACGAC CCCTGCGTCG GCATCCGCGC CGTGCCGGTC GGCGAAGCGA TGATGGCCTG CGTGCTCGCC GACCATCTGC TGCGCCATCG CGGCCAAGTC GGCTGA
|
Protein sequence | MSFNTFGHMF RVTTFGESHG VAIGCVVDGC PPRIPLEPAE IQVDLDRRRP GQSRFTTQRQ EPDQVKILSG VMPDPDTGAQ VTTGTPIALL IENTDQRSKD YSEIKDKFRP GHADFTYEAK YGLRDYRGGG RSSARETATR VAAGAIARKI LAGVKVRGAL VQMGPHKIDR AKWDWDEIAR NPFFCPDKDK AAFFEDYLDG IRKSGSSIGA VLEITAEGVP AGLGAPLYGK LDADLAAAMM SINAVKGVEI GAGFAAAELT GEENADEMRS ANDGTRFLSN NAGGILGGIA TGQPIVVRFA VKPTSSILTP RQTVDRAGHE TEILTKGRHD PCVGIRAVPV GEAMMACVLA DHLLRHRGQV G
|
| |
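The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the database's own method. It assumes the sequence is pasted with the whitespace grouping shown above, and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard code. Alternative start codons (which table 11 permits) and ambiguous bases are not handled. The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code; it differs in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
head = "ATGTCCTTCA ACACCTTCGG CCACATGTTC"
print(translate(head))  # MSFNTFGHMF
```

The first ten residues produced from the start of the gene sequence match the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.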