Gene Information |
Locus tag | BURPS1710b_1867 |
Symbol | aroC |
ID | 3690536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1710b |
Kingdom | Bacteria |
Replicon accession | NC_007434 |
Strand | + |
Start bp | 2039541 |
End bp | 2040650 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637728323 |
Product | chorismate synthase |
Protein accession | YP_333268 |
Protein GI | 162210097 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
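The coordinate and length fields in this table are related by simple arithmetic. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon, the following Python reproduces the listed gene and protein lengths. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2039541, 2040650)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both values agree with the Gene Length and Protein Length fields of the record.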
Plasmid Coverage information |
Number of covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0797858 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Number of covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCGGCA ACACCCTCGG CACGCTTTTC ACTGTCACGA CCTTCGGCGA ATCGCACGGC CCCGCGATCG GCTGCGTGAT CGACGGCTGC CCGCCGGGCA TGGCGCTCAC GGAAGCCGAC GTCCAGCTCG AGCTCGACCG CCGCAAGCCC GGCACGTCGC GCCACGTCAC GCAGCGTCAG GAGCCCGACC AGGTCGAGAT CCTGTCCGGC GTGTTCGAGG GCGTGACGAC CGGCGCGCCG ATCGCGCTCC TGATCCGCAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCGCCGAG ACGTTCCGCC CGGGCCATGC CGATTACACC TACTGGCAAA AGTACGGCGT GCGCGACTAT CGCGGCGGCG GCCGCTCGTC CGCGCGGCTG ACGGCGCCCG TCGTCGGCGC CGGCGCGATC GCGAAGAAGT GGCTGCGCGA GCGCTTCGGC GTCGAGGTGC GCGGCTACAT GAGCGCGCTC GGCGAAATCG AGATCCCGTT CGTCGACTGG TCGCACGTGC GCGAGAACCC GTTCTTCGCG CCGAACGCCG ACATCGTGCC GCAACTCGAG GGCTACATGG ACGCGCTGCG CAAGGACGGC GATTCGATCG GCGCGCGCAT CGATGTCGTC GCGTCGGGCG TGCCGGTCGG CTGGGGCGAG CCGCTGTTCG ACCGGCTCGA CGCCGACATC GCGCACGCGA TGATGGGGAT CAACGCGGTG AAGGGCGTCG AGATCGGCGC GGGTTTCGCG AGCGTCGCGC AGCGCGGTTC GGTGCACGGC GACGAGCTGA CGCCGGACGG CTTCGTCGGC AATCACGCGG GCGGCGTGCT CGGCGGCATC TCGACGGGGC AGGACATCAC GGTGTCGATC GCGATCAAGC CGACGTCGAG CATTCGCACG CCGCGCCGCT CGATCACGCG GGCGGGCGAA CCCGCCGTCG TCGAGACGTT CGGCCGCCAC GACCCGTGCG TCGGGATTCG CGCGACGCCG ATCGCCGAAT CGATGCTCGC GCTCGTGCTG ATCGATCACG CGCTGCGGCA CCGCGCGCAG TGCGGCGACG TGTCGAGCGC GACGCCGAGG ATCGCCGCGC GCGCGCCGGA CGCGCAATGA
Protein sequence | MSGNTLGTLF TVTTFGESHG PAIGCVIDGC PPGMALTEAD VQLELDRRKP GTSRHVTQRQ EPDQVEILSG VFEGVTTGAP IALLIRNTDQ RSKDYGNIAE TFRPGHADYT YWQKYGVRDY RGGGRSSARL TAPVVGAGAI AKKWLRERFG VEVRGYMSAL GEIEIPFVDW SHVRENPFFA PNADIVPQLE GYMDALRKDG DSIGARIDVV ASGVPVGWGE PLFDRLDADI AHAMMGINAV KGVEIGAGFA SVAQRGSVHG DELTPDGFVG NHAGGVLGGI STGQDITVSI AIKPTSSIRT PRRSITRAGE PAVVETFGRH DPCVGIRATP IAESMLALVL IDHALRHRAQ CGDVSSATPR IAARAPDAQ
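The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is one hedged way to do that in Python and to compute GC content on the result. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its output is not the whole-gene GC content.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence above, used as a short demo only.
demo = clean_sequence("ATGTCCGGCA ACACCCTCGG CACGCTTTTC")
print(len(demo))         # 30
print(gc_percent(demo))  # 60.0
```

Applying the same two functions to the full gene sequence would give a value to compare with the GC content field of the record.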