Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | COXBURSA331_A1076 |
Symbol | aroC |
ID | 5794252 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Coxiella burnetii RSA 331 |
Kingdom | Bacteria |
Replicon accession | NC_010117 |
Strand | - |
Start bp | 953221 |
End bp | 954279 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641330531 |
Product | chorismate synthase |
Protein accession | YP_001596833 |
Protein GI | 161830250 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 0.306004 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAA ATAGCTTTGG AAAATTATTT ACGGTTACAA CTGCCGGAGA AAGCCACGGC CCTGCGCTTG TTGCTATTGT GGACGGGTGT CCAGCGGGAT TGTCTTTAAC AGAGGCAGAT ATACAGCCTG ATATTGATCG GCGAAAAACA GGGAAATCGC GCTTCACTTC TCAACGACGC GAATCCGATC AAGTAAAAAT CCTATCCGGC GTTTTTGAGG GCGTTACAAC GGGTACACCC ATCGCATTAC TGATCGAGAA CGCCGACCAA CGGCCCCGTG ATTATTCCCA GATAAAAGAC CTATTTCGCC CAGGGCACGG AGATTATACT TACTTTAAAA AATACGGTTT TCGGGACTAC CGCGGGGGAG GGCGGGCATC CGCGCGCGAA ACGGTAATGC GTGTAGCAGC TGGAGCGATT GCCAAAAAGT ATTTGCGAGA AAAGGTCAAT TTAACCATTC AAGGCTACAC CGCAGCGGTA GGTGCTATTC GCGCGGAACG TATCGATTTA TCCGCTGTAG AAAAAAATCC CTTTTTCTTT CCCGACGAAG TCCAAATCCC TCATTTGGAA CAATTAATCA TGAAATTGCG TCGTGATGGG GATTCGATTG GCGCTCGTCT TAATGTGATT GCTAAAGGCG TTCCTTGTGG TTTGGGTGAG CCTGTTTTTG ATAAATTGGA CGCGGATATT GCTTCTGCCA TGATGGGCAT CAACGCCGTT AAAGGTGTTG AAATTGGCGA TGGTTTTGCC GTTGTCGAGC AAAAAGGCTC GTTTCATCGA GATGAACTGA GTAAGAAAGG ATTTCTTTCC AATCACGCAG GGGGTACCTT AGCAGGTATT TCATCGGGCC AAGATATCTT AGTGAGTCTT GCTTTTAAAC CGGCATCGAG TATCCGTATT CCGGGAAAAA CTTTGGATAT TAATGGCAAA GCAGTTGAAG TCGTCACTAC CGGACGTCAC GATCCTTGCG TGGGATTGCG CGCAGTTCCT ATTGCTGAGG CGATGTTGGC ATTAGTTTTA ATGGATCATT ATTTGCGGTA CAAGGCACAA CGGGGATAG
|
Protein sequence | MSGNSFGKLF TVTTAGESHG PALVAIVDGC PAGLSLTEAD IQPDIDRRKT GKSRFTSQRR ESDQVKILSG VFEGVTTGTP IALLIENADQ RPRDYSQIKD LFRPGHGDYT YFKKYGFRDY RGGGRASARE TVMRVAAGAI AKKYLREKVN LTIQGYTAAV GAIRAERIDL SAVEKNPFFF PDEVQIPHLE QLIMKLRRDG DSIGARLNVI AKGVPCGLGE PVFDKLDADI ASAMMGINAV KGVEIGDGFA VVEQKGSFHR DELSKKGFLS NHAGGTLAGI SSGQDILVSL AFKPASSIRI PGKTLDINGK AVEVVTTGRH DPCVGLRAVP IAEAMLALVL MDHYLRYKAQ RG
|
| |