Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pden_4332 |
Symbol | |
ID | 4582882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Paracoccus denitrificans PD1222 |
Kingdom | Bacteria |
Replicon accession | NC_008687 |
Strand | - |
Start bp | 1511803 |
End bp | 1512903 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639771639 |
Product | chorismate synthase |
Protein accession | YP_918092 |
Protein GI | 119387037 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.259213 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCA ACACCTTCGG CCGCGTCTTC ACCTTCACCA CCTGGGGCGA AAGCCACGGC CCGGCGCTTG GCGCGACGGT GGACGGCTGC CCGCCGGGCG TGGCGCTGGA CGAAGGCTGG ATCCAGCAGT TCCTGGACCG CCGCCGCCCC GGCAGCTCGA AATTCACCAC CCAGCGGCAG GAGCCGGACC GGGTGCGCAT CCTCTCGGGC GTTTTCGAGG GCAGGACCAC CGGCACGCCG ATCCAGCTGA TGATCGAGAA CACCGACCAG CGCAGCAAGG ATTACGGCGA GATCGCCCAG GCGTTTCGGC CCGGCCATGC CGACATCGCC TATCACCTGA AATACGGCAT TCGCGACTAT CGCGGCGGCG GGCGCTCCAG CGCGCGCGAG ACGGCGGCGC GGGTCGCGGC GGGGGGCGTC GCGCAGGCGG TGCTGCGCGA CCTGGTGCCG GGACTGAAGA TCGCCGGCTA CATGGTGCAG ATGGGCGATC TGCATCTGGA CCGCGCGAAT TTCGATCTGG CCGAGATCGG CAACAACCCG TTCTTCCTGC CCGATGCCGG TGCCGTGCCG GCATGGGAGG ACTATCTGAA CGCCATCCGC AAGGCGCAGG ACAGCGTCGG GGCCGCCGTC GAGGTGCTGA TCCAGGGCTG CCCGCCCGGC CTTGGCGCCC CGGTCTATGC CAAGCTGGAT ACCGACCTGG CCGCCGCGAT GATGTCGATC AACGCGGTCA AGGGCGTCGA GATCGGCGAG GGCATGGCGG CGGCCGCCCT GACCGGGACC GCGAATGCCG ACGAGATCCG CATGGGCAAC GAGGGGCCGC GCTTCCTGTC CAACCATGCC GGCGGGATCC TGGGCGGCAT CTCGACCGGG CAGGACATCG TGGTCCGCTT CGCGGTCAAG CCGACCAGCT CGATCCTGAC CCCGCGCCGG ACGATCAACC GGAAGGGCGA GGAGATCGAG CTGATCACCA AGGGCCGCCA CGATCCCTGC GTCGGCATCC GCGCCGTGCC CATCGCCGAG GCCATGGCGG CCTGCGTGGT CCTGGACCAC CTGCTGCTGG ACCGGGCGCA GACCGGCGGG CGGCGCGGCA CCATCGGCTA G
|
Protein sequence | MSFNTFGRVF TFTTWGESHG PALGATVDGC PPGVALDEGW IQQFLDRRRP GSSKFTTQRQ EPDRVRILSG VFEGRTTGTP IQLMIENTDQ RSKDYGEIAQ AFRPGHADIA YHLKYGIRDY RGGGRSSARE TAARVAAGGV AQAVLRDLVP GLKIAGYMVQ MGDLHLDRAN FDLAEIGNNP FFLPDAGAVP AWEDYLNAIR KAQDSVGAAV EVLIQGCPPG LGAPVYAKLD TDLAAAMMSI NAVKGVEIGE GMAAAALTGT ANADEIRMGN EGPRFLSNHA GGILGGISTG QDIVVRFAVK PTSSILTPRR TINRKGEEIE LITKGRHDPC VGIRAVPIAE AMAACVVLDH LLLDRAQTGG RRGTIG
|
| |