Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0785 |
Symbol | |
ID | 5456810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 850882 |
End bp | 851979 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640876354 |
Product | chorismate synthase |
Protein accession | YP_001412065 |
Protein GI | 154251241 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.229169 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCACA ATACGTTCGG CCATCTGTTC CGCGTCACGA CCTGGGGCGA AAGCCACGGG CCGGCGCTCG GCTGCGTTGT GGACGGTGCG CCGCCGCGGC TGCCTTTGAA GGCCGAAGAT ATCCAGCAAT GGCTCGACCG GCGCAAACCC GGCCAGTCGC GGTTCACCAC GCAGCGGCGC GAGCCCGATG CGGTGAAAAT TCTCTCGGGC ACCTTCGTCG AAGACGGCAT AGAGATGACG ACGGGCACGC CGATCTCGCT GATGATCGAG AATGTCGATC AGCGGTCGAA GGACTATGGC GATATCGTCG AGAAGTTCAG ACCGGGCCAT GCCGATCTCA CCTATTTCCT GAAATATGGC ATTCGCGATT ATCGCGGGGG CGGCCGCTCT TCGGCGCGTG AAACGGCGGC CCGCGTGGCT GCCGGCGCGG TGGCGCGGGC GATGTTGCCG GAGATGATGA TCCGGGGTGC GCTCGTGCAG ATGGGGCCGC ACAAGATCGA CCGCGCCAAC TGGGACTGGA ACGAGGTGGG AAACAACCCC TTCTGGTGCC CGGACGCAAA GGCAGCGGCG GAATGGGAAA TCTATCTCGA TAGCGTCCGG AAAGCCGGTT CGTCCTGCGG TGCCGTCATC GAGATTGTAG CCAGCGGCGT ACCCGCCGGT CTCGGCTCAC CTATCTACGG AAAGCTCGAT GCGGAACTTG CAAGCGCGTT GATGAGCATA AACGCGGTGA AGGGTGTCGA GATCGGAGAT GGCTTCGGCG CTGCCGCTCT CTCCGGCGAA GAAAATGCCG ACGAGATGCA GTCGGGGCCG CATGGCATTG AGTTCAGCTC CAATCACGCG GGCGGCGTTC TTGGCGGCAT TTCCACGGGA CAAGATGTCG TCGCGCGCTT TGCCGTGAAA CCCACTTCCT CGATCCTCAG CCCTCGCAAA ACAGTCACCA AAGGCGGCGA CGACACGGAA ATCGTCACCA AGGGCCGCCA TGACCCATGC GTCGGCATCC GCGCCGTGCC TGTGGGCGAA GCGATGATGG CCTGCGTGCT TGCTGACCAG CTGCTCCGCC ATCGGGCGCA GATGGGCGGG AGCGGCCGCA ATGAGTGA
|
Protein sequence | MSHNTFGHLF RVTTWGESHG PALGCVVDGA PPRLPLKAED IQQWLDRRKP GQSRFTTQRR EPDAVKILSG TFVEDGIEMT TGTPISLMIE NVDQRSKDYG DIVEKFRPGH ADLTYFLKYG IRDYRGGGRS SARETAARVA AGAVARAMLP EMMIRGALVQ MGPHKIDRAN WDWNEVGNNP FWCPDAKAAA EWEIYLDSVR KAGSSCGAVI EIVASGVPAG LGSPIYGKLD AELASALMSI NAVKGVEIGD GFGAAALSGE ENADEMQSGP HGIEFSSNHA GGVLGGISTG QDVVARFAVK PTSSILSPRK TVTKGGDDTE IVTKGRHDPC VGIRAVPVGE AMMACVLADQ LLRHRAQMGG SGRNE
|
| |