Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2303 |
Symbol | |
ID | 3973744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2502835 |
End bp | 2504103 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637925411 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_532176 |
Protein GI | 90423806 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0207028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA GGCCGCCGAT TTCGCCAAAG AGCCCGCGTG CCGCGCTGGA GCCTGAGCAG GTGCCGCCGC CGCCGAAGCG GTCCGACCGC GCCCGCAATC CGCTGGTGGT GATCGGCAAT GCGATCATCA CCGTGGTGCT GGTGGTGATG ATCGGCACCG GCGGCATCTA CGTCTACGGC AAGCAGAAGA TCGAAGCTCC CGGTCCCTTG CAGGACGACA AGATCGTCAA CATTCCGCAG CGCGCCGGGA TGGGCGACAT CGGCGACATT CTGCAGCGCG AAGGCGTGAT CGATAATAAT CGTTGGGCCT TCATCGGCAG CGTGTTCGCG CTGAAGGCGC GCGCCGATCT GAAGCCCGGC GAATACTCGT TTCAGAAGAA CGCCAGCCTG CGCGACGTGA TCGCCACCAT CGTCGAAGGC AAGGTGGTGC AGCACGCGGT GACGATCCCC GAAGGTTTGA CCTCGGAACA GATCCTGGCG CGGCTGACGG AGAACGACAT CTTCTCCGGC AATGTGCGGG AGATGCCGCG CGAGGGCACC TTGCTGCCGG AGACCTACAA ATTCCCGCGC GGCACCACCC GCGAATCGGT GATCGTGCGG ATGCAGCAGG CGCAGAAGCG GGTGCTCGCC GAGATCTGGG AGCGCCGCAA TCCCGACGTG CCGGTGAAGA CCCCGGAGCA ATTGGTGACG CTGGCCTCCA TCGTCGAGAA GGAGACCGGC AAGGCCGACG AGCGCAGCCG GGTCGCCGCG GTCTACGTCA ATCGGCTGCG CCAGAAGATG AAGCTGCAGT CCGATCCGAC CATCATCTAC GGCCTGGTCG GCGGCAAGGG CACGCTGGGC CGCCCGATCA AGCGCAGCGA GATCATCCAG CCGTCGCCCT ACAACACCTA TGTGGTCGAG GGCCTGCCGC CGGGGCCGAT CGCCAATCCG GGCCGCGCCT CGCTGGAAGC CGCCGCCAAT CCGGCGCGCA CCCGCGATCT GTTCTTCGTC GCCGACGGCA GCGGCGGACA CAGCTTCACC GAGACCTACG ATCAGCACCA GAAGAACGTC GCCAGGCTGC GCACCCTGGA ACGCCAGATC CAGAACGACA CCGTCGAGCC GCCGGACGAC GCCGCGCCGG CGGCGTCGCC GGCCGCGCCG GATGCCAACG CCGCGGTGCC GGCCACCGCT GCGCCGTCGA AATCGCAGAA ACGGACGCGT GCCGCCGCGC CGGCGCGGCA AGGTGCTGCG CAGCCGGCGG CGCCGACCGC GCCGGCCAGC CAAGAATAG
|
Protein sequence | MSERPPISPK SPRAALEPEQ VPPPPKRSDR ARNPLVVIGN AIITVVLVVM IGTGGIYVYG KQKIEAPGPL QDDKIVNIPQ RAGMGDIGDI LQREGVIDNN RWAFIGSVFA LKARADLKPG EYSFQKNASL RDVIATIVEG KVVQHAVTIP EGLTSEQILA RLTENDIFSG NVREMPREGT LLPETYKFPR GTTRESVIVR MQQAQKRVLA EIWERRNPDV PVKTPEQLVT LASIVEKETG KADERSRVAA VYVNRLRQKM KLQSDPTIIY GLVGGKGTLG RPIKRSEIIQ PSPYNTYVVE GLPPGPIANP GRASLEAAAN PARTRDLFFV ADGSGGHSFT ETYDQHQKNV ARLRTLERQI QNDTVEPPDD AAPAASPAAP DANAAVPATA APSKSQKRTR AAAPARQGAA QPAAPTAPAS QE
|
| |