Gene Information |
Locus tag | RPB_2470 |
Symbol | |
ID | 3910259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2826269 |
End bp | 2827516 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637884369 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_486086 |
Protein GI | 86749590 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
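The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2826269
END_BP = 2827516
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions, the computed values agree with the Gene Length and Protein Length fields listed in the record.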
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.16603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA GGCCGCCGAT CTCGCCGAGA AGTCCGCGCG CGGCGCTGGA GCCGGAGCAA CTTCCGCCAC CGCCGAAACG ATCGGATCGT GCGCGCAATC CGCTGGTGAT TATTGGCAAC GCCATCATCA CGCTGGTGCT CGTACTGATG ATCGGCGCAG GCGGCATCTA TGTCTACGGC AAGCAGAAGA TCGAGGCTGC CGGGCCGTTG CAGGACGACA AGGTCGTCAA CATTCCGCAG CGCGCCGGGC TGGGCGACAT CGCCGAGATC CTGCAGCGCG AGGGCGTGAT CGAAAACAAT CGCTGGGTGT TCATCGGCAG TGTGCTGGCC CTGAAGGCGC GGGCGGATCT CAAGCCCGGC GAATATTCAT TTCAGAAGGA AGCCAGCCTT CGCGACGTGA TCGGCACCAT CGTCGAAGGC AAGGTGGTTC AGCACGCCGT GACGATCCCC GAAGGTTTGA CCTCCGAGCA GATCGTCGCG CGGCTCACGG ACAACAACAT CCTCACCGGA AGCCTCCGCG AAATTCCGCG AGAGGGAACC TTGCTGCCCG AGACCTACAA ATTTCCCCGG GGCACGCCGC GCGAGCAGGT CATCAACAGG ATGCAGCAGG CGCAGAAGCG TGTGCTGAGC GAAGTCTGGG AACGTCGCAA CCCGGAAATC CCGGTCAAAT CGCCGGAGCA ATTGGTGACG CTGGCGTCGA TCGTGGAAAA GGAGACCAGC AAGCCGGACG AGCGAAGCCG TGTCTCTGCC GTGTTCGTCA ACCGCCTACA GAAGAAGATG CGATTGCAGT CCGACCCGAC GATCATCTAC GGGCTCGTCG GCGGCAAGGG CACGCTCGGG CGACAGATCA AGCGCAGCGA AATTCAGCAG CCGTCCCCCT ACAACACCTA TGTCATCGAC GGTCTGCCGC CGGGCCCGAT CGCGAATCCC GGCCGGGCGT CGCTCGAAGC TGCGGCCAAT CCCGCGCGAA CGCGCGATCT GTACTTCGTC GCGGACGGCA GCGGCGGTCA CGCCTTCAGC GACAGCTATG ACCAGCATCT GAAGAACGTC GCCAAGCTGC GGGCGCTGGA GCGCCAGACC CAGAACGACA CGATCGAGCC CGCCGAAGAT ACCCCGCCGA CGGCGACGGT GGCTCCCGAC GCGAACGCAT CGGTCCCGTC GCCCGCGCCG GTGACGCGAC CATCGAAGAA CAGCGGCGCA TCCAAGAAGC GAAACGGCGC CGCTCCGGCT GCAGCCGGCC AGGATTAG
|
Protein sequence | MSERPPISPR SPRAALEPEQ LPPPPKRSDR ARNPLVIIGN AIITLVLVLM IGAGGIYVYG KQKIEAAGPL QDDKVVNIPQ RAGLGDIAEI LQREGVIENN RWVFIGSVLA LKARADLKPG EYSFQKEASL RDVIGTIVEG KVVQHAVTIP EGLTSEQIVA RLTDNNILTG SLREIPREGT LLPETYKFPR GTPREQVINR MQQAQKRVLS EVWERRNPEI PVKSPEQLVT LASIVEKETS KPDERSRVSA VFVNRLQKKM RLQSDPTIIY GLVGGKGTLG RQIKRSEIQQ PSPYNTYVID GLPPGPIANP GRASLEAAAN PARTRDLYFV ADGSGGHAFS DSYDQHLKNV AKLRALERQT QNDTIEPAED TPPTATVAPD ANASVPSPAP VTRPSKNSGA SKKRNGAAPA AAGQD
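The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not expected to match the 64% reported for the full gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-grouped sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First three ten-base blocks of the gene sequence in the record above.
EXCERPT = "ATGAGTGAAA GGCCGCCGAT CTCGCCGAGA"

if __name__ == "__main__":
    seq = clean_sequence(EXCERPT)
    print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete Gene sequence field to `clean_sequence` and then to `gc_fraction` would give a value to compare against the GC content field.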