Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0533 |
Symbol | aroB |
ID | 3970774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 575703 |
End bp | 576851 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637923649 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_530427 |
Protein GI | 90422057 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.630888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGCGC CGCTCAATCA CTCCGCCCCG ATCACCGTCG AGGTCGCGCT CGGCGACCGC GGCTATGACA TCGTGATCGG CCGCGACGTG CTGCGTTCGC TCGGGACCCG CATCGCCGCG TTGCGGCCCG GCGCCCGCAC GGCGATCGTC ACCGACCGTA ATGTCGCCAC CTGCTGGCTG GCGCAGACCC AGGCCGCGCT CGACGATGTC GGCATCGTTT CGATGCCGAT CGTGGTCGAG GGCGGCGAAG GCTCGAAGAG CTATGCCGGG CTGCAGCAGG TCTGCGAGGC GCTGATCGCC GCCAAGATCG AACGCAACGA TCTGGTGATC GCGCTCGGCG GCGGCGTGGT CGGCGATCTC GCTGGCTTTG CCGCCTCCAT CGTCCGCCGC GGCCTCGATT TCGTGCAGGT GCCGACCTCG CTGCTGGCGC AGGTGGATTC CTCGGTCGGC GGCAAGACCG GGATCAACTC GCCGCACGGC AAGAATCTGG TCGGCGCGTT TCATCAGCCG GTGCTGGTGA TCGCCGACAC CGCGGTGCTC GACACGCTGT CGCCGCGGCA GTTTCGCGCC GGCTATGCCG AAGTGGCGAA GTACGGCGCG CTCGGCGACG AGGCGTTCTT CGCCTGGCTC GAGGCCAACC ACGCCGAGAT CGTGCGCGGC GGCAGCGCCC GCGAACACGC CATCGCCACC TCCTGCCGCG CCAAGGCGGC GATCGTGGCG CGCGACGAGC GCGAGACCGG CGAGCGCGCG CTGCTCAATC TCGGCCACAC CTTCGGCCAC GCGCTGGAAG CCGCCACCGG CTTCTCCGAA CGGCTGTTCC ACGGCGAAGG CGTCGCCGTC GGCATGGTGC TGGCGGCGCA GTTTTCCGCG GAACGTGGCA TGTTGTCGAA CGACGCCGCG GCGCGGCTGT CGCATCACCT CGCCGAAGTG GGACTGCCGA CAAGGCTGCA GGACATCGCC GGTTTCGCGC AGGAGGGCCT GGCCGACGCC GACGCCTTGA TGGCGCTGAT GGCGCAGGAC AAGAAGGTCA AGCGCGGCCG GCTCACCTTC ATTCTGCTGG AAGCGATCGG CCGCGCGGTG ATCGCACACG ACGTCGAGCC GGAACCGGTT CGCGATTTTC TGGCGCGCAA GCTCGCGGAC AAGACTTGA
|
Protein sequence | MTAPLNHSAP ITVEVALGDR GYDIVIGRDV LRSLGTRIAA LRPGARTAIV TDRNVATCWL AQTQAALDDV GIVSMPIVVE GGEGSKSYAG LQQVCEALIA AKIERNDLVI ALGGGVVGDL AGFAASIVRR GLDFVQVPTS LLAQVDSSVG GKTGINSPHG KNLVGAFHQP VLVIADTAVL DTLSPRQFRA GYAEVAKYGA LGDEAFFAWL EANHAEIVRG GSAREHAIAT SCRAKAAIVA RDERETGERA LLNLGHTFGH ALEAATGFSE RLFHGEGVAV GMVLAAQFSA ERGMLSNDAA ARLSHHLAEV GLPTRLQDIA GFAQEGLADA DALMALMAQD KKVKRGRLTF ILLEAIGRAV IAHDVEPEPV RDFLARKLAD KT
|
| |