Gene Information |
Locus tag | Rru_A3741 |
Symbol | aroB |
ID | 3837198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4291370 |
End bp | 4292527 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637827866 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_428822 |
Protein GI | 83595070 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
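The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Both assumptions are consistent with the reported 1158 bp and 385 aa, but the record does not state them. The function names are hypothetical.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 4291370
END_BP = 4292527


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp)                  # 1158
    print(protein_length(n_bp))  # 385
```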
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCCAG ACACAGCCGC CGATGCCGCC GCCGACCCCG CCTTGCAGTC GTCCGTTCTG ACCGTTTCCC TGGGCGAGCG CAGCTATCCC ATTCATATCG GCCCGGGCTT GCTGGGCCGC GCCGGGGCTT TGATCGCCCC CCTCTTGCGC AAGCCCCGGG TTTTCGTCGT CACCGATGCC ACCGTCGCCG CGCTGCATCT CGATCCCTTG CTGGCCTCCC TCGGCGCGGC GGGCATTGCC CATGATCACG TCGTGCTGCC GGCCGGCGAG GCGACCAAAA GCTTTTCCCA GCTTGAAGAG CTGCTTGATC TTTTGCTGGC GGCGCGGTTC GAGCGCTCGA CCACCCTGCT CGCCCTCGGC GGCGGGGTGA TCGGCGATCT GGTCGGCTTC GCCGCCGCCA TCTTGCTGCG CGGCGTCGAT TTCATCCAGA TCCCCACCAC CTTGCTCGCC CAGGTCGACA GCAGCGTTGG CGGCAAGACC GGCATCAATA CGGCCTATGG CAAGAATCTG GTCGGCGCCT TCCACCAGCC GCGTCTGGTG CTGGCCGACA CCACCGTGCT GGATACCCTG CCCCGCCGCG AATTGCTGGC GGGCTATGGC GAGGTGGTCA AATACGGCGT CATCGATGAT CCCGCCTTCT TCGACTGGCT TGAAGAGCAT GGCTCCGCCC TGATCGCCGG TGACGGCGGG GCGCGCATCC ACGCCGTTCT GACCGCCTGC CGCGCCAAGG CCCGGGTGGT CGCCGAGGAC GAACGCGAAG GCGGACGCCG CGCCCTGCTC AACCTTGGCC ACACCTTCGG CCACGCGCTG GAGGCCGAAA CCGGCTTTGG TCCCACCTTG CTCCATGGCG AGGCGGTGGC CCTCGGCATG GTGATGGCCC TTGATCTGTC GGTGCGCCTG GGGCTGTGTC CGCCCGCCGA CGCCGCCCGC TTGCGCGCCC ATCTTGATCA CGTCGGCCTG CCCACCGATC CCCGGCGTCT CGAGGGTGCC CCGGCCTGGA ACGCCGAGCG CCTGCTGGCG GCGATGGATC ACGACAAGAA GGTCGAGGAT GGCAAGGTGA CCTTCGTGCT GGCGCGCGGC ATCGGCCGGT CGCTGCTGTG GCGCGAGGCC GATACGGCCA GCGTGCTCGC CACCCTGCGC GCCGCCGTGG CGCCCTGA
|
Protein sequence | MSPDTAADAA ADPALQSSVL TVSLGERSYP IHIGPGLLGR AGALIAPLLR KPRVFVVTDA TVAALHLDPL LASLGAAGIA HDHVVLPAGE ATKSFSQLEE LLDLLLAARF ERSTTLLALG GGVIGDLVGF AAAILLRGVD FIQIPTTLLA QVDSSVGGKT GINTAYGKNL VGAFHQPRLV LADTTVLDTL PRRELLAGYG EVVKYGVIDD PAFFDWLEEH GSALIAGDGG ARIHAVLTAC RAKARVVAED EREGGRRALL NLGHTFGHAL EAETGFGPTL LHGEAVALGM VMALDLSVRL GLCPPADAAR LRAHLDHVGL PTDPRRLEGA PAWNAERLLA AMDHDKKVED GKVTFVLARG IGRSLLWREA DTASVLATLR AAVAP
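The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of standard-library Python. This is an illustration, not the database's own method. It assumes the sequence is given in the space-separated 10-character blocks shown above, and it relies on the fact that translation table 11 uses the same codon assignments as the standard code, with an alternative start codon such as GTG read as methionine at the first position. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11),
# enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a common subset of table 11 start codons; in this sketch
# only the first codon is treated as an initiator.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    fragment = "GTGAGCCCAG ACACAGCCGC CGATGCCGCC"
    print(translate(fragment))  # MSPDTAADAA
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.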