Gene Information |
Locus tag | Rpal_0504 |
Symbol | aroB |
ID | 6408153 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 547256 |
End bp | 548401 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642710416 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001989539 |
Protein GI | 192288934 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
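
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions fit the listed values but are not stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(547256, 548401)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both results agree with the Gene Length and Protein Length fields of the record.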
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCC CGCAGAAACA CTCCGCTCCG ATCACCGTCG AAGTCGCCCT CGGCGACCGC GCCTATGAGA TCGTGATCGG CCGCGACGTG ATCGCCTCGC TGGGCGAGCG GATCGCCAAG CTGCGGCCCG GCGCACGCAC CGCGATCGTC ACCGATCGCA CCGTGGCGAA GACCTGGCTG AAGCGCACCG AGGAGGTGCT GGATCAGGTC GGCATCGCGC ATGCGTCGGT GATCGTCGGC GAAGGCGAAA GCTCCAAAAG CTATGCGGGG CTCGAGCAGG TGTGCGAGGC GCTGATCGCC GCCAAGATCG AGCGCAACGA TCTGGTGATC GCGCTCGGCG GCGGCGTGAT CGGCGATCTT GCCGGTTTCT CGGCCTCGCT GCTGCGCCGC GGCGTCGATT TCGTGCAGGT GCCGACCTCG CTGCTGGCGC AGGTCGACTC CTCGGTCGGC GGCAAGACCG GCATCAACTC GCCGCAGGGC AAGAATCTGA TCGGCACGTT CCATCAGCCG GTGCTGGTCT TGGCCGACAC CGCGATCCTC GACACCCTGT CGCCGCGCCA ATTCCGCGCC GGCTACGCCG AAGTCGCGAA ATATGGCGCG CTCGGTGACG AAGCGTTCTT CGCCTGGCTC GAAGCCAACC ATGCCGAGCT GTTCAGCGGC GGCGCCGCGC GCGAGCACGC GGTGGCCACC TCATGCCGAG CCAAGGCCGC GATCGTCGCC CGCGACGAAC GCGAGACCGG AGACCGCGCG CTGCTCAATC TCGGTCACAC CTTCGGCCAT GCGCTGGAAG CAGCGACCGG GTTCTCCGAC CGGCTGTTCC ACGGCGAGGG CGTGGCGATC GGCATGGTGC TGGCGGCGGA ATTCTCGGCC GAGCGCGGCA TGATGCCGGC AGCCGACGCG CAGCGGCTAG CCAAGCACCT CGCCGACGTC GGCCTGCCGA CCCGGCTGCA GGACATCGCC GGCTTCACCC AGGAAGGCCT TGCCGACGCC GACCGCCTGA TGGCGCTGAT GTCGCAGGAC AAGAAGGTCA AGCGCGGCGA ACTCACCTTC ATCCTGATGG AAGGCATCGG CCGCGCGGTG ATCGCCAACA AGGTCGAGCC GGCGCCGGTG CGCGACTTCC TGCAGCGGAA ACTGGCGCAA GCCTAA
|
Protein sequence | MNAPQKHSAP ITVEVALGDR AYEIVIGRDV IASLGERIAK LRPGARTAIV TDRTVAKTWL KRTEEVLDQV GIAHASVIVG EGESSKSYAG LEQVCEALIA AKIERNDLVI ALGGGVIGDL AGFSASLLRR GVDFVQVPTS LLAQVDSSVG GKTGINSPQG KNLIGTFHQP VLVLADTAIL DTLSPRQFRA GYAEVAKYGA LGDEAFFAWL EANHAELFSG GAAREHAVAT SCRAKAAIVA RDERETGDRA LLNLGHTFGH ALEAATGFSD RLFHGEGVAI GMVLAAEFSA ERGMMPAADA QRLAKHLADV GLPTRLQDIA GFTQEGLADA DRLMALMSQD KKVKRGELTF ILMEGIGRAV IANKVEPAPV RDFLQRKLAQ A
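
The sequence fields can be checked against the GC content and translation table listed earlier in the record. The following is a rough sketch, not the database's own method: it strips the spaces that separate the 10-character blocks, computes the GC fraction, and translates codon by codon. It assumes the gene sequence is already shown in coding orientation (it starts with ATG and ends with TAA even though the strand is "-"). The amino acid assignments of translation table 11 are the same as those of the standard code; the table differs in its permitted start codons, which this sketch does not handle because the sequence here starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (standard code,
# whose amino acid assignments are shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence above.
print(translate("ATGAACGCCC CGCAGAAACA"))  # MNAPQK
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field of the record.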