Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0721 |
Symbol | aroB |
ID | 5898176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 780163 |
End bp | 781272 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561203 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001682352 |
Protein GI | 167644689 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.879879 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCACCA CCATCCCCGT GGGCCTGGGC GCGCGCGCCT ATGACGTGGT CATCGGAACC GGCCTGATCG ACCGGGCGGG CGAGCACATC GCGCCGCTGC TCAAGCGCAA GCGCGTGGCC ATCGTCACCG ACACCATCGT CGGCGAGCAC CACGGCGAGC GGCTGGCCAA TGCGCTGGAG CACGCCGGCG TGGCGACCGA CGTGATCCTG GTGCCGCCCG GCGAGGAGAC TAAGAGCTTC GAGGGCCTGG CCGACCTCAG CGATCGCCTG CTGGCCCTGG GCCTGGAGCG CGGCGACATG GTCATCGCGT TCGGCGGCGG GGTGGTCGGC GACCTGACCG GCTTCGCGGC GGCGATCTAC AAGCGCGGGA TCGACTTCAT CCAGATCCCC ACCACCCTGC TGGCCCAGGT GGACTCGTCG GTGGGCGGAA AGACCGCCAT CGACACCCCG CGCGGCAAGA ACCTGATCGG CGCCTTCCAC CAGCCGCGCC TTGTGCTGGC CGACCTCGAC ATCCTGGCCA CCCTGCCCGC CCGCGAGCTG GCCTGCGGCT ATGCCGAGGT CATCAAGTAC GGCCTGCTGG GCGATTTCGC CTTCTTCGAA TGGCTGGAGG CCAACGTCCA CGCCGTGCTG GAGCGCGACA CCGCCGCCCT GGTGAAGGCC GTGGGCCGCT CGGTGGAGAT GAAGGCGCAG ATCGTCGCCG AGGATGAGCG GGAGGTCGGC CGCCGGGCGC TGCTGAACCT GGGTCACACC TTCGGCCACG CGGTCGAGGG CGAGATGGGC TTTGGCGACG CGCTCAAGCA CGGCGAGGCC GTCGGTCTGG GCATGGCGCA GGCCTTCCGG TTCTCGGTCC GCCAGGGCCT ATGCTCGGCC CAGGACGCCG CCCGCGCCGA GGCCGCGATC AAGGCCGCCG GCCTGCCGAC CAAGCTGTCG GACATCCGCC CCGAACCGTT CAGCGCCGAC GCCCTGATCG CCCACACCGC CCAGGACAAG AAGGCGCAGG GCGGGACCTT GACCTTCGTC CTGGTCCGCG CGATCGGCGA CGCCTTCGTG GCCAAGGACG TGGACCGGCA AGCACTGCGG GCGTTCCTGG TGGAGGAAGG CGCGGTTTAA
|
Protein sequence | MITTIPVGLG ARAYDVVIGT GLIDRAGEHI APLLKRKRVA IVTDTIVGEH HGERLANALE HAGVATDVIL VPPGEETKSF EGLADLSDRL LALGLERGDM VIAFGGGVVG DLTGFAAAIY KRGIDFIQIP TTLLAQVDSS VGGKTAIDTP RGKNLIGAFH QPRLVLADLD ILATLPAREL ACGYAEVIKY GLLGDFAFFE WLEANVHAVL ERDTAALVKA VGRSVEMKAQ IVAEDEREVG RRALLNLGHT FGHAVEGEMG FGDALKHGEA VGLGMAQAFR FSVRQGLCSA QDAARAEAAI KAAGLPTKLS DIRPEPFSAD ALIAHTAQDK KAQGGTLTFV LVRAIGDAFV AKDVDRQALR AFLVEEGAV
|
| |