Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1899 |
Symbol | aroB |
ID | 3917120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2011490 |
End bp | 2012599 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444643 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_497173 |
Protein GI | 87199916 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.867319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGTAA TCCCCGTCGC CATCGCCGGA GCGCCCTATG AAGTGCGGAT CGAGGCCGGC GTTCTGGCCC GTGCGGGCGA ACATTGCCGC CCCTTTCTGC GCAAGGACCG CGTGGCTATC GTCACCGACG AGCACGTCGC CGCAGAGTGG CGGGAAACCG TCACTGCCTC GTTCGACAGC GTCGGCGTGC GCAGCGAATG GCTCGTTCTC CCCGCTGGCG AGAGCACCAA GAGCTGGGAA CACCTCGCCC GCCTGGTCGA CTGGCTGCTG GAACAGGAAG TCGAGCGCAA GGACCGTATT GTCGCGCTCG GCGGAGGCGT GATCGGCGAT CTCACCGGGT TTGCCGCCTC GATCGTCAAG CGTGGCTGCG GTTTCATCCA GATCCCGACG ACCCTCCTGG CGCAGGTCGA TTCCAGCGTC GGCGGAAAGA CCGCGATCAA CACGCCCGCT GGCAAGAACC TCGTCGGCGC GTTTCACCAG CCTGCGCTGG TCCTCGCCGA TCCGCTCGCG CTCGACACGC TGCCGCTGCG CGATGTGCGG GCCGGGTACG CCGAAGTGGT GAAATATGGC CTGATCGACG ATGCGCCCTT CTTCGAGTGG TGTGAGGCGA ACGGCGCAAA GCTCCTCGCA GGCGACCTGG CAGCGCGCGA GACGGCCATC GCACACAGCG TCGCGGCAAA GGCGCGGATC GTGGCGGCGG ACGAGAAGGA AATCGCCGGC ATCCGTGCGC TACTCAATCT CGGGCACACT TTCGGGCACG CGCTCGAGGC CGAGACCGGC TTTACCGATC GCCTGCACCA TGGCGAAGGC GTGGCGCTGG GGATGGTGCT GGCGGCACGA TTCTCGGCGC GGCAGGGGCT GATGTCGAGG CAGGATGCCG AACGCGTGGC TCGCCATGTC GAAGCGGTGG GCCTGCCTGC CACGCTCCGC GAGCTTGGGC TTTCCTGCGA CGGCCGCCGC CTTGCCGATC ACATGCTTCA CGACAAGAAG ATGGACGCGG GCACATTGCC CTTCCTTCTC ATGCGCGGGA TCGGGCAGAC CTTCCTGGCA AAGGACGTCG ATCTGACGGA AGTGGCCGCT TTCCTCGACG AGGAACTCGC CAGAACCTGA
|
Protein sequence | MAVIPVAIAG APYEVRIEAG VLARAGEHCR PFLRKDRVAI VTDEHVAAEW RETVTASFDS VGVRSEWLVL PAGESTKSWE HLARLVDWLL EQEVERKDRI VALGGGVIGD LTGFAASIVK RGCGFIQIPT TLLAQVDSSV GGKTAINTPA GKNLVGAFHQ PALVLADPLA LDTLPLRDVR AGYAEVVKYG LIDDAPFFEW CEANGAKLLA GDLAARETAI AHSVAAKARI VAADEKEIAG IRALLNLGHT FGHALEAETG FTDRLHHGEG VALGMVLAAR FSARQGLMSR QDAERVARHV EAVGLPATLR ELGLSCDGRR LADHMLHDKK MDAGTLPFLL MRGIGQTFLA KDVDLTEVAA FLDEELART
|
| |