Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_1800 |
Symbol | aroB |
ID | 4082191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 1896382 |
End bp | 1897488 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638010175 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_616845 |
Protein GI | 103487284 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGC TGACCGTCGA ACTGGGCGCC CGCAGCTATC CGATCCTGAT CGGCGACGGG CTGATCCGCG ACATTGGCGC GCATGTCGCG CCGCTGTTGA AGCGACCGCG GACCATGATC GTCACCGACA GCCATGTCGC CGACCATTAT CTTGCACCCA TCGGCACCGC GCTGGCGATG GAGAATATCG CCTGCTCCTC CTTCGTGCTC GACCCCGGCG AGGCGACAAA GAGCTGGTCC GGCCTCGCGC GCCTCACCGA ATGGCTGATC GGCGAAGGCA TCGAACGCAG CGACCATGTC ATCGCGCTGG GGGGCGGTGT GATCGGCGAT CTCGTCGGTT TTGCGTGCAG CATCGTCAAG CGCGGCTGCG CCTTCATTCA GGTGCCGACG ACTTTGCTCG CACAGGTCGA CAGCAGCGTC GGCGGCAAGA CCGCGATCAA TGTCCCTGCG GGCAAGAATC TGATCGGCGC CTTTCACCAG CCGGCGATGG TGGTGATCGA CCCGACCACG CTCGAAACGC TGCCCCGCCG CGAACTCGGT GCGGGCTATG CGGAAGTCGT CAAATATGGG CTGATCGACG ACGCGGACTT CTTCGCCTGG TGCGAGGCGC ATGGCGCCGC GCTGCTGGCA GGCGACAGTG CGGCGCGGGC CCATGCGATC GCGCACAGCG TTGCCGCCAA GGCGCGCATC GTCGCCGCCG ACGAGCGCGA GACGCAGGAT ATTCGGGCGT TGCTCAACCT CGGCCACAGC TTCGGCCACG CGCTCGAGGC CGAAACCGGC TATTCGGACC GGCTGCTCCA CGGCGAGGCG GTGGCGGCGG GCATGGTGCT CGCACACCAG TTTTCGGCCG CGAACGGACT TTGCCCCGCC GCCGACGCGG CGCGTGTCCG CGACCATCTC GCCAGCGTTG GCCTGCCGCA CAGCCTGGCA AGCGCGGGGA TCAATGGCGG CGGCGCTCAG CTCGCCGCGC ATATGGCGCA CGACAAGAAG GTGCGCGGCG GCAGACTGCC GCTGATCCTG TCGCGCGGCA TCGGGCAGAG CTTCGTCACC GACGCATATG ACCTAGATGC CGTCGCCGCC TTCCTCGACG AGCAGCGTAG CGTATGA
|
Protein sequence | MEKLTVELGA RSYPILIGDG LIRDIGAHVA PLLKRPRTMI VTDSHVADHY LAPIGTALAM ENIACSSFVL DPGEATKSWS GLARLTEWLI GEGIERSDHV IALGGGVIGD LVGFACSIVK RGCAFIQVPT TLLAQVDSSV GGKTAINVPA GKNLIGAFHQ PAMVVIDPTT LETLPRRELG AGYAEVVKYG LIDDADFFAW CEAHGAALLA GDSAARAHAI AHSVAAKARI VAADERETQD IRALLNLGHS FGHALEAETG YSDRLLHGEA VAAGMVLAHQ FSAANGLCPA ADAARVRDHL ASVGLPHSLA SAGINGGGAQ LAAHMAHDKK VRGGRLPLIL SRGIGQSFVT DAYDLDAVAA FLDEQRSV
|
| |