Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spro_4604 |
Symbol | aroB |
ID | 5605908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Serratia proteamaculans 568 |
Kingdom | Bacteria |
Replicon accession | NC_009832 |
Strand | - |
Start bp | 5079201 |
End bp | 5080301 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640940170 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001480825 |
Protein GI | 157372836 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000760296 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGAGAA TTACCGTAAC GCTTGGGGAG CGCAGCTACC CGATAACCAT AGCCGCCGGA TTGTTTAACG ATCCGGCTTC TTTTATGCCG CTGAAGGCGG GTGAACAGGT CATGCTGGTC ACCAACCAAA CCCTGGCGCC ACTCTATCTG GACCACGTCC GGAAGGTGTT GGAGCAGGCA GGCGTCATGG TGGATCAGGT GATTTTGCCT GATGGCGAAC AGTATAAATC TCTGGCCGTA CTCGAGCAGG TGTTCTCGGC ACTGTTGGAA AAGCCGCACG GTCGTGATAC CACGCTGATT GCCCTTGGGG GCGGCGTGAT TGGCGATCTT ACCGGCTTTG CCGCCGCCTG TTATCAGCGC GGTGTCCGCT TTATTCAGGT CCCTACCACG CTGTTGTCGC AGGTGGACTC TTCCGTTGGC GGTAAAACCG CCGTCAATCA TCCGCTCGGC AAGAACATGA TCGGCGCCTT CTATCAACCC GTTTCTGTGG TGGTTGATCT CGATTGCCTG AAAACCTTAC CGGCGCGTGA GCTCTCCTCT GGTTTGGCTG AAGTGATCAA GTACGGGATT ATTCTCGACC ACGATTTCTT CGTCTGGCTG GAAAACAATA TCGATGCCCT GGTGGCGCTG GATATGCAGG CTCTGGCCTA CTGTATCCGT CGCTGCTGCG AGCTGAAAGC TGAGGTGGTT GCTGCTGACG AACGCGAAAG CGGGCTGCGC GCGCTGCTGA ATCTGGGCCA TACTTACGGC CATGCGATCG AAGCCGAAAT GGGCTATGGT GTATGGTTGC ACGGTGAGGC CATTGCCGCC GGTATGGTGA TGGCGGCAGA AACCGCGCAC CGTCTCGGCC AGTTCTCCCG CGAAGATATT GAACGTATTA AAGCACTGTT GTTGCGCGCC GGTTTACCAG TGTGTGGCCC GCAGGAAATG GCTCCGGGAA CTTATCTGCC GCATATGATG CGCGATAAGA AAGTCCTGGC CGGTGAATTG CGCCTGGTAC TGCCGACGGC CATTGGCCAG GCGGAAGTCC GTGGCGGAGT GGGGCATGAT ATGGTGCTCG CTTCGATCGC AGCTTGCTTT CCTGACGGAA TGTCTAAGTA A
|
Protein sequence | MERITVTLGE RSYPITIAAG LFNDPASFMP LKAGEQVMLV TNQTLAPLYL DHVRKVLEQA GVMVDQVILP DGEQYKSLAV LEQVFSALLE KPHGRDTTLI ALGGGVIGDL TGFAAACYQR GVRFIQVPTT LLSQVDSSVG GKTAVNHPLG KNMIGAFYQP VSVVVDLDCL KTLPARELSS GLAEVIKYGI ILDHDFFVWL ENNIDALVAL DMQALAYCIR RCCELKAEVV AADERESGLR ALLNLGHTYG HAIEAEMGYG VWLHGEAIAA GMVMAAETAH RLGQFSREDI ERIKALLLRA GLPVCGPQEM APGTYLPHMM RDKKVLAGEL RLVLPTAIGQ AEVRGGVGHD MVLASIAACF PDGMSK
|
| |