Gene Information |
Locus tag | Sbal223_4002 |
Symbol | aroB |
ID | 7086218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 4762732 |
End bp | 4763808 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643462877 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_002359898 |
Protein GI | 217975147 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
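The coordinate and length fields above are consistent with each other, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4762732, 4763808)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```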
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00000746069 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCAAA TTCAGGTTGA TTTAGGTGTA CGTAGTTATC CCATTTACAT TGGCCAGAAT TTGATGAGTG ATGGCGAGAC CCTGTCTCGC TACCTGCTTA AAAAACGTAT TCTTATCGTC ACCAATGAAA CTGTCGCGCC TTTGTATCTT AAACAGATAC AAGAGACGAT GGCTTCGTTT GGTGAGGTAG AGAGTGTTAT CCTCCCCGAT GGCGAACAAT TCAAAGACTT AGCACATCTA GATACTATTT TTACTGCATT GCTGCAGCAA AACTATGGTC GAGATTCTGT GCTGGTGGCT TTGGGTGGCG GCGTAATTGG TGATATGACG GGCTTTGCCG CGGCATGTTA TCAACGTGGG ATCGATTTTA TTCAAATTCC GACAACCCTA TTGTCGCAGG TGGATTCTTC CGTCGGCGGT AAAACGGCTG TTAACCATCC TCTTGGTAAA AACATGATTG GGGCCTTTTA TCAGCCACAA ATCGTGCTTA TCGATACTTT ATGTTTACAT ACGCTTCCAG CGCGCGAGTT TGCGGCGGGA ATGGCGGAAG TCATCAAGTA TGGCATCATG TGGGATGCTG ATTTTTTTCA ATGGCTTGAA GATAATGTAA CGGCACTAAA AACCTTAGAT GCCCAAGCAT TGATTTATGC TATCTCCCGT TGCTGTGAGA TTAAGGCCGA TGTAGTTAGC CAAGACGAAA CTGAGCAGGG TGTACGTGCT TTATTGAATC TGGGTCATAC CTTTGGTCAT GCGATTGAAG CCGAAATGGG CTACGGTAAT TGGTTGCATG GTGAAGCCGT GTCAGCTGGC ACAGTCCTTG CTGCTCAAAC AGCTAAGGCA CTGGGGCTTA TCGATGAGTC AATAGTTTGT CGTATCATAG AGTTACTACA AGCTTTTGAT CTTCCAGTGA GTGCGCCGGA ATCTATGGAT TTCGACAGTT TCATTCAACA TATGCGACGC GATAAAAAAG TTTTAGGCGG TCAGATTCGA CTGGTGCTCC CAACGGCTAT AGGCCGCGCG GATGTGTTTA GTCAAGTCAC TGAATCTACC CTCGAACAGG TTATTCGCTG CGCATAA
|
Protein sequence | MKQIQVDLGV RSYPIYIGQN LMSDGETLSR YLLKKRILIV TNETVAPLYL KQIQETMASF GEVESVILPD GEQFKDLAHL DTIFTALLQQ NYGRDSVLVA LGGGVIGDMT GFAAACYQRG IDFIQIPTTL LSQVDSSVGG KTAVNHPLGK NMIGAFYQPQ IVLIDTLCLH TLPAREFAAG MAEVIKYGIM WDADFFQWLE DNVTALKTLD AQALIYAISR CCEIKADVVS QDETEQGVRA LLNLGHTFGH AIEAEMGYGN WLHGEAVSAG TVLAAQTAKA LGLIDESIVC RIIELLQAFD LPVSAPESMD FDSFIQHMRR DKKVLGGQIR LVLPTAIGRA DVFSQVTEST LEQVIRCA
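The GC content and protein fields can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes the gene sequence is shown in coding orientation (it begins with ATG although the gene lies on the minus strand), strips the spaces that separate the 10-base groups, and uses the codon-to-amino-acid assignments of translation table 11, which match the standard code. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-character groups and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three groups of the gene sequence above.
print(translate("ATGAAGCAAA TTCAGGTTGA TTTAGGTGTA"))  # MKQIQVDLGV
```

Passing the full gene sequence to `gc_fraction` and `translate` allows a comparison against the GC content and protein sequence fields of the record.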