Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_0525 |
Symbol | aroB |
ID | 3774763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 509226 |
End bp | 510332 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637798933 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_399544 |
Protein GI | 81299336 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTTC AAATCCCTGT TGCTCTGCCG CAAAACGCCT ACGAGATTGC GATCGCGAAT GGGGGATTGG CCGCAGCAGG GACTTGGTTG CAGCAAGCTG ATCTGAAGGC GGGCACGAAA CTTCTGATTG TGACCAACCC GGCGATCGGG CGGCGTTACG GCGATCGCCT CGTGGCAGCA CTGCAAGAAG CAGGTTTCAT CGTCGACTGC CTGACCCTAC CGGCTGGCGA ACGCTACAAA ACGCCAGCAA CAGTTCAACG CATCTATGAC AAAGCCCTAG AACTGCGGCT GGAGCGCCGT TCGGCCTTGG TCGCCTTGGG TGGCGGCGTG ATTGGTGACA TGACAGGTTT TGCGGCGGCA ACCTGGTTGC GCGGCATTAG CTTTGTGCAG ATCCCGACCT CCCTGCTGGC AATGGTTGAT GCTTCGATTG GGGGCAAAAC CGGCGTCAAT CATCCCCGTG GCAAAAACCT GATCGGGGCG TTTCATCAAC CCAAGCTGGT GCTAATTGAT CCAGAAACGC TACAAACCCT GCCCGTACGG GAGTTCCGTG CCGGCATGGC TGAGGTAATT AAGTATGGCG TGATTTGGGA TCGGGATTTG TTTGAGCGGT TGGAAGCAAG CCCCTTTCTC GATCGCCCGC GATCGCTACC GGCCAATCTC CTAACGCTGA TCTTAGAGCG CTCCTGTCGC GCCAAAGCAG AGGTGGTTGC CAAGGATGAA AAAGAATCGG GCTTGCGGGC CATCCTCAAC TACGGCCATA CGATTGGCCA CGCCGTCGAA AGTCTGACAG GCTATCGCAT CGTTAACCAT GGCGAAGCTG TGGCGATCGG GATGGTTGCG GCGGGACGGC TGGCTGTGGC GCTAGGACTC TGGAATCAGG ATGAATGTGA TCGCCAAGAA GCCGTGATTG CTAAAGCGGG CTTACCAACA CGCCTACCAG AAGGGATTGA TCAAGCTGCA ATCGTCGAGG CTCTACAACT CGACAAAAAA GTGCAGGCAG GCAAGGTACG GTTTATTCTG CCAACGACGC TCGGCCACGT CACGATTACC GATCAGGTAC CGAGCCAAAC CCTGCAAGAG GTGCTGCAGG CGATCGCCAA CCCCTAA
|
Protein sequence | MSVQIPVALP QNAYEIAIAN GGLAAAGTWL QQADLKAGTK LLIVTNPAIG RRYGDRLVAA LQEAGFIVDC LTLPAGERYK TPATVQRIYD KALELRLERR SALVALGGGV IGDMTGFAAA TWLRGISFVQ IPTSLLAMVD ASIGGKTGVN HPRGKNLIGA FHQPKLVLID PETLQTLPVR EFRAGMAEVI KYGVIWDRDL FERLEASPFL DRPRSLPANL LTLILERSCR AKAEVVAKDE KESGLRAILN YGHTIGHAVE SLTGYRIVNH GEAVAIGMVA AGRLAVALGL WNQDECDRQE AVIAKAGLPT RLPEGIDQAA IVEALQLDKK VQAGKVRFIL PTTLGHVTIT DQVPSQTLQE VLQAIANP
|
| |