Gene Information |
Locus tag | Syncc9902_1050 |
Symbol | aroB |
ID | 3742473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 1014682 |
End bp | 1015815 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637771224 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_377058 |
Protein GI | 78184623 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
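
The coordinate and length fields in this record agree with each other, and the check is simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(1014682, 1015815)
print(length, protein_length(length))  # prints "1134 377"
```

Under these assumptions the result matches the Gene Length (1134 bp) and Protein Length (377 aa) fields.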
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0440827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCCC CTAGTTCCAT GACGACGACG ATTCACGCTC ACCGCATCAA CGTCGCCCTC GAGCGCAACC CCTACGACAT CGTCATAGGG GACGGTGTCT TGGGAATAGT GGGCGATGAA CTCCATCGAC TCGGGGCAAA GCCAGGAAAA AAAATCCTTG TGGTCAGTAA CGCAGACGTT GCCGGTCCCT ACGGCGAAGC CTGCCTCAAC AGCTTGAAAG AGAAGGGATT CAACGTTGAA TTGCTCGTGA TCGAAGCGGG CGAAGAGCAA AAACACTTAC GCACCGTTTC GCAAATCCAC GACGCAGCAT TTGCGTTCAA GTTGGAACGG AGTTCGATGC TGTTGGCCCT TGGCGGTGGG GTCGTGGGAG ACATGACCGG ATTTGCCGCA GCAACGTGGT TGCGAGGTGT CGGCGTCGTT CAAGTTCCAA CAACATTGTT GGCGATGGTG GATGCCTCCA TCGGAGGGAA GACAGGGGTG AATCACCCTG GCGGCAAAAA TCTGATCGGC GCTTTTCATC AGCCGCATCT GGTCTTGATC GATCCAACAA CCTTGGACAC CCTCCCAATT CGAGAATTCC GAGCTGGGAT GGCCGAAGTG ATCAAGTATG GAATTCTCGG GGATACCGAC CTGTTCCTAG AACTTGAATC ATGCAAAGAT CCAAGTAGCC CCAATGGATT AGGGCGAACA ACACTCGAAT CGATCCTGCA ACGATCAGCG GCAGCCAAGG CCAGCATCGT GGCGGCTGAC GAACGGGAGG GAGGCCTCCG CGCCGTCCTG AACTACGGCC ATACCTTTGG CCATGTGGTC GAGGCGCTGT GTGGTTATGG AACTTGGCTA CATGGCGAAG CTGTTGCTCT CGGAATGGTT GCTGTTGGGG AGCTAGCCGT CCAACGGGGC AACTGGTCAC GGGGCGATGC AGAGCGTCAA AAACAGCTGA TTGCCAAAGC AGGCTTACCC ACAACTTGGC CCGAACTCAA TCTGGAGGCT GTGTTGCACA CACTTCAAGG CGACAAAAAA GTCCGGGACG GTCAGCTTCG GTTTGTGATC CCGACAGAGA TCGGTTCGGT CGAGATTCAA AACGACATCA GCCGCGACGA GATCACGCAC TGTCTGGAGC ACTTGGCAAC TTGA
|
Protein sequence | MGSPSSMTTT IHAHRINVAL ERNPYDIVIG DGVLGIVGDE LHRLGAKPGK KILVVSNADV AGPYGEACLN SLKEKGFNVE LLVIEAGEEQ KHLRTVSQIH DAAFAFKLER SSMLLALGGG VVGDMTGFAA ATWLRGVGVV QVPTTLLAMV DASIGGKTGV NHPGGKNLIG AFHQPHLVLI DPTTLDTLPI REFRAGMAEV IKYGILGDTD LFLELESCKD PSSPNGLGRT TLESILQRSA AAKASIVAAD EREGGLRAVL NYGHTFGHVV EALCGYGTWL HGEAVALGMV AVGELAVQRG NWSRGDAERQ KQLIAKAGLP TTWPELNLEA VLHTLQGDKK VRDGQLRFVI PTEIGSVEIQ NDISRDEITH CLEHLAT
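
The sequences in this record are printed in blocks of ten characters separated by spaces, so they need to be joined before any computation. The following is a minimal sketch of that clean-up and of a GC-content calculation such as the one behind the GC content field. The helper names are hypothetical, and the exact method the database uses (for example, how it rounds) is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same space-grouped layout as the record.
print(clean_sequence("GGCC AATT"))
print(gc_percent("GGCC AATT"))
```

Passing the full Gene sequence value to `gc_percent` gives a figure that can be compared with the reported 55%.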