Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_2333 |
Symbol | |
ID | 8391653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 2348277 |
End bp | 2349335 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644980302 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_003138044 |
Protein GI | 257060156 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCATTG TAATGAAACC TGATTCTCCA GAATCGGAAA TCGAGAACAT TATTGGAGAA GTTAAACAAT GGCAATTGAC TCCCGAAAAA ATTCAAGGGA CAACTCAAGT TGTCATTGGG TTGGTGGGGG ATACGGCGAC GATGGAAATA TCCCGCATTC AAGAATTGAG TCCTTGGATT CAAGAGGTGT TACGGGTCGG AAAACCCTTC AAACGCGCTA GTCTGGAATT TCGTCACGGT CATTACAGTG AAGTGGTGGT CAACACCCCC AATGGACCTG TTCCCTTTGG CAAAAATCAT CCGGTCGTGG TCGTAGCGGG TCCTTGTTCG GTGGAAAATG AAGAAATGAT CATCGAAACC GCTAAGCGAG TTAAGGCGGC TGGAGCCCAT TTTCTACGGG GTGGAGCCTA TAAACCGCGT ACCTCTCCCT ATGCTTTCCA AGGCCACGGC GAAAGTGCTT TAGAATTGTT GGCTGCCGCA CGAGCAGCCA CTGGGTTAGG GATTATCACA GAAGTCATGG ACACGGCTGA TTTAGAGAAG GTGGCTGAAG TGGCTGATGT GATTCAAGTT GGCGCAAGAA ATATGCAGAA TTTCTCCCTA CTCAAAAAAG TGGGAGCCCA GGATAAACCT GTTTTGCTTA AACGGGGAAT GGCCGCGACT ATCGATGATT GGTTAATGGC AGCCGAATAC ATTTTAGCGT CAGGTAACAA TAACGTTATT CTCTGTGAGC GCGGGATTCG GACGTTTGAT CAAAAATATA CCCGTAATAC CTTGGATTTA TCGGTGATTC CGGTTTTACG CGATTTAACC CATTTACCGA TGATGATTGA TGCTAGTCAT GGAACCGGAA AGTCAGATTA TGTGCCTTCT ATGTCCATTG CAGCACTGGC GGCGGGAGCA GATTCGTTGA TGATTGAAGT TCATCCGAAC CCGGCTAAGG CACTCTCGGA TGGTCCTCAG TCCTTGACTC CTGAAAAGTT TGACCGTCTG ATGCAGGAGT TATCGGTCAT TGGTAAAACG GTGGGACGCT GGAGTCAACC AGCCGTGGCG TTGGCTTAA
|
Protein sequence | MIIVMKPDSP ESEIENIIGE VKQWQLTPEK IQGTTQVVIG LVGDTATMEI SRIQELSPWI QEVLRVGKPF KRASLEFRHG HYSEVVVNTP NGPVPFGKNH PVVVVAGPCS VENEEMIIET AKRVKAAGAH FLRGGAYKPR TSPYAFQGHG ESALELLAAA RAATGLGIIT EVMDTADLEK VAEVADVIQV GARNMQNFSL LKKVGAQDKP VLLKRGMAAT IDDWLMAAEY ILASGNNNVI LCERGIRTFD QKYTRNTLDL SVIPVLRDLT HLPMMIDASH GTGKSDYVPS MSIAALAAGA DSLMIEVHPN PAKALSDGPQ SLTPEKFDRL MQELSVIGKT VGRWSQPAVA LA
|
| |