Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3689 |
Symbol | |
ID | 8393031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3767813 |
End bp | 3768664 |
Gene Length | 852 bp |
Protein Length | 283 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644981614 |
Product | 3-deoxy-7-phosphoheptulonate synthase |
Protein accession | YP_003139336 |
Protein GI | 257061448 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.550176 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.691037 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAACG CTCAACTCGC CCTAAAATCC AATCCTGAAG CCATTACCCT TGTTTCTTTA ACGGATACTG TCGTTTTCGG CGGTCAAGAT ATCGTAATGA TTGGTGGCCC CTGTGCGGTG GAAAGTTTGG CACAAATGGA AGCCGTCGCT AATGGCCTAA GTTTAGCACC AGTTCAAGCC TTGCGGGGAG GAGCTTTTAA ACCAAGAACC TCCCCGCATT CTTTCCAAGG ACTCGGCCTA GAAGGGTTGA AAATTCTTGC AGAGGTTAAA CGGCGTTACG GTATTCCCGT GGTAACAGAA GTGATGTCTA CTGAGCAAAT AGAAGCAGTA GCAACCTACG CTGATATGTT ACAGGTTGGT AGCCGAAATA TGCAAAATTT TGAGTTACTC AAAGCATTAG GCCACGTCAA TAAACCCATC CTCCTCAAAC GGGGTTTAGC GGCTACCCTT GAAGAGTTTA TTGGTGCAGC AGAATACATT TTGAGCCATG GTAACACACA GGTAGTCCTC TGTGAGCGAG GAATCCGCAG TTTTGATAAC TACACTCGCA ATGTGCTAGA TTTAGGGGCA GTTGTCGCTC TAAAACAGCT TACCTGTTTA CCCGTGATTG TTGATCCCTC CCACGCAGCC GGTAGACGGG AATTAATTGC GCCTTTAGCT AAGGCAGCGA TCGCGGCCGG GGCTGATGGA TTAATTATAG AATGCCATCC TCAGCCCGAA AAATCGGTTT CTGACGCAGC ACAAGCCCTT TCTTTAGAAG AAATGGTCAG TTTAGTCCAT AGCTTACAAC CTATCGCTCA AGCTATTGGC CGTTCTATTG TACCGTTTTC TCGTTCACTA ATTGCAGCTT GA
|
Protein sequence | MFNAQLALKS NPEAITLVSL TDTVVFGGQD IVMIGGPCAV ESLAQMEAVA NGLSLAPVQA LRGGAFKPRT SPHSFQGLGL EGLKILAEVK RRYGIPVVTE VMSTEQIEAV ATYADMLQVG SRNMQNFELL KALGHVNKPI LLKRGLAATL EEFIGAAEYI LSHGNTQVVL CERGIRSFDN YTRNVLDLGA VVALKQLTCL PVIVDPSHAA GRRELIAPLA KAAIAAGADG LIIECHPQPE KSVSDAAQAL SLEEMVSLVH SLQPIAQAIG RSIVPFSRSL IAA
|
| |