Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3947 |
Symbol | |
ID | 8393297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 4068606 |
End bp | 4069571 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 644981872 |
Product | hydratase/decarboxylase |
Protein accession | YP_003139586 |
Protein GI | 257061698 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3971] 2-keto-4-pentenoate hydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.536465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00797291 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGAAGA TAAACTATTT TTTGTTCCCC TTCTTCATTT TACTAAGTCC CCTTCCTGAA CTGGCACAAG TTAAAATCAA AGATCCGTTA TTTAAGTCAA ATTATCATCA GATACACAAT ACAAATATTG CTGACTTTAA AGATAACTTT ATAACTTTAT CGAATCAAGA CTTAGATGAG TTAGCAGAAA AGTTAGCTAA TTATTATTTG ACCAAACAAA AAATTGATAA TTTTCCTAAC AATTTAACTT CTAATCAGTC TCTTCTTATC CAATCTAAAT TTGTCAGCAA TTTAATTTAT AACCAAGGCA ATATCATTGG TTATAAAGCA GGTTTGACCA ACCAAAAACT CCAAGAAAGA TTTAACACAA ATCAACCTGT ATTAGGAACT TTACTCGAAA AAATGTTATT GCCATCAGGA ACAATCGTTT CCTCTAAATT TGGTGCTATT CCTATGATGG AAGGAGATTT AATGGTCAGA GTGAAAAGTG AGAAAATTAA TCAAGCAAAA ACACCCCAAG AAGTCTTAAA CTATTTAGAT GCTGTTATTC CATTTTTAGA ATTACCTGAT TTAATGTATA GCCAAGATCT AAAATTAAAT AAGGAAATGT TAGTCGCTAT TAATGTTGGT GCAAGATTAG GAATTATGGG AGAACCTATT CCGTTAGAAG CAACGAAAGA ATGGCACACT AAGTTAAGTA ATATTCAGGT TACTATTAAA GATGAATTGG GTCAAGAATT AGCCCAAGGA AACGGTAAAG CATTATTAGG AGATCCCTTA ACAGTTGTAC TCTGGATTAA AGATGAGCTA CGATCTCAAG GAAAAAGCCT AAAAAAAGGT GATTTGTTAT CTTTAGGAAG TATTACCCCT TTAATACCCG TTAAACCAGG AAAAACAATT TCAGCGCAGT ATTTAGGATT AAATGAAGCG AGCCCAGTTC AACTATCCGT CCACTTTGAA GAATAA
|
Protein sequence | MTKINYFLFP FFILLSPLPE LAQVKIKDPL FKSNYHQIHN TNIADFKDNF ITLSNQDLDE LAEKLANYYL TKQKIDNFPN NLTSNQSLLI QSKFVSNLIY NQGNIIGYKA GLTNQKLQER FNTNQPVLGT LLEKMLLPSG TIVSSKFGAI PMMEGDLMVR VKSEKINQAK TPQEVLNYLD AVIPFLELPD LMYSQDLKLN KEMLVAINVG ARLGIMGEPI PLEATKEWHT KLSNIQVTIK DELGQELAQG NGKALLGDPL TVVLWIKDEL RSQGKSLKKG DLLSLGSITP LIPVKPGKTI SAQYLGLNEA SPVQLSVHFE E
|
| |