Gene Information |
Locus tag | PCC8801_4397 |
Symbol | |
ID | 7104845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4620279 |
End bp | 4622252 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643477376 |
Product | short chain dehydrogenase |
Protein accession | YP_002374475 |
Protein GI | 218249104 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only; [S] Function unknown |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases); [COG3347] Uncharacterized conserved protein |
TIGRFAM ID | |
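
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4620279
end_bp = 4622252
gene_length_bp = 1974
protein_length_aa = 657


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS; the stop codon is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

The strand field does not affect either check, since the span length is the same on both strands.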
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAAAGTT TATGGAATGA CCAAGAAGTG GTCAAATATC AAGGAGATTT GGCCTTAAGG GTGTATACCT CAAGATTATT AGGTCAAGAA CCCTCCCTAG TTTTGCATGG AGGGGGCAAC ACTTCCGTTA AAATTCGTCA AGAAAACCTA GTCGGGGAAG TAGAAGACAT TCTCTACGTT AAAGGGAGTG GGTGGGACTT AGCAACCATT GAAGAAGCAG GGTTTTCCCC CGTGAAAATG CCCCATCTGT TAAAATTAGC CGAACTTCCT AGCTTATCTG ACTCCCAGAT GGTCAATGAA TTGAAAACGC AGATGATTAA GGCCAGTGCC CCATCTCCCT CCGTTGAAAC CATCCTTCAT GCCATTTTAC CCTACAAATA CGTCGATCAT ACCCATGCTG ATGCGGTTGT CACCATCACC AATACCGCGA CAGGATGGGA ACGGATGCAG GAAATTTATG GCGATCGCGT CGTTCTTATC CCCTACATTA TGCCAGGCTT TGACCTCGCC CGTCTGTGTG CCGAAAAATT CGCCGCCGAA GCTGGAAAAC AGACCATTGG CATGGTATTA ATGAATCATG GTATCTTTTC CTTTGGGGAA ACCGCCCAAG CGTCCTACGA ACGGATGATC GAATTAGTTA GTCAAGCAGA GGACTATTTA GCGCAACATA ACGCCTGGAA AATTCCCCAA AAATCGGTTA TTCATCCCGA AAAAAGCCTT AGTTACACTC TGGCTAAGCT GAGATCTCAA GTTTCTCAGT CCGCAGGATT TCCCGTTATT GCATCCTTGC ATTGCAACGA TCAAACCCTT GCTTTTACCC AACGTCCCGA CATTGCAGAG ATTTCTCAAC AAGGACCCGC TACTCCCGAT CACGTCATTC GGACGAAACG GCTACCCCTC CTAGGCAATG AGGTAGACAG TTACGTTCAA GCCTATCGAA CCTATTTTAA CACCCATGCG CCCCAAGCGA AAGAACCCAA AACCATCTTA GACCCCGCTC CTAGGGTCAT TCTCGACCCT GAATGGGGAA TGTGTACCAT TGGTCGTAAT GCCAAAGACG CAGCCATTGT TGCCGATATT TACCACCATA CCATCGAAAT TATTCAACGG TCTACACTTC TGGGGGGATA TCAGGCCTTA AGTGCTCAAG ATCTCTTTGA TATGGAATAC TGGGAACTGG AACAGGCAAA ATTGGCTAAA GGAGGTCAAA CGCCGATATT TTCTGGTGAA ATTGCCCTAG TAACAGGGGC TGCATCGGGT ATTGGTAAAG CTTGCGTCGA TTCTCTCCTA AAACGAGGTG CTGCGGTGGT GGGACTGGAT ATTAATGAAG CGATCGCCAA TTTGTATCAA CGCCCTGATT TTTGTGGCAT TCCCTGCGAT ATTACCGATG AAACTGCCCT AAAAGCTGCC CTCGAAAGGG TGATCAGAAC CTTTGGGGGA CTAGATATGT TAATTCTCAA CGCGGGAATT TTTCCCCCCG GATGCGCGAT AGAAGGACTT TCTACCGAAG AATGGCGACG GGTGATGTCA ATTAATTTGG ATGCCAATTT AGTCATCATG CGAGAATGTC ATCCTTTCCT CAAATTAGCT CCTAAAGGGG GTAGGGTGGT CATTATTGGC TCAAAAAATG TCCCTGCTCC AGGTCCGGGA GCCGCAGCTT ATTCTGCTTC TAAAGCGGCG TTAAATCAAT TAACTCGCGT AGCCGCTTTA GAATGGGGTA AGGACAATAT TCGCCTTAAT TCGGTGCATC CTAATGGAGT ATTTGACACC GGAATTTGGC GAGAAGAGGT CTTAGAAGCC 
CGTGCACAGC ATTATGGCCT TACCATAGAG CAATATAAAA CGAATAATGT CTTAAAAGTC GAAGTTACCA GTCAGGATGT CGCTGAAATG GTTGCTCATC TATGTAGTGA TGTATTTGCC AAAACAACCG CCGCACAAAT TCCTATTGAT GGGGGAAATG AACGGGTTAT TTAG
Protein sequence | MKSLWNDQEV VKYQGDLALR VYTSRLLGQE PSLVLHGGGN TSVKIRQENL VGEVEDILYV KGSGWDLATI EEAGFSPVKM PHLLKLAELP SLSDSQMVNE LKTQMIKASA PSPSVETILH AILPYKYVDH THADAVVTIT NTATGWERMQ EIYGDRVVLI PYIMPGFDLA RLCAEKFAAE AGKQTIGMVL MNHGIFSFGE TAQASYERMI ELVSQAEDYL AQHNAWKIPQ KSVIHPEKSL SYTLAKLRSQ VSQSAGFPVI ASLHCNDQTL AFTQRPDIAE ISQQGPATPD HVIRTKRLPL LGNEVDSYVQ AYRTYFNTHA PQAKEPKTIL DPAPRVILDP EWGMCTIGRN AKDAAIVADI YHHTIEIIQR STLLGGYQAL SAQDLFDMEY WELEQAKLAK GGQTPIFSGE IALVTGAASG IGKACVDSLL KRGAAVVGLD INEAIANLYQ RPDFCGIPCD ITDETALKAA LERVIRTFGG LDMLILNAGI FPPGCAIEGL STEEWRRVMS INLDANLVIM RECHPFLKLA PKGGRVVIIG SKNVPAPGPG AAAYSASKAA LNQLTRVAAL EWGKDNIRLN SVHPNGVFDT GIWREEVLEA RAQHYGLTIE QYKTNNVLKV EVTSQDVAEM VAHLCSDVFA KTTAAQIPID GGNERVI
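
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in plain Python. This is a minimal illustration, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
prefix = "ATGAAAAGTT TATGGAATGA CCAAGAAGTG"
print(translate(prefix))  # MKSLWNDQEV, the first 10 residues of the protein
```

The sketch does not handle alternative start codons such as GTG or TTG, which table 11 allows as initiators translated as methionine. That case does not arise here because the gene starts with ATG.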