Gene Information |
Locus tag | Cyan8802_0304 |
Symbol | |
ID | 8389608 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 294323 |
End bp | 297187 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644978345 |
Product | putative transcriptional regulator, Crp/Fnr family |
Protein accession | YP_003136103 |
Protein GI | 257058215 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0664] cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases |
TIGRFAM ID | |
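The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS length includes the stop codon. Both assumptions are consistent with the listed values, but the record does not state them. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count of an unspliced CDS, optionally excluding the stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(294323, 297187)
print(length)                     # 2865
print(protein_length_aa(length))  # 954
```

Both results agree with the Gene Length and Protein Length rows of this record.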
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGCTA AAATTCCCGA AAAGCAGATG CACATCGTCC GGTCTATCCT AGCCGGGGGT TGGCTGCTCC TAATCTGTTC CCTATTCTAC GATCCCATTT CCCCTTGGTT AACCTACCCC GATAGCACTT GGAGTCCTTT TCGGTTAAAC CCCAACGCTT GTATCCAAGT CCAAGGCCAA TGTCTTCCCC AAGAACCCTA TCCCATGGGG GCTTCCCTCT TTTGGGGGTT AATTGTTCCT ACAGGGGTCT TTGTCCTGTT AGTCTTTGGC CATGAATTTT GGCGGCGCAT CTGTCCCTTG AGTTTTTTCT CCCAACTCGC CCGCCGCTTG GGGAAACAAC GACAACGTAA ACGGGTCAAT GCCGAAACAG GTCAGGTTCG CTATGAGTTA GTCAAAATCA AACATGATTC CTGGTTAGGC CAGAACTACA TTAAAGTCCA AATGGGCTTA TTGTACGTTG GAGTCTGTGG CCGTATCCTC TTCTACGACT CGATGCGACC GGTTTTAGGA AGTTTTTTGC TGTTTAGCAT TGGTGCAGCT ATTCTAGTGG GCTACCTCTA CGGGGGAAAA TCTTGGTGTC AGTATTTCTG TCCTATGGCT CCGGTGCAGG AATTTTATAG CGAACCGAGA GGGTTACTCA ACAGTATCGC CCATGAAGGT CAAAGTGGTG TCATTACTCA ATCGATGTGC CGTCAAATCA ACGAAGATGG CACAGAATCG AGTGCTTGCG TGGCCTGCCA TAGTCCCTGT GTCGATATTG ATGCCGAAAG AGCCTATTGG GATCGCATGA AACGCCCCGA TTATAAATTG CTCTACTATA GCTATGCGGG GTTAGTGGTG GGCTTTTTCT CCTACTATTA CCTCTATTCG GGCAATTGGA ATTACCTCCT ATCAGGAGCC TGGACTCGTC ACGAAAATCA ATTAAATCTC CTGTTTACCC CAGGATTTTA TTTTTTTGGA ACTCCTATCC CTATTCCTCG GTTAGTGGCT GCTCCCCTAA CCATTGCCGT CTCTATGGCG GTTGCTTATT ATATCGGACG CTATCTCGAA AAATTCTACA TCACCTATCA ACGGCGACGG AGTCCGAATA TACCCCCCCT ACAGCTACAA CACGAAATTT TCACCCTGAC GACCTTTTGG GTGTTCAATT TCTTTTTTAT CTTCGCTGGA CGCTCCTATA TTTCTAAGTT ACCCATCCAG CTACAATACC TCTTCAATAT AACTGTGGTG GTGGTCAGTA CCCTCTGGTT ATATCGGACT TGGGGACGAA CTAATGATCG CTATTCCCGT GAAAGTTTGG CAGGTCGTTT ACGTCAACAA TTAACCCGTC TACAGGTGGA TGTCTCCCGT TTCTTGGAAG GCCGTTCCCT GGAATCCTTA AACTCGGATG AGGTGTATGT TTTAGCCAAG ATTCTCCCCG GATTTACAGG AGATAAGCGT CTAGAAGCCT ATAAAGGGAT TTTACGCGAT TCCTTGGAAG AAGGCTACGT CAACAGTTCC AGTAGCTTAG AAGTGTTGCG TCAGATGCGC GAAGAATTGG GGGTCAGTGA ACAGGAACAC TTGACCATTT TGACCGAATT AGGCGTAGAA GATCCGGATT TATTCGATCC GAACCAACAA CGGAGCCGAG AAAATCAACT ACGGATGCAA AGCTTACGCT CTCGCATTCG AGGAATGGTC GGGGGTGGAA AACGCCGACG GGGAGCCCAA GGGTTAGCCA AAGACCTGTT TAAAGTGGTT AAGAAGGAGA AATCTATCGG GGATGTTATT GAAAAAGAAG GAACGGTGCG ATCGCTGTCC 
CAACAATACG GGTTAACCCT CGAAGAAGAA GCCCAAATTC TGGCGGATTT AGACGAAGAC TCTCAAATCC TCCGTCGGGG TAATATTCTC CTTGAACAAC TCAACAACCT ACGGGAACAA GAATTAGCTC TCCTACATCC CCCGTCGAGT TTGCACTCTC CCCATCTACG GACGGGATTA CAAATTTTAC GGTCTACTGT CGCTCAAAAG CAGCGAGTCA TTGCTAAAGG GGTTCTTAAT ATTCTTGAAG AATTACAGAC CGGAACCGAA GCAACTCGCT TTGCCTTAGC TTTGGCTAGT TTAACCCACC ATGTTTTACC TGAATTATTG GAAAATGGTG ATCCGTTGTG GGAAGATCGC CTTGATGGGA CGATTTTCTC CCGCATGGAA GAACAACTCA AGCAAACCGA TGATCAATCG ACTCAGGTTG AGGATACGGT GATGGTAGGC TATTTGGAGG CGTTATTTAC CGAACCCGAT TCGTTAACCA AGGCAGTTAG TTTGTATCTC TTAGCCAAAG TCAACCAGGA ACGGGCACAA CAACAAGCGC GTCAATTATT AGAGAGTCAA TTGATGTTGA ATACCCTCTT AAAAGAAACC GCACAACAGA TTCTGCAAAC GTCTGAGGAG AAAGAGTCTA TTACCCCCCT CGAAAAACTC TTGTATCTGT CGAGTTACGA GTTATTAATG GGGTTAAAAG CGGAAAGCTT GATGAATTTA GCCTATCAAG TTTCCTTGAA GGAATACGAG CGATCGCAGG TTATCTTAGA ACAAGGAGAA ATCTCTAAAG ACCTCTTTTT GTTACTACGC GGCAAGTTAG AAGCGAAGCA TCGCTTAGAA GATGGGGAGG TCGAGACAGA GGAAGTCGTC CCGGTGGTTC CTTTAAATGA GTTAGAAGTC TTAGGAAGAA TGGAAACCGA CTCTACCTAT ACTGTAACCT CTGCTAAAGC GTCTCTGTTG GCGATCGAAG TAGCCATGTT TGACTCGCTT CTATCTCAAG ATACTGCTTT CTCTCGTCAA GTGATCGAGC AAGAAAGCCG TCGCTTACAA CAGTTAAGCA GCTAA
|
Protein sequence | MLAKIPEKQM HIVRSILAGG WLLLICSLFY DPISPWLTYP DSTWSPFRLN PNACIQVQGQ CLPQEPYPMG ASLFWGLIVP TGVFVLLVFG HEFWRRICPL SFFSQLARRL GKQRQRKRVN AETGQVRYEL VKIKHDSWLG QNYIKVQMGL LYVGVCGRIL FYDSMRPVLG SFLLFSIGAA ILVGYLYGGK SWCQYFCPMA PVQEFYSEPR GLLNSIAHEG QSGVITQSMC RQINEDGTES SACVACHSPC VDIDAERAYW DRMKRPDYKL LYYSYAGLVV GFFSYYYLYS GNWNYLLSGA WTRHENQLNL LFTPGFYFFG TPIPIPRLVA APLTIAVSMA VAYYIGRYLE KFYITYQRRR SPNIPPLQLQ HEIFTLTTFW VFNFFFIFAG RSYISKLPIQ LQYLFNITVV VVSTLWLYRT WGRTNDRYSR ESLAGRLRQQ LTRLQVDVSR FLEGRSLESL NSDEVYVLAK ILPGFTGDKR LEAYKGILRD SLEEGYVNSS SSLEVLRQMR EELGVSEQEH LTILTELGVE DPDLFDPNQQ RSRENQLRMQ SLRSRIRGMV GGGKRRRGAQ GLAKDLFKVV KKEKSIGDVI EKEGTVRSLS QQYGLTLEEE AQILADLDED SQILRRGNIL LEQLNNLREQ ELALLHPPSS LHSPHLRTGL QILRSTVAQK QRVIAKGVLN ILEELQTGTE ATRFALALAS LTHHVLPELL ENGDPLWEDR LDGTIFSRME EQLKQTDDQS TQVEDTVMVG YLEALFTEPD SLTKAVSLYL LAKVNQERAQ QQARQLLESQ LMLNTLLKET AQQILQTSEE KESITPLEKL LYLSSYELLM GLKAESLMNL AYQVSLKEYE RSQVILEQGE ISKDLFLLLR GKLEAKHRLE DGEVETEEVV PVVPLNELEV LGRMETDSTY TVTSAKASLL AIEVAMFDSL LSQDTAFSRQ VIEQESRRLQ QLSS
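The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method. It strips the spacing, computes GC content, and translates the CDS with the amino acid assignments of translation table 11. Those assignments match the standard code, and table 11 differs only in its permitted start codons, which does not matter here because the gene begins with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of NCBI table 11
# (identical to the standard code for elongation codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS from its first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = clean_sequence("ATGTTGGCTA AAATTCCCGA AAAGCAGATG")
print(translate(first_blocks))  # MLAKIPEKQM
```

The translated prefix matches the first block of the protein sequence listed in this record. Passing the full cleaned gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.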
|