Gene Information |
Locus tag | Haur_2113 |
Symbol | |
ID | 5734001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2653451 |
End bp | 2654392 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279254 |
Product | CRP/FNR family transcriptional regulator |
Protein accession | YP_001544881 |
Protein GI | 159898634 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0664] cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases |
TIGRFAM ID | |
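The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming one terminal stop codon is excluded."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(2653451, 2654392)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```

Both results agree with the Gene Length (942 bp) and Protein Length (313 aa) fields of the record.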
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000556838 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAAAG TGCTTCTCAT CAGCAGCGAT CTACCGGATC AAGCTCTGCT TGGCCAGTTG CTGGATGCTG GATATCGATT GAGTTTGTTG CGGCTTGAGC AACTAGCTAA GCAAACAACC ATCGCCGATC AAAGCCTCTT GGTCGGCTGT TTTACCAATG AGGATGAATT AGCGCAACTC ACCGAAAACT TGGGCAGTCA AGAGCCACTA TGGTGGGGCT GGAATCAGAG TAACTATCCG CATTTAACCT TGAAGGCCTA TGAAGCTGGC GCTCGTCATG TGATTACCAA CGACGCAAAC GCTAGCCAAA TCATCCAAAA CTTAAATGCG CTGCACGTCA GCCAACTTCA TCAGCAAACT ACCCGCGGTC GAGAGCAGCA GTATCCACGT GGGGCAATGG TGCATTTACA AGCCGATCAA GCCTTATTGA TTGAAACTGG AATTTTGAGC TTGCAAGTCA ATCATCCTGA TGGCTCGAAC GTGCTTCTAG GTTTGTTCGG GCCAAACCAA TTAGTCTGTG GCCATCCACA CGATGGCTGT GCGATCTATC TGCAAGCGCA TACCCCGATT AGCGCTCAGC TCTTGCCGTG GCAACGGGTC TTGAGTGAGC CTTCGTTGAT CGAACGATTA CGCATGCGGC TGCAACTAAT GGAAGCGTGG GCTTCGTGCC AAGCCCATCC TTACCTTGAT CAACGAGTTT TAGGCATTTT GAATTTGTTG GCCGAGCAAT TTGGCAAACA GCATAACCAG GGCTTATTGA TTGACGTGCG GATTACCCAT GAACAACTAG CCTCAGCGGT TGGCTCAACC CGCGCAACGA TCTCACGGAT TATCCGCGAT TTGCGCACAC GAAGTATGCT TGATAGCCAC TTCAGCGGCA GTAATGAGCG TTTTTGGCTC CCAATCGTGC CCCACTATCA TCACACCCAT CCCTTTGCTT AA
Protein sequence | MLKVLLISSD LPDQALLGQL LDAGYRLSLL RLEQLAKQTT IADQSLLVGC FTNEDELAQL TENLGSQEPL WWGWNQSNYP HLTLKAYEAG ARHVITNDAN ASQIIQNLNA LHVSQLHQQT TRGREQQYPR GAMVHLQADQ ALLIETGILS LQVNHPDGSN VLLGLFGPNQ LVCGHPHDGC AIYLQAHTPI SAQLLPWQRV LSEPSLIERL RMRLQLMEAW ASCQAHPYLD QRVLGILNLL AEQFGKQHNQ GLLIDVRITH EQLASAVGST RATISRIIRD LRTRSMLDSH FSGSNERFWL PIVPHYHHTH PFA
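The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a sketch under stated assumptions, not the method used by the database: the sequences are assumed to be copied as shown, in space-separated blocks of ten characters, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so the standard codon table is used here.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering; table 11 uses
# the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = "ATGCTGAAAG TGCTTCTCAT CAGCAGCGAT"
print(translate(prefix))  # MLKVLLISSD
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content field; the printed prefix matches the first block of the protein sequence.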