Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_2345 |
Symbol | |
ID | 7105237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 2415218 |
End bp | 2416126 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643475386 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002372515 |
Protein GI | 218247144 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0350] Methylated DNA-protein cysteine methyltransferase [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | [TIGR00589] O-6-methylguanine DNA methyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCAG TTCAGATGAA TTTCTTGATC GAAACCAGCA CATCTGATGC CGAAACCTAT GAGCGGATTG CTCAAGCGAT CGCGTTTATG CGTCAGAATC ATTTGAATCA GCCGGATTTA GCAACAATCG CTCAGCAGGT GCATCTTAGC GAATATCACT TTCAACGGCT TTTCACCCGA TGGGCAGGCA TCAGCCCGAA GCACTTTGTG CAATATCTCA CGGTGGAATA TGCAAAATCA AAAATCGCTG AGACTGCTAA TCTGCTTGAT TTGACTGCGG AAGTTGGGCT ATCGAGTCCA GGACGGTTAC ATGACCTGTT TGTGAAGTTG GAAGCCATGT CGCCTGGTGA GTTTAAGGCA GGGGGTATCG GGTTACAGAT TGGGTATGGC ATTCATCCAA CCCCATTCGG AGACTGCTTA ATTGCGACGA CTCCCCGTGG TATCTGTAAT CTTCATTTTC TAGATATCAC TAGCAAAGAT GCTGTTGAAC AGGCTTTCCG CTTAGAGTGG GCAAATGCCG ATATCAGACA AGATCAACAA GCTACTCAAG AAATCTGCGA TCGCATTTTC GAGCCAAGTA CAACTAAAAA CAAACCTTTG GTTCTTCATG TAAAAGGGAC GAATTTTCAG ATTCAAGTTT GGCGGGCACT CTTGAGCGTT CCTTTTGGCG GAATCACAAC TTATCAAGGA TTAGCAGCAG GGATGGGTCG CCCAACGGCA GCAAGAGCCG TCGGTAACGC ATTGGGAAGT AATCCAGTGG CGTACTTGAT TCCCTGCCAT CGAGTAATCC GAGAATCCGG TGAGTTCGGG GGTTTTCGTT GGGGACTCGA ACGTAAGACT GTTCTGCTAG GTTGGGAAGC AAGTCGAAAT CAGAATCAGA ATCAGAACAG GGAGCAGGAT TTGACATGA
|
Protein sequence | MKPVQMNFLI ETSTSDAETY ERIAQAIAFM RQNHLNQPDL ATIAQQVHLS EYHFQRLFTR WAGISPKHFV QYLTVEYAKS KIAETANLLD LTAEVGLSSP GRLHDLFVKL EAMSPGEFKA GGIGLQIGYG IHPTPFGDCL IATTPRGICN LHFLDITSKD AVEQAFRLEW ANADIRQDQQ ATQEICDRIF EPSTTKNKPL VLHVKGTNFQ IQVWRALLSV PFGGITTYQG LAAGMGRPTA ARAVGNALGS NPVAYLIPCH RVIRESGEFG GFRWGLERKT VLLGWEASRN QNQNQNREQD LT
|
| |