Gene PCC8801_2345 details


Gene Information

Locus tag: PCC8801_2345
Symbol:
ID: 7105237
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 2415218
End bp: 2416126
Gene Length: 909 bp
Protein Length: 302 aa
Translation table: 11
GC content: 47%
IMG OID: 643475386
Product: transcriptional regulator, AraC family
Protein accession: YP_002372515
Protein GI: 218247144
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG0350] Methylated DNA-protein cysteine methyltransferase; [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID: [TIGR00589] O-6-methylguanine DNA methyltransferase
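
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values listed here but are not stated by the record itself. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2415218, 2416126)
print(length, protein_length(length))
```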


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCAG TTCAGATGAA TTTCTTGATC GAAACCAGCA CATCTGATGC CGAAACCTAT 
GAGCGGATTG CTCAAGCGAT CGCGTTTATG CGTCAGAATC ATTTGAATCA GCCGGATTTA
GCAACAATCG CTCAGCAGGT GCATCTTAGC GAATATCACT TTCAACGGCT TTTCACCCGA
TGGGCAGGCA TCAGCCCGAA GCACTTTGTG CAATATCTCA CGGTGGAATA TGCAAAATCA
AAAATCGCTG AGACTGCTAA TCTGCTTGAT TTGACTGCGG AAGTTGGGCT ATCGAGTCCA
GGACGGTTAC ATGACCTGTT TGTGAAGTTG GAAGCCATGT CGCCTGGTGA GTTTAAGGCA
GGGGGTATCG GGTTACAGAT TGGGTATGGC ATTCATCCAA CCCCATTCGG AGACTGCTTA
ATTGCGACGA CTCCCCGTGG TATCTGTAAT CTTCATTTTC TAGATATCAC TAGCAAAGAT
GCTGTTGAAC AGGCTTTCCG CTTAGAGTGG GCAAATGCCG ATATCAGACA AGATCAACAA
GCTACTCAAG AAATCTGCGA TCGCATTTTC GAGCCAAGTA CAACTAAAAA CAAACCTTTG
GTTCTTCATG TAAAAGGGAC GAATTTTCAG ATTCAAGTTT GGCGGGCACT CTTGAGCGTT
CCTTTTGGCG GAATCACAAC TTATCAAGGA TTAGCAGCAG GGATGGGTCG CCCAACGGCA
GCAAGAGCCG TCGGTAACGC ATTGGGAAGT AATCCAGTGG CGTACTTGAT TCCCTGCCAT
CGAGTAATCC GAGAATCCGG TGAGTTCGGG GGTTTTCGTT GGGGACTCGA ACGTAAGACT
GTTCTGCTAG GTTGGGAAGC AAGTCGAAAT CAGAATCAGA ATCAGAACAG GGAGCAGGAT
TTGACATGA
 
Protein sequence
MKPVQMNFLI ETSTSDAETY ERIAQAIAFM RQNHLNQPDL ATIAQQVHLS EYHFQRLFTR 
WAGISPKHFV QYLTVEYAKS KIAETANLLD LTAEVGLSSP GRLHDLFVKL EAMSPGEFKA
GGIGLQIGYG IHPTPFGDCL IATTPRGICN LHFLDITSKD AVEQAFRLEW ANADIRQDQQ
ATQEICDRIF EPSTTKNKPL VLHVKGTNFQ IQVWRALLSV PFGGITTYQG LAAGMGRPTA
ARAVGNALGS NPVAYLIPCH RVIRESGEFG GFRWGLERKT VLLGWEASRN QNQNQNREQD
LT
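
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons), and it strips the spaces and line breaks used in the 10-base blocks of this record. The helper names `translate` and `gc_content` are hypothetical, not part of any database tool.

```python
from itertools import product

# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace and upper-case a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAACCAG TTCAGATGAA TTTCTTGATC"))  # MKPVQMNFLI
print(translate("TTGACATGA"))                         # LT
```

The two printed fragments match the first ten and last two residues of the protein sequence listed above.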