Gene PCC8801_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3059 
Symbol 
ID7105442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3200689 
End bp3201747 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content41% 
IMG OID643476083 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_002373196 
Protein GI218247825 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTAC GGCGATCGCT TCTATTATGG TATCAGCATC AGGGACGAGA GTTACCCTGG 
AGAAATATCG ATGATCCCTA TGCTATCTGG GTTTCGGAGA TTATGCTGCA ACAAACCCAG
GTTAAGACTG TTATTCCCTA TTATCAGCGA TGGTTAGCAC AATTTCCTAA TATTCAAACA
TTAGCAACCT CTGACTTGCA AACTGTTCTC AAGGCTTGGG AAGGCTTAGG CTATTATACC
CGTGCGCGAA ATCTTTATAA AACGGCTCAG ATTATTTTAA AGGATTACAG GGGAATTTTT
CCCAGAGAGT TAGAAAAAGT CGTAAAATTG CCAGGAATTG GACGAACGAC GGCTGGAGGC
ATCCTCAGTT CAGCGTTTAA TCAACCAATC TCTATTTTAG ATGGTAACGT CAAGCGAGTG
TTAGCAAGAT TAGTTGCCCT TAGCGATCCT CCTGCAAAAG CGATACAATT TTTATGGGAC
GTATCGGATA GTTTACTCGA TCCCGACAAT CCTAGGGATT TTAACCAAGG GTTGATGGAT
TTAGGGGCAA CCATTTGCAC CCGAAGTCAG CCAAAATGTT TATTGTGTCC CTGGTTATCC
CACTGTCAAG CTTATCAACA AGGAAAACAA AATCAACTCC CCATGCGTGA AGATTCCTCT
CCCTTACCCC ACAAAAAAAT TGGTGTTGCA GTGATTTATA ATAATGCAGG AGAAATCTTG
ATTGATCGCC GTCCCGATAA AGGATTATTA GGAGGGTTAT GGGAATTTCC TGGGGGAAAG
ATTGAAGAAA ATGAAACGGT AGAAGAGTGT ATTAAACGAG AAATTTTAGA AGAAATTGCC
ATTGATATCG AAGTGGGAGA ACATTTAATT ACCCTCGATT ATGCCTATAC TCATTTTAAA
GTCACTTTAA TTGTTCATCT GTGTCGTCAT GTTGCTGGAG AACCCCAAGC GATCGAATGT
CAAGAAATTC GCTGGACAAC CTTAGATGAA ATTGATAGTT TTCCGTTTCC TAAAGCCAAT
AGTAAGATTA TCGAAGCTTT AAGAAACAAT CAACCATAA
 
Protein sequence
MALRRSLLLW YQHQGRELPW RNIDDPYAIW VSEIMLQQTQ VKTVIPYYQR WLAQFPNIQT 
LATSDLQTVL KAWEGLGYYT RARNLYKTAQ IILKDYRGIF PRELEKVVKL PGIGRTTAGG
ILSSAFNQPI SILDGNVKRV LARLVALSDP PAKAIQFLWD VSDSLLDPDN PRDFNQGLMD
LGATICTRSQ PKCLLCPWLS HCQAYQQGKQ NQLPMREDSS PLPHKKIGVA VIYNNAGEIL
IDRRPDKGLL GGLWEFPGGK IEENETVEEC IKREILEEIA IDIEVGEHLI TLDYAYTHFK
VTLIVHLCRH VAGEPQAIEC QEIRWTTLDE IDSFPFPKAN SKIIEALRNN QP