Gene Cyan7425_4181 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan7425_4181 
Symbol 
ID7290132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7425 
KingdomBacteria 
Replicon accessionNC_011884 
Strand
Start bp4231952 
End bp4233112 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID643587154 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_002484855 
Protein GI220909544 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.215625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.631474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCAGAG CGAAAGATAG CCAGTCTTAT TATCAGCTCC CTTGGGGTGG GCAGCGTTCC 
AGTTCAGCCT TGCCAGGGGT AGATTTTACG CCAGCCCAAA TTTTAGATTT GCAGCGATCG
CTCCTGCACT GGTACCGTCA GCATGGTCGT TCCCTACCCT GGCGAGAAAC CAGTGATCCT
TATGCAATCT GGGTTTCCGA AATCATGCTG CAGCAAACCC AGGTGCAAAC CGTTATTCCC
TACTACCAGC GCTGGTTAGC GGCTTTACCG ACGATCGCCA CCGTAGCTGC GGCTGAACAA
CAACAGGTTT TAAAACTCTG GCAGGGCCTG GGCTACTATT CCAGAGCTAG AAATTTGCAC
CAGGCCGCCC AGTTGATTCA GCAGGAGTTT GCCGGCCAGT TTCCCTCCCA GTTAGAAGCG
GTTTTGAAGT TACCGGGCAT TGGCCGCACC ACCGCTGGAG GCATTCTCAG TTCAGCTTTT
GCTCAACCAG TGGCAATTCT GGATGGTAAT GTGAAGAGGG TATTAGCCCG CTTGCTTGCC
CTACCTGTTC CACCCAGAAA AGCTAAAGGG TTCCTCTGGC AATGGTCCGA TCGCCTGCTC
GATCGCACTC AGCCACGGGA GTTCAATCAA GCTTTAATGG ATTTGGGGGC AACGGTGTGT
GTGCCGAAAA AGCCCGACTG TCCCCTTTGT CCCTGGTCTA ACCACTGTCA AGCCCTGCAG
TTAAATTTGC AATCCGAGTT GCCCGTGACT GAAACCGCTG CTCCCCTGCC CCACAAATCC
ATTGGTGTAG CTGTGATCTG GAATGATCGC GGTGAAATTT TGATCGATCG GCGTCCGCAG
AAGGGATTGC TAGGGGGGCT ATGGGAATTT CCGGGAGGGA AAATTGAACC GGGAGAAACC
GTGATGGCCT GTATCCAGCG AGAAATCCGG GAAGAATTGG CGATTGAAAT TGAGGTGGGA
GAACCACTGA TCACGATCGA CCATGCCTAT ACTCACTTTA AGGTCACCCT GAACGTGCAC
CACTGCCGGT ACGTAAGCGG GGAACCCCAA CCTCTGGGCT GCGATGAAGT GCGCTGGGTT
ACCCTGGAAG AGATCGATCA GTATCCCTTT CCCAAGGCCA ACGAGCAGAT CATTGCTGCT
TTGCGAAAGA ATCAGAAATA G
 
Protein sequence
MARAKDSQSY YQLPWGGQRS SSALPGVDFT PAQILDLQRS LLHWYRQHGR SLPWRETSDP 
YAIWVSEIML QQTQVQTVIP YYQRWLAALP TIATVAAAEQ QQVLKLWQGL GYYSRARNLH
QAAQLIQQEF AGQFPSQLEA VLKLPGIGRT TAGGILSSAF AQPVAILDGN VKRVLARLLA
LPVPPRKAKG FLWQWSDRLL DRTQPREFNQ ALMDLGATVC VPKKPDCPLC PWSNHCQALQ
LNLQSELPVT ETAAPLPHKS IGVAVIWNDR GEILIDRRPQ KGLLGGLWEF PGGKIEPGET
VMACIQREIR EELAIEIEVG EPLITIDHAY THFKVTLNVH HCRYVSGEPQ PLGCDEVRWV
TLEEIDQYPF PKANEQIIAA LRKNQK