Gene P9211_17541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_17541 
SymbolmutY 
ID5730628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1578725 
End bp1579924 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content35% 
IMG OID641286139 
ProductA/G-specific DNA glycosylase 
Protein accessionYP_001551639 
Protein GI159904295 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR00586] mutator mutT protein
[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTTG GCTTTAGTCA AAATGATTTA GCAAATGGGG TTTTGGAAAA TCCTCAGAAG 
ATAGACGCTC TTAGGTCAAC TCTTCTTCAA TGGTTTAAAT CAAATGGACG TCATTATATC
CCTTGGAAAT TAACTAAGGA TGGAACTTTA CCAAACGAGA ATCAATATCT TGCTGTATAT
CCAATTTTGG TTGCTGAAGT AATGCTGCAA CAGACTCAGT TGAAAGTTGT TTTACCATAT
TGGGAAAAGT GGATGTTAGC TCTGCCAACT CTTGTTGATT TGGCAAAGGC TGAAGAGGAT
AAAGTCCTTT TACTTTGGCA AGGACTTGGT TATTACTCCA GAGCAAGAAG ATTACATGTT
ACTTCTAGGA TTCTTTTGAA TTTAATTGGC ATACCTAACT CTTTAAATCC AGCTAATTGG
CCTAAGGATT TAGAAAGCTG GATGAATTTA CCTGGCATAG GGCGTAATAC GGCTGGCAGT
ATCATTTCTT CAGCATTTAA CCTCCCTAGC CCTTTACTAG ATGGGAATGT TAAACGGGTT
TTAACTAGAT TAATTGGTAG TACTAAAACT CCAAACAAAG ATTTGGCAAG GCTTTGGAAA
TTGAGTGATT TGTTACTAGA TAAAAATCTT CCTAGAACAT TTAATCAAGC TTTAATGGAT
CTAGGTGCGA CAATTTGTAC CAAGTACAAT CCTATTTGTA CCAATTGCCC ATGGCAAAAT
TATTGTTCTG CTTATAATTC AGGAAATCCT GAAAATCTAC CAGTAAAAGG TCAAAAATTG
ATTCTTTCAA AAGCTGTTAT AGGCGTTGGC TTGATTCTTA ATAAAAATCA GGATGTTTTG
ATTGATCAGA GACTTGATGA AGGAAGTATG GGAGGAATGT GGGAATTTCC CGGAGGTAAA
AAGGAAAAAG ATGAATCAAT AGAAATGACT ATTGCAAGAG AACTACGCGA AGAATTAGGC
GTTGAGGTAA AGGTAGGGAA AAAGCTTATT GAATTTGATC ATTCCTATAC CCATAAGAAA
TTACATTTTA TAGTTCATTT GTGTGAATTA ATTTCTGGGA AACCAAAACC TTTATCAAGC
CAGGAAGTCC GATGGGTAAA ATTAAGTGAT CTTCAAAATT ATCCGTTTCC TAAAGCTAAC
TCATATATGA TTTCTGCTCT TAAAGAATAT TTTCTTATAT CCAAGACAAA GATGAAGTAA
 
Protein sequence
MAVGFSQNDL ANGVLENPQK IDALRSTLLQ WFKSNGRHYI PWKLTKDGTL PNENQYLAVY 
PILVAEVMLQ QTQLKVVLPY WEKWMLALPT LVDLAKAEED KVLLLWQGLG YYSRARRLHV
TSRILLNLIG IPNSLNPANW PKDLESWMNL PGIGRNTAGS IISSAFNLPS PLLDGNVKRV
LTRLIGSTKT PNKDLARLWK LSDLLLDKNL PRTFNQALMD LGATICTKYN PICTNCPWQN
YCSAYNSGNP ENLPVKGQKL ILSKAVIGVG LILNKNQDVL IDQRLDEGSM GGMWEFPGGK
KEKDESIEMT IARELREELG VEVKVGKKLI EFDHSYTHKK LHFIVHLCEL ISGKPKPLSS
QEVRWVKLSD LQNYPFPKAN SYMISALKEY FLISKTKMK