Gene PsycPRwf_0168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPsycPRwf_0168 
Symbol 
ID5205777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePsychrobacter sp. PRwf-1 
KingdomBacteria 
Replicon accessionNC_009524 
Strand
Start bp202029 
End bp203303 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content49% 
IMG OID640598380 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001279077 
Protein GI148651984 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000132987 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC CAATCGCTAA CGTAAACACC AATATTAATA CCAGCACCAG TACTAATACC 
AACACCCATC TCAAGCCACA CACCTTTAAT CACAGTGACG CTGAGCTCAC CGACTTTGCT
CCGCGCATTC TAAATTGGTT TGACATAAGC GGTCGTCATG ACCTGCCATG GCAACAGCAT
AAGACAGATA CCCCAAACCC CTATATCGTT TGGTTGTCTG AGGTGATGCT ACAACAAACC
CAAGTCACCA CAGTTATTCC CTATTTTCAG CGTTTTATAA CGTCGTTTCC CACCGTACAA
GATTTGGCCA ATGCTCAGTG GGATACAGTG GCTGAACACT GGGCAGGGCT TGGCTACTAT
GCCCGTGCTC GCAACTTACA CAAAGGCGCC AAGCAGCTGG TCGAAATTAT TAAGACAACA
GGCCGCTTTC CACAAACCGT TGAAGATTGG GAGGCGATTT CAGGTGTGGG TCAGTCTACG
GCTGGGGCGA TTGTGGCCAT GGGCCTACAT GGCTATGGGG TGATCTGTGA CGGTAACGTC
AAACGGGTGA TTACCCGCTG GGCCGGTATT GATGGCGATA TCACTAAGTC TGCGACCAAT
AAAGCGTTGT GGGCTTTAGC TGAGCGCCTG ACCCCTACTG AGGATAGCGG ACATTTTGCC
CAAGCGATGA TGGATATGGG CGCCACCTTA TGTACCCGCC GTCACCCAAG CTGCGAGGTC
TGCCCTATAA ATAGTGACTG CATTGCTTAT GCAGAGGGTA AGCAAGACTT TTATCCTGTT
AAAGCCAAAA AGAAAGCCAA ACCGAGCAAG TTTAGCAAGG TGATATTGAT TCAAAATGTG
CAGGGTGAAT TGTTATGGCT ACAGCGTCCT GATAGTGGCA TTTGGGGCGG TCTGTGGGTA
TTGCCGATGC AATTTGAAAA AAAGACACAG GGTAAAACAG TGATTAGCAC CAGCTTGCAA
GAGGCAGCAT ATGAATCAGA AAATACGTTG GCCGAGCAGA TTATCGATAA ATGGATAGCC
GATAATAACG GACAGCTACA GCTGCAATCC ATCAGTGCAG AGCTATTTGA TGACGCACCT
ATCAAACATA CGCTGACCCA CTTTCATTGG TATTTGCAGC CGCAAGCACT GCCTTTGACG
CCCGCTCAAA GCCAAGAGCT GAGTGCAACA CTGGCGGACG CTGGCATTCA CTTTGTTTGG
CAGACTGCCA CACATGCCAA GGCCCATCTA GGACTGCCTA AGGCGATGCT CAAAATATTA
CAAAGCTTAG CGTAG
 
Protein sequence
MSQPIANVNT NINTSTSTNT NTHLKPHTFN HSDAELTDFA PRILNWFDIS GRHDLPWQQH 
KTDTPNPYIV WLSEVMLQQT QVTTVIPYFQ RFITSFPTVQ DLANAQWDTV AEHWAGLGYY
ARARNLHKGA KQLVEIIKTT GRFPQTVEDW EAISGVGQST AGAIVAMGLH GYGVICDGNV
KRVITRWAGI DGDITKSATN KALWALAERL TPTEDSGHFA QAMMDMGATL CTRRHPSCEV
CPINSDCIAY AEGKQDFYPV KAKKKAKPSK FSKVILIQNV QGELLWLQRP DSGIWGGLWV
LPMQFEKKTQ GKTVISTSLQ EAAYESENTL AEQIIDKWIA DNNGQLQLQS ISAELFDDAP
IKHTLTHFHW YLQPQALPLT PAQSQELSAT LADAGIHFVW QTATHAKAHL GLPKAMLKIL
QSLA