Gene Pnuc_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnuc_1916 
Symbol 
ID5052259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 
KingdomBacteria 
Replicon accessionNC_009379 
Strand
Start bp1994722 
End bp1995867 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID640472090 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001156692 
Protein GI145590095 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAAA CGGTAATCGA TTCTTTTTCA AAAAAACTCA TCGCGTGGCA TGCCCAGAGC 
GGGCGCTCTG GATTGCCTTG GCAGGGCAAT CGCGATCCTT ATGCGGTATG GGTTTCCGAA
ATCATGTTGC AGCAAACTCA GGTAGCTACT GTTCTCGAGC GTTATCCCCG CTTTATGAAG
CGCTTCCCGA CGGTTAAAAA ACTGGCTGCT GCAGACGTTG ATGACGTTCT AGCGGAGTGG
GCTGGTTTGG GTTATTACTC TCGCGCAAGA AATTTACATG CGTGCGCACA ACAGATTGTT
CGAGAATTTG CAGGAAAGTT TCCACAAGAT CCTGCGCTAC TAGAGCAGTT AAAGGGAATT
GGGCGATCCA CTGCAGGTGC TATTGCTGCT TTTGCATTTC ATGAGCGAGC ACCAATTTTA
GATGCCAATG TAAAACGAAT CTTGGCACGT TTATTTGCAA TTGAAGGCGC CATTCAAGAT
AAAGCAGTCA ATGACTCTTT GTGGAAATTA GCTACAGAAT TATTGCCTCT AAAGCCGCAA
GATATGCCTA CCTATACGCA GGCATTAATG GACTTTGGAG CAACGTGGTG CACTTCTCGT
AAGCCAGTCT GTTTAAGTGG TGAGAAAAAA TGTCCTTTTG CTAAGGACTG TCAGGCAAAC
CTCAGCGATC AAGTGCTTGC TTTGCCGAAA AAGGTCATCA AGAGTAAATC CCCGGAGTTT
GATTGCCATA TGTTGTTGTT GCGCTCAGGC AATTCGGTAC TACTTCAAAA GCGTCCAGAT
AAAGCGATAT GGGGCGGTCT ATGGTCTTTA CCGGAATCGC CTTGGGCTCC GAGGGACCTC
AGTTTTTTAA AAGAGGAGCT AAGCTCAAGC AATTTATTAA GTCTTACGCT TCCTGAAGAA
AAATCAGTCC TTTTTTTGAG AAATTGCACA CCCCCAAACA GGGGGTTTTA CATCAAACAT
GTTTTTACGC ATCGCTGTCT ATGGATGCAG ATATGGGAGG TGAATGCAAT GAAGACCATC
CCATTTACGA ATCCAAATTT GAAATGGGTG CCCCTAAAGC AATTGGGCCG CCATGGATTA
CCGCAGCCAA TTAAAGTTTT GTTACAGGGA TTGAGTCTAG CTCGCGATGG CGATCTAAAA
AATTAA
 
Protein sequence
MSKTVIDSFS KKLIAWHAQS GRSGLPWQGN RDPYAVWVSE IMLQQTQVAT VLERYPRFMK 
RFPTVKKLAA ADVDDVLAEW AGLGYYSRAR NLHACAQQIV REFAGKFPQD PALLEQLKGI
GRSTAGAIAA FAFHERAPIL DANVKRILAR LFAIEGAIQD KAVNDSLWKL ATELLPLKPQ
DMPTYTQALM DFGATWCTSR KPVCLSGEKK CPFAKDCQAN LSDQVLALPK KVIKSKSPEF
DCHMLLLRSG NSVLLQKRPD KAIWGGLWSL PESPWAPRDL SFLKEELSSS NLLSLTLPEE
KSVLFLRNCT PPNRGFYIKH VFTHRCLWMQ IWEVNAMKTI PFTNPNLKWV PLKQLGRHGL
PQPIKVLLQG LSLARDGDLK N