Gene Plav_0701 details

Gene Information

Locus tag: Plav_0701
Symbol:
ID: 5454570
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 762662
End bp: 764509
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 65%
IMG OID: 640876270
Product: A/G-specific adenine glycosylase
Protein accession: YP_001411981
Protein GI: 154251157
COG category: [L] Replication, recombination and repair
COG ID: [COG1194] A/G-specific DNA glycosylase
TIGRFAM ID: [TIGR01084] A/G-specific adenine glycosylase
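
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `gene_lengths` is hypothetical and not part of the source database.

```python
def gene_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon,
    which is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(gene_lengths(762662, 764509))  # (1848, 615)
```

Under these assumptions the result agrees with the listed gene length of 1848 bp and protein length of 615 aa.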


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0292905
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGCGG CGGGCAGCGC CCTCAACCAA ACCTTTCAGT TGCCTTGGAC CGAGAATCGG 
GACTCGCGCG GCTCAGCGCC GCCGCTGCGC GCCGATGACC CGCTCGCCAA GCTTTTCCAG
CGACTTTCTG AGCGCCGGTT CTTCAATTTT TTCAAGGCCT TGGTCAAGCG CCGCGCGCTC
CTCCGGCGCC AGCGGCCTGA TCTTGCGATA GCGCGGCCGC GCCTTGACCG GCAGCGGGCC
TTGCACGAGT TTCAGCCGCG CGACGGCGCT GTATCCATAA TAGGAATTGA TGCGCTCGAC
GATCTGCGGT TCGAGGTGAC GGATTTCGAC CGCGACCGGA CCGTCGACAC GGATGGTGAG
AACCGCGCCG CTGCGCCGGC CCTTGCCCGC GTCGTCACCG GAAGTCCTTG GAAACACAAG
CTTTTCCGGC GCCGAAAATT CCGCGAGGCC GCCGCCGACG ATCTCCGGCC AATGCGCCAG
CACATGCGCT TCCTGGAAGC CGCGCTTCAC GAAGGCGGCG CGCGCCACCT GCATCACCCG
CGCGCCGACG GGCGGCAGGC TGAAGCGGCG CGACGCAATC CTGTTGCCGG TGATGCCGTT
CATCTCGTTT CCTCTGAACC GGCGCGCATT CTGACTCGAA ATCGTCATCC ATGCGAATTG
ACGGATGAGG GCATTCCAAG CGAAGACTGG TGCATGAGCA TGAAGGCGAA AAAAGCCAAG
GAAATCAGCG GGAAAGCGGC CGCCGCGCCG CTGCTCGCCT GGTATGACAA ACATGCCCGC
GTGCTGCCCT GGCGTGCGCG AAAAGGCGAA CGGGCGGACC CTTATGCGGT CTGGCTTTCC
GAAATCATGC TGCAACAGAC GACGGTGGCG ACGGTCGGGC CTTATTTTAC GGGCTTTTTG
AAGCGCTGGC CGAATGTGGA GGCACTGGCC GCCGCGCCGC AGGAAGAGGT GATGAAGGCA
TGGGCGGGGC TCGGTTATTA CAGCCGGGCG CGCAATCTTC ATGCCTGTGC GAAGGAAGTA
TCGTCCGAGT ACGGAGGGAA ATTTCCCGAT ACGGTGGAAG GACTTGAGAG CCTGCCGGGC
ATCGGCCCCT ATACGGCGGC GGCAATCGCG GCGATCGCCT TTGGAAGAGC TGCGACGGTG
GTGGATGGAA ATGTCGAGCG CGTGGTGGCG CGGCTTTTCG AGATCGAAAC GCCGCTGCCG
GCGGCGAAGC CGGATATCCG GGAGAAGGCC CGCACCCTGA CGCCGGAGCA GCGGGCAGGC
GACTTCGCGC AGGCCATGAT GGACCTCGGC GCGACCATCT GCACGCCGCG CTCTCCCGCC
TGTAATCGCT GTCCGATCAA TGACTTATGC GATGCACGGG CGGCGGGCAC GCAAAATCTC
CTGCCTGCCC GGGCGCCGAA AAAGGCCCGC CCGACGCGCC GCGGCGCCTG CTTCTGGCTG
GTGCGGGAGG GGCATGTCTG GCTCCGCCGG CGGCCCGACA AGGGGCTCCT CGGCGGCATG
CTGGAAGTGC CCGGAACGCC CTGGGATGAG AGCGACCGTC ATCGAACTGT CATCGAACTG
TCACACGACG AAAATGGAGG CCGCCGGGGT GTCGCCGGAG AGGTGCTCGA CCACGCGCCG
ATGGAAGCGG AATGGCGACT TGTCCCCGGT TTGGTTGAAC ACACGTTCAC GCATTTTCAT
CTCGAGCTGG AAGTCTTCAC CGCGACGACG CGAAAAAAAA TTGTCCCCGG GCGCGAAGGC
ATGTGGGTGC CGCTTGAGGA GGTCGCCGGC GAGGCGCTGC CGACCGTGAT GCGGAAGGTC
GCGGCACATG CGATGCCGGA TGCGGGACCG CTTTTTGTGA AGCGGTGA
 
Protein sequence
MAAAGSALNQ TFQLPWTENR DSRGSAPPLR ADDPLAKLFQ RLSERRFFNF FKALVKRRAL 
LRRQRPDLAI ARPRLDRQRA LHEFQPRDGA VSIIGIDALD DLRFEVTDFD RDRTVDTDGE
NRAAAPALAR VVTGSPWKHK LFRRRKFREA AADDLRPMRQ HMRFLEAALH EGGARHLHHP
RADGRQAEAA RRNPVAGDAV HLVSSEPARI LTRNRHPCEL TDEGIPSEDW CMSMKAKKAK
EISGKAAAAP LLAWYDKHAR VLPWRARKGE RADPYAVWLS EIMLQQTTVA TVGPYFTGFL
KRWPNVEALA AAPQEEVMKA WAGLGYYSRA RNLHACAKEV SSEYGGKFPD TVEGLESLPG
IGPYTAAAIA AIAFGRAATV VDGNVERVVA RLFEIETPLP AAKPDIREKA RTLTPEQRAG
DFAQAMMDLG ATICTPRSPA CNRCPINDLC DARAAGTQNL LPARAPKKAR PTRRGACFWL
VREGHVWLRR RPDKGLLGGM LEVPGTPWDE SDRHRTVIEL SHDENGGRRG VAGEVLDHAP
MEAEWRLVPG LVEHTFTHFH LELEVFTATT RKKIVPGREG MWVPLEEVAG EALPTVMRKV
AAHAMPDAGP LFVKR
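
The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a block could be cleaned up, checked for GC content, and translated. It assumes the standard amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle the alternative start codons that table 11 allows. The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_bases = clean_sequence("ATGGCCGCGG CGGGCAGCGC CCTCAACCAA")
print(translate(first_bases))  # MAAAGSALNQ
```

The translated prefix matches the first ten residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the listed GC content.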