Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0701 |
Symbol | |
ID | 5454570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 762662 |
End bp | 764509 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640876270 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001411981 |
Protein GI | 154251157 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0292905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGCGG CGGGCAGCGC CCTCAACCAA ACCTTTCAGT TGCCTTGGAC CGAGAATCGG GACTCGCGCG GCTCAGCGCC GCCGCTGCGC GCCGATGACC CGCTCGCCAA GCTTTTCCAG CGACTTTCTG AGCGCCGGTT CTTCAATTTT TTCAAGGCCT TGGTCAAGCG CCGCGCGCTC CTCCGGCGCC AGCGGCCTGA TCTTGCGATA GCGCGGCCGC GCCTTGACCG GCAGCGGGCC TTGCACGAGT TTCAGCCGCG CGACGGCGCT GTATCCATAA TAGGAATTGA TGCGCTCGAC GATCTGCGGT TCGAGGTGAC GGATTTCGAC CGCGACCGGA CCGTCGACAC GGATGGTGAG AACCGCGCCG CTGCGCCGGC CCTTGCCCGC GTCGTCACCG GAAGTCCTTG GAAACACAAG CTTTTCCGGC GCCGAAAATT CCGCGAGGCC GCCGCCGACG ATCTCCGGCC AATGCGCCAG CACATGCGCT TCCTGGAAGC CGCGCTTCAC GAAGGCGGCG CGCGCCACCT GCATCACCCG CGCGCCGACG GGCGGCAGGC TGAAGCGGCG CGACGCAATC CTGTTGCCGG TGATGCCGTT CATCTCGTTT CCTCTGAACC GGCGCGCATT CTGACTCGAA ATCGTCATCC ATGCGAATTG ACGGATGAGG GCATTCCAAG CGAAGACTGG TGCATGAGCA TGAAGGCGAA AAAAGCCAAG GAAATCAGCG GGAAAGCGGC CGCCGCGCCG CTGCTCGCCT GGTATGACAA ACATGCCCGC GTGCTGCCCT GGCGTGCGCG AAAAGGCGAA CGGGCGGACC CTTATGCGGT CTGGCTTTCC GAAATCATGC TGCAACAGAC GACGGTGGCG ACGGTCGGGC CTTATTTTAC GGGCTTTTTG AAGCGCTGGC CGAATGTGGA GGCACTGGCC GCCGCGCCGC AGGAAGAGGT GATGAAGGCA TGGGCGGGGC TCGGTTATTA CAGCCGGGCG CGCAATCTTC ATGCCTGTGC GAAGGAAGTA TCGTCCGAGT ACGGAGGGAA ATTTCCCGAT ACGGTGGAAG GACTTGAGAG CCTGCCGGGC ATCGGCCCCT ATACGGCGGC GGCAATCGCG GCGATCGCCT TTGGAAGAGC TGCGACGGTG GTGGATGGAA ATGTCGAGCG CGTGGTGGCG CGGCTTTTCG AGATCGAAAC GCCGCTGCCG GCGGCGAAGC CGGATATCCG GGAGAAGGCC CGCACCCTGA CGCCGGAGCA GCGGGCAGGC GACTTCGCGC AGGCCATGAT GGACCTCGGC GCGACCATCT GCACGCCGCG CTCTCCCGCC TGTAATCGCT GTCCGATCAA TGACTTATGC GATGCACGGG CGGCGGGCAC GCAAAATCTC CTGCCTGCCC GGGCGCCGAA AAAGGCCCGC CCGACGCGCC GCGGCGCCTG CTTCTGGCTG GTGCGGGAGG GGCATGTCTG GCTCCGCCGG CGGCCCGACA AGGGGCTCCT CGGCGGCATG CTGGAAGTGC CCGGAACGCC CTGGGATGAG AGCGACCGTC ATCGAACTGT CATCGAACTG TCACACGACG AAAATGGAGG CCGCCGGGGT GTCGCCGGAG AGGTGCTCGA CCACGCGCCG ATGGAAGCGG AATGGCGACT TGTCCCCGGT TTGGTTGAAC ACACGTTCAC GCATTTTCAT CTCGAGCTGG AAGTCTTCAC CGCGACGACG CGAAAAAAAA TTGTCCCCGG GCGCGAAGGC ATGTGGGTGC CGCTTGAGGA GGTCGCCGGC GAGGCGCTGC CGACCGTGAT GCGGAAGGTC GCGGCACATG CGATGCCGGA TGCGGGACCG CTTTTTGTGA AGCGGTGA
|
Protein sequence | MAAAGSALNQ TFQLPWTENR DSRGSAPPLR ADDPLAKLFQ RLSERRFFNF FKALVKRRAL LRRQRPDLAI ARPRLDRQRA LHEFQPRDGA VSIIGIDALD DLRFEVTDFD RDRTVDTDGE NRAAAPALAR VVTGSPWKHK LFRRRKFREA AADDLRPMRQ HMRFLEAALH EGGARHLHHP RADGRQAEAA RRNPVAGDAV HLVSSEPARI LTRNRHPCEL TDEGIPSEDW CMSMKAKKAK EISGKAAAAP LLAWYDKHAR VLPWRARKGE RADPYAVWLS EIMLQQTTVA TVGPYFTGFL KRWPNVEALA AAPQEEVMKA WAGLGYYSRA RNLHACAKEV SSEYGGKFPD TVEGLESLPG IGPYTAAAIA AIAFGRAATV VDGNVERVVA RLFEIETPLP AAKPDIREKA RTLTPEQRAG DFAQAMMDLG ATICTPRSPA CNRCPINDLC DARAAGTQNL LPARAPKKAR PTRRGACFWL VREGHVWLRR RPDKGLLGGM LEVPGTPWDE SDRHRTVIEL SHDENGGRRG VAGEVLDHAP MEAEWRLVPG LVEHTFTHFH LELEVFTATT RKKIVPGREG MWVPLEEVAG EALPTVMRKV AAHAMPDAGP LFVKR
|
| |