Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0652 |
Symbol | |
ID | 6979368 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 673402 |
End bp | 674499 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643395364 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_002280175 |
Protein GI | 209548258 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.88939 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.228228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCA CCACACCCGA CACGCCCACC GCAAAGCCCC TGCTCGACTG GTACGACCGC CATCACCGCG ATCTGCCCTG GCGCATTTCG CCCGGCATGG CGGCCGGCGG CGTCAAACCC GATCCCTATC GCGTCTGGCT TTCCGAGGTG ATGCTGCAGC AGACGACGGT GCAGGCGGTC AAACCCTATT TCGCCAAGTT TCTGCAGCGC TGGCCTGAGG TGACCGATCT TGCCGCGGCC GAAAACGATG CCGTGATGGC CGCCTGGGCC GGGCTCGGCT ATTACGCGCG GGCCCGCAAC CTGAAGAAAT GCGCCGAAGC GGTGGCGAAA GAGCATGGCG GGGTCTTTCC CGATACCGAG GAAGGGCTGA AGTCGCTCCC CGGCATCGGC GATTATACGG CCGCCGCCGT CGCCGCCATC GCCTTCAACC GGCAGGCGGC TGTGATGGAC GGCAATGTCG AGCGGGTGAT CTCAAGGCTT TATGCGATCG AAACGCCGCT TCCCGCGGGA AAGCCGCTGA TGAAGGAAAA GGTGGCGCGG CTGACACCCG CGACGCGGCC CGGCGATTTC GCCCAGGCGA TGATGGATCT CGGGGCGACG ATCTGCACGC CGAAGCGGCC GGCCTGTTCG CTCTGTCCCT TCCGCGGCGC CTGCGCGGCG CTGAAACTTT CCGATCCCGA GCTATTTCCC GTCAAGGCGG CGAAGAAGGA GAAGCCGGTG CGGCAGGGTG CGGCTTTCGT CGCGGTCACC GCAGACGGCG AGATCCTGCT CAGGCGGCGC GCCGAAAGCG GCCTGCTCGG CGGCATGACC GAGGTGCCGA CAACGGCCTG GACGGCGCGG CTCGACGGCG AAACCTCGGC TGCGGCCGCG CCCTTCGAGG CGGCATGGCA GGCCTGCGGC ACCGTCATCC ATGTCTTCAC CCATTTTGAA CTCCGGCTGT TGATCTGGCG CGCGGCGATC GCCGGCAAGG TGGATGACCG TCCGAATGAC GGATGGTGGG AGCCGGTTAC AAATCTTGAA GCGCAGGCCT TGCCGACCAT CATGAAAAAA GCGATCGCAG CGGCTATTCC TCTCGCGTTC AAAACATCCA AGGGATGA
|
Protein sequence | MTITTPDTPT AKPLLDWYDR HHRDLPWRIS PGMAAGGVKP DPYRVWLSEV MLQQTTVQAV KPYFAKFLQR WPEVTDLAAA ENDAVMAAWA GLGYYARARN LKKCAEAVAK EHGGVFPDTE EGLKSLPGIG DYTAAAVAAI AFNRQAAVMD GNVERVISRL YAIETPLPAG KPLMKEKVAR LTPATRPGDF AQAMMDLGAT ICTPKRPACS LCPFRGACAA LKLSDPELFP VKAAKKEKPV RQGAAFVAVT ADGEILLRRR AESGLLGGMT EVPTTAWTAR LDGETSAAAA PFEAAWQACG TVIHVFTHFE LRLLIWRAAI AGKVDDRPND GWWEPVTNLE AQALPTIMKK AIAAAIPLAF KTSKG
|
| |