Gene Information |
Locus tag | Rleg2_4368 |
Symbol | |
ID | 6983142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4535561 |
End bp | 4536466 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643399096 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_002283852 |
Protein GI | 209551935 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
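The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 4535561
END_BP = 4536466
REPORTED_GENE_LENGTH = 906      # bp
REPORTED_PROTEIN_LENGTH = 301   # aa


def gene_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of residues in an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.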
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0306824 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAT TGCCAGAAGT CGAAACGGTA AAACGCGGCC TGGCGCCGGC GATGGAGGGT GCTCGTGTCG CCAAGCTTGA GCTGCGCCGC GGCGATCTGC GCTTTCCCTT TCCCGACGCT TTCGCCGACA GGGTTTCCGG TCGCACCATC GTCAGCCTTG GCCGTCGCGC CAAATATCTG CTGGTCGATC TCGACGACGG CAACACGCTG ATTTCCCATC TCGGCATGTC CGGCTCTTTT CGCATCGAGG AGGGTCGCAT CGAGGAGGGC GCTGGAGCGG CCACGCCTGG CGAATTCCAC CATGCCCGCT CGAAGGACGA GAAGCACGAC CACGTCGTCT TTCATCTGGA AAGTCCAGCC GGTCCGCGCC GTGTCGTTTA TAACGATCCG CGCCGTTTCG GCTTCATGGA TATGGTGGGG CGCGCCGACC TTGCCGCCCA TCCCTTCTTC CGTGATCTCG GCCCGGAGCC GACGGGAAAC GAGCTCGGCG CCGCCTATCT CGCCGAGCGC TTCCGCCACA AGGCGCAGCC GTTGAAGAGT GCGCTGCTCG ACCAGAAGAA CATTGCCGGT CTCGGCAATA TATATGTCTG CGAGGCGCTG TGGCGCGCCC ACCTTTCGCC GATCCGCGCC GCCGGCACGC TGGCAACCGC AGGCGGCCGG CCGAAAGAGC AGCTTAACCT GCTCGTGGCC TCGATCCGCG ATGTCATTGC CGATGCGATC ACCGCCGGCG GATCGTCGCT GCGCGACCAT ATCCAGACCG ACGGATCGCT CGGCTATTTC CAGCATTCCT TCTCCGTCTA TGATCGCGAA GGTCAGGCTT GCCGCACGCC CGGCTGCGGC GGTACGGTCG CCCGCATCGT CCAGGCGGGC CGCTCCACCT TCTATTGCGC CACCTGCCAG AAGTAA
|
Protein sequence | MPELPEVETV KRGLAPAMEG ARVAKLELRR GDLRFPFPDA FADRVSGRTI VSLGRRAKYL LVDLDDGNTL ISHLGMSGSF RIEEGRIEEG AGAATPGEFH HARSKDEKHD HVVFHLESPA GPRRVVYNDP RRFGFMDMVG RADLAAHPFF RDLGPEPTGN ELGAAYLAER FRHKAQPLKS ALLDQKNIAG LGNIYVCEAL WRAHLSPIRA AGTLATAGGR PKEQLNLLVA SIRDVIADAI TAGGSSLRDH IQTDGSLGYF QHSFSVYDRE GQACRTPGCG GTVARIVQAG RSTFYCATCQ K
|
| |
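The GC content and protein sequence fields can be recomputed from the gene sequence. The sketch below is one possible way to do so in plain Python, not the database's own method. It assumes the space-separated 10-base blocks are only a display convention, and it builds the codon table from the standard NCBI amino-acid string, which translation table 11 shares with table 1 for elongation codons (table 11 differs in the alternative start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence above, used as a short example.
first_blocks = "ATGCCGGAAT TGCCAGAAGT CGAAACGGTA"
print(translate(first_blocks))  # MPELPEVETV, the first block of the protein sequence
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows the Protein sequence and GC content fields to be compared against the recomputed values.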