Gene Information |
Locus tag | Rleg_4639 |
Symbol | |
ID | 8015381 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4764069 |
End bp | 4764959 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644827214 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_002978414 |
Protein GI | 241207318 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0230348 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAT TGCCAGAAGT CGAAACGGTG AAACGCGGCC TGACGCCGGC GATGGAGGGC ACGCGCGTCA CCAGGCTGGA GCTGCGCCGA GGCGATCTGC GCTTCCCCTT TCCCGACGCT TTCGCAGACA GGGTTTCCGG CCGCACCATC GTTGGCCTTG GCCGCCGCGC CAAATATCTG CTGGTCGATC TCGACGACGG CAACACGCTG ATTTCGCATC TCGGCATGTC CGGTTCGTTT CGCATCGAGG AGGGTGCAGT CTCCGGCATG CCGGGCGAAT TCCACCATGC CCGCTCGAAG GACGAGAAGC ACGATCACGT CGTCTTCCAT CTGCAAGGTT TAGGCGGCCC GCGCCGCGTC GTCTATAACG ATCCGCGCCG TTTCGGCTTC ATGGATATGG TGGGACGTGC CGATCTCGCC GCTCATCCCT TCTTCCGCGA TCTCGGCCCG GAGCCGACAG GAAACGAGCT TGGCGCCGCC TATCTGGCTG AACGCTTCCG CGACAAGGCG CAGCCGCTGA AGAGCGCGCT GCTTGACCAG AAGAACATTG CCGGTCTGGG CAACATATAT GTCTGCGAGG CGCTTTGGCG CGCGCATCTT TCGCCGATCC GCGCCGCCGG TACGCTGGTG ACGCCAGGGG GCCGGCCGAA GGCGCAGCTC GACCTGCTCG TTGCCTCGAT CCGCGACGTC ATCGCCGATG CGATCGCCGC CGGCGGATCG TCGCTGCGCG ACCATATCCA GACCGACGGA TCGCTCGGCT ATTTCCAGCA TTCCTTCTCC GCCTATGATC GCGAAAGTCA GGCTTGCCGC ACGCCCGGCT GCGGCGGTAC GGTCGCCCGC ATCGTCCAGG CAGGTCGCTC CACCTTCTAT TGCGCCACCT GTCAGAAGTA A
|
Protein sequence | MPELPEVETV KRGLTPAMEG TRVTRLELRR GDLRFPFPDA FADRVSGRTI VGLGRRAKYL LVDLDDGNTL ISHLGMSGSF RIEEGAVSGM PGEFHHARSK DEKHDHVVFH LQGLGGPRRV VYNDPRRFGF MDMVGRADLA AHPFFRDLGP EPTGNELGAA YLAERFRDKA QPLKSALLDQ KNIAGLGNIY VCEALWRAHL SPIRAAGTLV TPGGRPKAQL DLLVASIRDV IADAIAAGGS SLRDHIQTDG SLGYFQHSFS AYDRESQACR TPGCGGTVAR IVQAGRSTFY CATCQK
|
| |
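The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Rleg_4639.
length = gene_length_bp(4764069, 4764959)
print(length)                     # 891, matching the Gene Length field
print(protein_length_aa(length))  # 296, matching the Protein Length field
```

Under these assumptions, the reported gene length (891 bp) and protein length (296 aa) agree with the start and end coordinates.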