Gene Rleg2_4368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4368 
Symbol 
ID6983142 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp4535561 
End bp4536466 
Gene Length906 bp 
Protein Length301 aa 
Translation table11 
GC content64% 
IMG OID643399096 
Productformamidopyrimidine-DNA glycosylase 
Protein accessionYP_002283852 
Protein GI209551935 
COG category[L] Replication, recombination and repair 
COG ID[COG0266] Formamidopyrimidine-DNA glycosylase 
TIGRFAM ID[TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0306824 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGAAT TGCCAGAAGT CGAAACGGTA AAACGCGGCC TGGCGCCGGC GATGGAGGGT 
GCTCGTGTCG CCAAGCTTGA GCTGCGCCGC GGCGATCTGC GCTTTCCCTT TCCCGACGCT
TTCGCCGACA GGGTTTCCGG TCGCACCATC GTCAGCCTTG GCCGTCGCGC CAAATATCTG
CTGGTCGATC TCGACGACGG CAACACGCTG ATTTCCCATC TCGGCATGTC CGGCTCTTTT
CGCATCGAGG AGGGTCGCAT CGAGGAGGGC GCTGGAGCGG CCACGCCTGG CGAATTCCAC
CATGCCCGCT CGAAGGACGA GAAGCACGAC CACGTCGTCT TTCATCTGGA AAGTCCAGCC
GGTCCGCGCC GTGTCGTTTA TAACGATCCG CGCCGTTTCG GCTTCATGGA TATGGTGGGG
CGCGCCGACC TTGCCGCCCA TCCCTTCTTC CGTGATCTCG GCCCGGAGCC GACGGGAAAC
GAGCTCGGCG CCGCCTATCT CGCCGAGCGC TTCCGCCACA AGGCGCAGCC GTTGAAGAGT
GCGCTGCTCG ACCAGAAGAA CATTGCCGGT CTCGGCAATA TATATGTCTG CGAGGCGCTG
TGGCGCGCCC ACCTTTCGCC GATCCGCGCC GCCGGCACGC TGGCAACCGC AGGCGGCCGG
CCGAAAGAGC AGCTTAACCT GCTCGTGGCC TCGATCCGCG ATGTCATTGC CGATGCGATC
ACCGCCGGCG GATCGTCGCT GCGCGACCAT ATCCAGACCG ACGGATCGCT CGGCTATTTC
CAGCATTCCT TCTCCGTCTA TGATCGCGAA GGTCAGGCTT GCCGCACGCC CGGCTGCGGC
GGTACGGTCG CCCGCATCGT CCAGGCGGGC CGCTCCACCT TCTATTGCGC CACCTGCCAG
AAGTAA
 
Protein sequence
MPELPEVETV KRGLAPAMEG ARVAKLELRR GDLRFPFPDA FADRVSGRTI VSLGRRAKYL 
LVDLDDGNTL ISHLGMSGSF RIEEGRIEEG AGAATPGEFH HARSKDEKHD HVVFHLESPA
GPRRVVYNDP RRFGFMDMVG RADLAAHPFF RDLGPEPTGN ELGAAYLAER FRHKAQPLKS
ALLDQKNIAG LGNIYVCEAL WRAHLSPIRA AGTLATAGGR PKEQLNLLVA SIRDVIADAI
TAGGSSLRDH IQTDGSLGYF QHSFSVYDRE GQACRTPGCG GTVARIVQAG RSTFYCATCQ
K