Gene Smed_3573 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3573 
Symbol 
ID5324461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3780080 
End bp3781000 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content64% 
IMG OID640792522 
Productformamidopyrimidine-DNA glycosylase 
Protein accessionYP_001329223 
Protein GI150398756 
COG category[L] Replication, recombination and repair 
COG ID[COG0266] Formamidopyrimidine-DNA glycosylase 
TIGRFAM ID[TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000376379 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCGGAAT TGCCCGAGGT GGAAACGGTC AAGCGCGGAC TGGCGCCGAC GATGGAGGGA 
GCACTTCTCG TGCGCGCCGA ATTGCGCCGT CCCGATCTGC GCTTTCCCTT TCCCGAGAAT
TTCGAGGACG CAGTCGCCGG CCGGCGTATC GTCGCGCTCT CGCGCCGCGC CAAATATCTG
ACGATCGAGC TGGAGGGCGG CGACGTCATC ATCGCCCATC TCGGCATGTC CGGCTCGTTC
AGGATCGAGT TTGACGGTCC CGGGGAGGGC CGCATCAAGG AGAGCGCCGA TCCCGCCGTC
CCCGGCGATT TCCACCGTCC GCGCAGCAAG GACGAGAAAC ACGACCATGT CGTCTTCCAT
CTCGATGCCT CCTGCGGCCC GGCCCGGGTC ATCTATAACG ATCCACGCCG CTTCGGCTTC
ATGGCTCTGG CGCGGCGCGA AGCGCTTGCC GAGCACGTCT TTCTTCGCGG CCTCGGCGAG
GAGCCGACCG GCAACGCTCT CGATGCGGCC TATCTCGCCG CCCGGTTCTC CGGCAAAGCG
CAGCCGCTGA AAGCCGCTCT TCTCGATCAA AGGACGATCG CCGGCCTCGG CAATATATAC
GTTTGCGAGG CATTGTGGCG TTCGGGCCTT TCGCCGAAAA GGGCGGCAGG TACGCTCGTC
GACAAGCGGG CTCGCCCGAA GCAGGCGCTG GTTCAGCTGA CGGATGCGAT CCGCGCCGTC
ATCGCAGATG CGATCGCCGC CGGCGGTTCC TCGCTCAAGG ATCACATTCA GGCGGATGGC
AGTCTTGGCT ATTTCCAGCA CAGCTTCTCC GTCTATGACA GAGAAGGCGA GGCTTGCCGC
ACGTCCGGCT GCCGCGGTAC GGTTGAGCGC ATCGTTCAGG CAGGGCGTTC GACCTTTTAC
TGTCCGCACT GCCAGAAATA G
 
Protein sequence
MPELPEVETV KRGLAPTMEG ALLVRAELRR PDLRFPFPEN FEDAVAGRRI VALSRRAKYL 
TIELEGGDVI IAHLGMSGSF RIEFDGPGEG RIKESADPAV PGDFHRPRSK DEKHDHVVFH
LDASCGPARV IYNDPRRFGF MALARREALA EHVFLRGLGE EPTGNALDAA YLAARFSGKA
QPLKAALLDQ RTIAGLGNIY VCEALWRSGL SPKRAAGTLV DKRARPKQAL VQLTDAIRAV
IADAIAAGGS SLKDHIQADG SLGYFQHSFS VYDREGEACR TSGCRGTVER IVQAGRSTFY
CPHCQK