Gene Information |
Locus tag | Smed_3573 |
Symbol | |
ID | 5324461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3780080 |
End bp | 3781000 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640792522 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_001329223 |
Protein GI | 150398756 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0000376379 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGCCGGAAT TGCCCGAGGT GGAAACGGTC AAGCGCGGAC TGGCGCCGAC GATGGAGGGA GCACTTCTCG TGCGCGCCGA ATTGCGCCGT CCCGATCTGC GCTTTCCCTT TCCCGAGAAT TTCGAGGACG CAGTCGCCGG CCGGCGTATC GTCGCGCTCT CGCGCCGCGC CAAATATCTG ACGATCGAGC TGGAGGGCGG CGACGTCATC ATCGCCCATC TCGGCATGTC CGGCTCGTTC AGGATCGAGT TTGACGGTCC CGGGGAGGGC CGCATCAAGG AGAGCGCCGA TCCCGCCGTC CCCGGCGATT TCCACCGTCC GCGCAGCAAG GACGAGAAAC ACGACCATGT CGTCTTCCAT CTCGATGCCT CCTGCGGCCC GGCCCGGGTC ATCTATAACG ATCCACGCCG CTTCGGCTTC ATGGCTCTGG CGCGGCGCGA AGCGCTTGCC GAGCACGTCT TTCTTCGCGG CCTCGGCGAG GAGCCGACCG GCAACGCTCT CGATGCGGCC TATCTCGCCG CCCGGTTCTC CGGCAAAGCG CAGCCGCTGA AAGCCGCTCT TCTCGATCAA AGGACGATCG CCGGCCTCGG CAATATATAC GTTTGCGAGG CATTGTGGCG TTCGGGCCTT TCGCCGAAAA GGGCGGCAGG TACGCTCGTC GACAAGCGGG CTCGCCCGAA GCAGGCGCTG GTTCAGCTGA CGGATGCGAT CCGCGCCGTC ATCGCAGATG CGATCGCCGC CGGCGGTTCC TCGCTCAAGG ATCACATTCA GGCGGATGGC AGTCTTGGCT ATTTCCAGCA CAGCTTCTCC GTCTATGACA GAGAAGGCGA GGCTTGCCGC ACGTCCGGCT GCCGCGGTAC GGTTGAGCGC ATCGTTCAGG CAGGGCGTTC GACCTTTTAC TGTCCGCACT GCCAGAAATA G
Protein sequence | MPELPEVETV KRGLAPTMEG ALLVRAELRR PDLRFPFPEN FEDAVAGRRI VALSRRAKYL TIELEGGDVI IAHLGMSGSF RIEFDGPGEG RIKESADPAV PGDFHRPRSK DEKHDHVVFH LDASCGPARV IYNDPRRFGF MALARREALA EHVFLRGLGE EPTGNALDAA YLAARFSGKA QPLKAALLDQ RTIAGLGNIY VCEALWRSGL SPKRAAGTLV DKRARPKQAL VQLTDAIRAV IADAIAAGGS SLKDHIQADG SLGYFQHSFS VYDREGEACR TSGCRGTVER IVQAGRSTFY CPHCQK
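The length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes the Start bp and End bp coordinates are 1-based and inclusive (which matches the listed 921 bp) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing (blocks of 10 letters) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a sequence (spacing is ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    length = gene_length(3780080, 3781000)
    print(length)                  # 921, matching "Gene Length"
    print(protein_length(length))  # 306, matching "Protein Length"
```

Passing the Gene sequence value from the record to `gc_percent` gives a figure to compare against the listed GC content; the spaces between the 10-letter blocks are stripped before counting.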