Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_0647 |
Symbol | |
ID | 5321483 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 698202 |
End bp | 699071 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640789583 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_001326338 |
Protein GI | 150395871 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.268073 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGG CCAACGACCT CTCCCCCTCC GAACTCGCAG CCCTTCTCGC CTTCCATGCG GAAGCGGGTG TCGAATGGTT GATCGAGGAA GAACCCCTCG ACCGGCTGGC GGAATTCGAA GCTATGAGGA TGAGGCGCAC GGAGAGCCGT GCCGCGCCAC AAGCCAACGA GATGGCAGCC GAACGGCGGC AACCCGTACC TGAGACCACC GCGCGCGAGC GCGCTCAGGC CGCGCCGACC GCGCGCCCTC CTGTCGTCGC AATACCGGAC GAACAGGCCA TCAAAGAGGC ACAGTTCGTC GCCGGTGCCG CACGATCGCT CGGAGAGTTG CGCACCGCCA TGGAGGCGTT TTCCGGGTGC AATCTCAGAA ATAGCGCCCG CAACCTCGTC TTTGCCGAAG GCAGTGCTGC CTCCGGAGTC ATGATCATCG GTCCGATGCC CTACGCGGAC GACGACCGCG ACGGTCACCC CTTTGCGGGC AGGCACGGCC AGATGCTCGA ACGCATGCTG TTCGGCATCG GCCTGTCGCG TGACGACGTC CTTCTCACAA ATACAGTGCC CTGGCGTCCG CCGGGGAACC GCGTGCCGAG CGCTCGCGAG GCGGATATCT GCCGCCCTTT CATCGAGCGC CAGATCGAAC TGGCCGAACC CAAACAGCTG CTGCTTCTCG GCAATTTCAC CGCTCGCTTC TTCTTCGGCG GCGCTGAAAT GATCCATCAG TTGCGCGGCG AATGGCGCGA GCTCACATTT GCTGGGACCA GCATACCGGC GCTCGCCACC CTGCATCCCC AGGACCTCAT CGCAGCCCCG ATAAACAAGC GCTTCGCCTG GCTGGACCTG CTGGCGTTCA AATCCCGTCC CGAATTGTGA
|
Protein sequence | MISANDLSPS ELAALLAFHA EAGVEWLIEE EPLDRLAEFE AMRMRRTESR AAPQANEMAA ERRQPVPETT ARERAQAAPT ARPPVVAIPD EQAIKEAQFV AGAARSLGEL RTAMEAFSGC NLRNSARNLV FAEGSAASGV MIIGPMPYAD DDRDGHPFAG RHGQMLERML FGIGLSRDDV LLTNTVPWRP PGNRVPSARE ADICRPFIER QIELAEPKQL LLLGNFTARF FFGGAEMIHQ LRGEWRELTF AGTSIPALAT LHPQDLIAAP INKRFAWLDL LAFKSRPEL
|
| |