Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1085 |
Symbol | |
ID | 8012210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1063909 |
End bp | 1064808 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644823668 |
Product | phage SPO1 DNA polymerase-related protein |
Protein accession | YP_002974919 |
Protein GI | 241203823 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1573] Uracil-DNA glycosylase |
TIGRFAM ID | [TIGR00758] uracil-DNA glycosylase, family 4 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.735603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCCG CCAACGACCT TTCCCCAGCC GAGCTCGCAG CGCTTCTGCA TTTCCATGCC GATGCCGGCG TGGAATGGCT GCTGGAGGAA GAGGCGATCG ACCGCTTCGC CGAGTTCGAG GCAATGAAGG CCGCCCGCCG GCCGGCGGCA CAGGCGCAGC AGCAACGTCC CGCCGCTGGG GAACGCCCTG CCCCAGGTCA GTCACCGGCG CGCCCGAACG CCGCAGCCCG CCCAGCGCCT GCCGAACGCG CAGCGTCAGG TCCGCAACCG GCAATCCCGG ATGGCGAGGC GGTGCAGCAG GCGCGCTTCG TCGCCGAAAC CGCGCGATCG CTCGTCGAGC TCAAGACCGC GATCGAAACC TTCAACGGCT GCAACCTCAA GCACAGCGCC CGCTCGACCA TCTTTGCCAG CGGCGATACC GAAAGCGGGA TCATGGTAAT CGGCTCGGCG CCGAGCGCCG AAGACGATCG CGAGGGTTTG CCCTTCTCCG GAAAATCCGG TCAGCTGTTC GACAAGATGC TGGCGGCGAT CGGGCTGACG CGCTCAACCA TTCTGTTGAC GCAGGTCATC CCCTGGCGGC CGCCTGGCAA TCGTGCGCCC TCGGCGGCGG AAATGGACAT CTGCCGACCC TTCATCGAGC GGCAGATCGC GCTTGCCGAA CCGAAAGCGA TCCTGCTGCT CGGCAATTTT TCGGCGCGTT TCTTCTTCGG CGAAAACGAT ACGATTCACG GCCTACGCGG CCGTTGGAAG GAGATTGCGG CTGCGGACTG TGTCATTCCT GCCATAGCCA GCCTGCATCC GCAGGATCTG TTAACCGCAC CTGTAAACAA GCGGCTGGCC TGGAACGACC TGCTCGCCTT TCAAGCGAAG CTTAAGTCCC TCTCTTTGCT TAGAAATTAG
|
Protein sequence | MISANDLSPA ELAALLHFHA DAGVEWLLEE EAIDRFAEFE AMKAARRPAA QAQQQRPAAG ERPAPGQSPA RPNAAARPAP AERAASGPQP AIPDGEAVQQ ARFVAETARS LVELKTAIET FNGCNLKHSA RSTIFASGDT ESGIMVIGSA PSAEDDREGL PFSGKSGQLF DKMLAAIGLT RSTILLTQVI PWRPPGNRAP SAAEMDICRP FIERQIALAE PKAILLLGNF SARFFFGEND TIHGLRGRWK EIAAADCVIP AIASLHPQDL LTAPVNKRLA WNDLLAFQAK LKSLSLLRN
|
| |