Gene Information |
Locus tag | Rleg_6601 |
Symbol | |
ID | 8022851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 27116 |
End bp | 28096 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644833470 |
Product | DNA polymerase III subunit epsilon |
Protein accession | YP_002984604 |
Protein GI | 241666520 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | |
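
The coordinate and length fields above are mutually consistent, and this can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (the convention under which End bp minus Start bp plus one gives 981) and that the gene length includes the terminal stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(27116, 28096)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 981 326
```

The result matches the Gene Length (981 bp) and Protein Length (326 aa) rows: 981 bp is 327 codons, of which the last is the stop codon.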
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00020877 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGTGC ACAGAGATTC GCAACTCGAC ATGTTTGCCA AGGCATCGCC CGTGACGGCT AAGGCGCGCG GTCCAGCGCG CCGGCGGCCG TCGCAGCCGG TCGTCCACTC CGATGAGAAC ATGGCGCGGG CGCTCGAAGA GAGCGGCAAC TATCGCATTT TGAGAAAGCT GGTCGCCCGC CCGATTGCGT CAGTCAAACG GCCCGGATTT TCGCGCCTTG GCGTCATTCT CGATACGGAG ACCACCGGTC TCAACCACCG CAGCGACGAG ATCATCGAAA TCGGCGCCGT CGCCTTCACC TTCAACGATG ATGGGGCGAT CGGCGATATC GTCGGCATTT ACGGCGGCCT GCAACAGCCG TCCCGGCCGA TCCCGCCCGA GATCACCCGG CTGACGGGTA TCACCGATGC GATGGTCGAA GGGCAGCTCA TCGATATCCA GTCGCTGAGA ACTCTGATCG AGCCGGCGGA TCTGATCATC GCCCATAATG CCGGATTCGA CCGGCCGTTC TGCGAGGCCT TCTCAAAGAT TTTCACCGGC AAGGCCTGGG CATGCTCCGT TTCGGAGATC GACTGGAGCG CCCGCGGCTT CGAAGGCACG AAGCTCGGCT ATCTCGTCGG CCAGGCCGGA TATTTCCATG AAGGCCATCG TGCCGTGGAC GACTGCCATG CGCTGCTGGA AATCCTCGAT CGAGAGCAAC ACGACGGTGA AAGCCCGTTC ACCGAGCTTT ACCGCGCCAG CCAGCGCTCG CGCATCCGCG TCTTTGCCGA ACACAGCCCG TTCGAGATGA AGGATCATCT GAAGGCAAGG GGTTATCGCT GGTCGGACGG CAGCGACGGC CGCCTGAAGT CCTGGTGGAT CGAAGTCGGC GAAGAGGATC TCAACGACGA ACTCTCCTAT CTGCGCTCGG ATATTTACCG ATGGGCCGAA GCGGAGCCGC CGATGGTGCG GCTGACGGCC TTCGATCGTT TCAAACTCTG A
Protein sequence | MSVHRDSQLD MFAKASPVTA KARGPARRRP SQPVVHSDEN MARALEESGN YRILRKLVAR PIASVKRPGF SRLGVILDTE TTGLNHRSDE IIEIGAVAFT FNDDGAIGDI VGIYGGLQQP SRPIPPEITR LTGITDAMVE GQLIDIQSLR TLIEPADLII AHNAGFDRPF CEAFSKIFTG KAWACSVSEI DWSARGFEGT KLGYLVGQAG YFHEGHRAVD DCHALLEILD REQHDGESPF TELYRASQRS RIRVFAEHSP FEMKDHLKAR GYRWSDGSDG RLKSWWIEVG EEDLNDELSY LRSDIYRWAE AEPPMVRLTA FDRFKL
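
Both sequences are displayed in space-separated blocks of ten characters, so the grouping has to be stripped before any analysis. As a minimal sketch (helper names are hypothetical and no external library is assumed), the code below removes the grouping, computes a GC fraction, and translates a coding sequence. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as starts. This gene begins with ATG, so the standard codon layout is used here, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard codon layout, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three display blocks of the gene sequence above.
print(translate("ATGAGCGTGC ACAGAGATTC GCAACTCGAC"))  # MSVHRDSQLD
```

The ten residues produced from the first 30 bases agree with the first block of the protein sequence (MSVHRDSQLD). Passing the full gene sequence to `gc_fraction` gives the value that the GC content row reports as a rounded percentage.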