Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4410 |
Symbol | |
ID | 5318123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 904990 |
End bp | 906108 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776214 |
Product | putative DNA topoisomerase I protein |
Protein accession | YP_001313147 |
Protein GI | 150376551 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3569] Topoisomerase IB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.017671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAGAG CAATGTCGGT CGACGTGCTG GAATGTCGAC CTGCCGGGCT CGCCGATCTC CCGCAGCGGC TCGCGGCGAT CGGGTCTGTG CCCGAGGAAA CTGGGCTCGT CTACGTCAGC GACAGCGAGC CGGGTATTCG GCGCCAAAGA CGTGGCAAAG GGTTCGCGTA TCGCATGCCG GACGGATCGA TCGTTACCGA TCCGTCTGTT AAAAGCCGCA TAGCGGCACT GGGATTACCG CCTGCCTATG AGAACGTCTG GATCTGCCTT GACGAACGAG GCCACCTGCA GGCCACCGGC TATGATGCGC GCGGCCGAAA GCAGTATCGT TATCACAGCG AATGGCAGGC CCTCAGGAGC GCCGACAAAT TTGCGCAACT GACGGAATTC GGCAAAGCTT TGCCTAAGAT ACGCCGCACC ATACGGCGCC ACATGCAGGG CGGCGTGGAA AACATGCAGA CCGTGCTTGC GGCGCTCGTG GCCCTGCTGG ACGAAGCGCA TCTGCGCACC GGCAACCAGG CTTATGTGCA GGCCAACGGA AGTTATGGCG CAACCACTTT GCTGAAGCGG CATCTCAGAC TTGGCGACGG CTTTATAGAA TTGAAATTCA CCGGAAAGGG TGGCAAGCGC GTTCAGCGGG TGCTCCGTCG CCCGAAGTTG CAGCGACTGC TCGAAGAAAT AGCCGATCTC CCGGGCAGGC AGCTTTTCGT CTGGAAGGAT GAAAACGACG CGCTTCGACC GGTCGATTCC GGCCGGCTCA ACCGGTATCT CACGGACATG GCCGGCACAG CGATCTCGGC AAAGACATTC CGAACATGGG GCGGCACTCT CGCCGCTTTC ACGGTCGCGC GGACCTCGAT CGAGCGGGGT GAGTGGCCGA CGATCAAACA GATGAGCGAG GCGGCTGCAT CCGTGCTTCA CAACACGCCC GCAATCAGCC GGAGCAGTTA CATTCATCCG GATGTGCTCG CCCTTGCCGA CAAGTCGGCC CCGGTCTCCG CGCGGCAGCT TCAGGCGCGT GGGCGTTCAG GAAGTGAATT GCGTGTGGAG GAACAGCGTT TGCTAGGCTT TCTTCAGCGC AGCGCGGGGA CGAAAAAGCG CCTGCCCTTG CCCCAGTGA
|
Protein sequence | MARAMSVDVL ECRPAGLADL PQRLAAIGSV PEETGLVYVS DSEPGIRRQR RGKGFAYRMP DGSIVTDPSV KSRIAALGLP PAYENVWICL DERGHLQATG YDARGRKQYR YHSEWQALRS ADKFAQLTEF GKALPKIRRT IRRHMQGGVE NMQTVLAALV ALLDEAHLRT GNQAYVQANG SYGATTLLKR HLRLGDGFIE LKFTGKGGKR VQRVLRRPKL QRLLEEIADL PGRQLFVWKD ENDALRPVDS GRLNRYLTDM AGTAISAKTF RTWGGTLAAF TVARTSIERG EWPTIKQMSE AAASVLHNTP AISRSSYIHP DVLALADKSA PVSARQLQAR GRSGSELRVE EQRLLGFLQR SAGTKKRLPL PQ
|
| |