Gene Information |
Locus tag | Rru_A3202 |
Symbol | |
ID | 3836648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 3689610 |
End bp | 3692357 |
Gene Length | 2748 bp |
Protein Length | 915 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637827317 |
Product | DNA topoisomerase I |
Protein accession | YP_428284 |
Protein GI | 83594532 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
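The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a protein length that excludes the stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced for illustration and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Rru_A3202.
length = gene_length_bp(3689610, 3692357)
residues = protein_length_aa(length)
print(length, residues)
```

With the values in this record, the computed lengths agree with the listed Gene Length (2748 bp) and Protein Length (915 aa).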
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGTCG TCGTCGTTGA ATCTCCGGCC AAAGCCAAAA CCATCAATAA ATATCTGGGC AAGGACTACG TGGTCCTCGC GTCCTACGGC CACATCCGCG ACCTTCCCCC CAAGGACGGC TCCGTCCGCC CGGACGAAGG CTTCGCCATG GACTGGGAGA TCGACGCCAA ATCCGAAAAG CACGTCAAGG ACATCACCCA GGCCCTGAAG TCGGCCGACG GACTGCTGCT CGCCACTGAC CCCGATCGCG AGGGAGAGGC GATTTCGTGG CATGTCCGCG ATGTTTTGGA AAAGCGCAAG GCGCTGGTTG GCAAGCAGGT CCAGCGCATC ACCTTCAACG CCATCACCCG CTCGGCGGTT GAAGAGGCCT TGCGCAATCC CCGCGATCTC GATACGCCGC TGGTCGAGGC CTATCTGGCC CGCCGGGCGC TGGATTATCT GGTGGGCTTC ACCCTGTCGC CGGTGCTGTG GCGCAAGCTG CCCGGATCGC GCTCGGCCGG CCGGGTGCAA TCGGTGGCCC TGCGGCTGAT CTGCGAACGC GAGCAAGAGG TCGAGCGCTT TGTTTCGCGC GAATACTGGA CGGTCGACGC CATCCTCGCC TCGCCCCAGG GGCAGATCTT CCGCGCCCGC CTCAGCCAGC TTGACGGCCA GAAGCTTGAT AAATTCGGCC TGCCCGACGA AACGACCGCC CTGGCCGCCC TCGAGCGGAT CAAGGCGGCG ACGCTGCGGG TGGAAAGTGT CGAGCGCAAG CAGGCGCGGC GCAATCCCGC CGCTCCCTTC ACCACCTCGA CCATCCAGCA GGAAGCCTCG CGCAAGCTGT ATTTTTCGGC CCGCCAGACC ATGCAGGTGG CCCAGAAGCT TTACGAGGGC GTGGACCTGG GCGGCGAGAC CGTCGGCCTG ATCACCTATA TGCGAACCGA TGGCGTCTCG ATCGCCCCCG AGGCGGTCTT CGCCACCCGC GATCTGATCG GCGCCGAATT CGGCGCGGCC TATCTGCCCG AACAGCCCCG CGTCTATAAG ACCAAGGCCA AGAACGCCCA GGAGGCGCAC GAGGCCATCC GCCCCACCGA TGTGGCGCGC ACGCCGCAAA GCGTCGCCCC TTATCTTACC GCCGAACAGC GTAAGCTGTA CGAACTGGTG TGGAAGCGCA CGGTCGCCAG CCAGATGGCC AGCGCCATCC TTGATCAGGT CGCCGTCGAC ATCGCCGATC CCGAGGCCAA GGTGGTGCTC CACGCCACCG GCTCGGTCAT CCAGTTCGAC GGCTTCCTCA AGGTCTACCG CGAGGACTTC GACGATCGCC CCGAAAGCAG CCCCGATGGG GCCGGCGAGG ACGAAAACCG CCTGTTGCCG CCCTTGCGCG AAGGCGATGG CGTCAAGCGC GAGGACGTGA AGGCCGATCA GCATTTCACC CAGCCGCCGC CGCGCTATAC GGAAGCCAGT TTGGTCAAGC GCATGGAAGA ACTGGGCATC GGCCGTCCCT CGACCTACGC CTCGATCCTC AGCGTTCTTC AGGACCGGGA ATATGTGCGC CTTGATGCCC GGCGCTTCAT TCCCGAAGAC CGCGGCCGTC TGGTCACCGC CTTCCTGGAA AACTTCTTTT CGCGCTATGT GCAGTATTCC TTCACCGCCG ATCTGGAAAA CCAGCTCGAC GAGATTTCCG ACGGCAAGCT TGGCTGGAAA ACCGTCCTTG AACGCTTCTG GATGGATTTC AAGGCGGCGA TCGAAGGCAC CGCCACCCTG CGCGTCTCGG AAGTTCTGGA CGCCCTGGAC AAGGAACTGG GGCCCCACCT GTTCCCCCAG 
GCCGAGGACG GCCACGATCC GCGCGTCTGC CCGGTGTGCT CGGCCGGTCG GTTGGGTCTG CGCATCGGCA AGTTCGGCGC CTTCGTCGGC TGCTCGAACT ATCCCGACTG CAAGTTCACC CGGCCGCTGG TCACCAAGGA AGGCGAAGGC GGCGACGGCG CCGCCCTGGC CGAGGATGGC ACCAAGCCGC TGGGTAAGGA CCCGGTCAGC GGCGAGGACG TGACCCTGCG CAAGGGCCCC TATGGGCTTT ATGTGCAAAA GGGCGAGGCG GCGCCGGTGG AAAAGGGCAA GAAGGCGGTC AAGCCGCCGC GCGTGTCGAT CCCCAAGGAC ATGGACGCCG CCACCATCGA TCTTGATATC GCCCTGAAGC TGCTGTCGCT GCCGCGCCCG GTCGGCGATC ACCCCGAAAC CGGCACGATG ATCAGCGCCG GCATCGGCCG CTTCGGCCCC TATATCAAGC ACGGCGACAT CTATAAATCG CTGCCCAAGG ACGATGACGT TCTGACCATC GGCCTCAACC GCGCCGTCAG CCTGTTGGCC GAGGCCGGCA AGGGCGGCCG CGCCCGTCAG CCGGCGCGCA GCCTGGGCGA CCATCCCGCC GACACCAAGC CGGTGACCAT CCACGACGGC CGCTTCGGTC CCTATGTCCA GCATGGCGGC GTGCGCGCGA CCATTCCGCG CACCGCCGAT CCGGCGACCT ATACCCTGGC CGAGGCGGTC GAGCTGATCG CCGCCAAGGC GGCCAAGGAC GGCGACGGCA AGAAGGCCCC GGCGAAGAAG GCGGCGACCA AGGCTCCGGT CAAAAAAGCG GCGGCCGAGA AAAAGGCGCC GGCGAAGACC GCGACCAAGA AAGCCGCGCC GAAAAAGGCC GCCGACTCGG CCGAAGACGC CCCGGCCAAG GCGCCGCGCA AGAGCAAGGC CAAACCGGCG AGCGCCGATC CGGACTGA
|
Protein sequence | MHVVVVESPA KAKTINKYLG KDYVVLASYG HIRDLPPKDG SVRPDEGFAM DWEIDAKSEK HVKDITQALK SADGLLLATD PDREGEAISW HVRDVLEKRK ALVGKQVQRI TFNAITRSAV EEALRNPRDL DTPLVEAYLA RRALDYLVGF TLSPVLWRKL PGSRSAGRVQ SVALRLICER EQEVERFVSR EYWTVDAILA SPQGQIFRAR LSQLDGQKLD KFGLPDETTA LAALERIKAA TLRVESVERK QARRNPAAPF TTSTIQQEAS RKLYFSARQT MQVAQKLYEG VDLGGETVGL ITYMRTDGVS IAPEAVFATR DLIGAEFGAA YLPEQPRVYK TKAKNAQEAH EAIRPTDVAR TPQSVAPYLT AEQRKLYELV WKRTVASQMA SAILDQVAVD IADPEAKVVL HATGSVIQFD GFLKVYREDF DDRPESSPDG AGEDENRLLP PLREGDGVKR EDVKADQHFT QPPPRYTEAS LVKRMEELGI GRPSTYASIL SVLQDREYVR LDARRFIPED RGRLVTAFLE NFFSRYVQYS FTADLENQLD EISDGKLGWK TVLERFWMDF KAAIEGTATL RVSEVLDALD KELGPHLFPQ AEDGHDPRVC PVCSAGRLGL RIGKFGAFVG CSNYPDCKFT RPLVTKEGEG GDGAALAEDG TKPLGKDPVS GEDVTLRKGP YGLYVQKGEA APVEKGKKAV KPPRVSIPKD MDAATIDLDI ALKLLSLPRP VGDHPETGTM ISAGIGRFGP YIKHGDIYKS LPKDDDVLTI GLNRAVSLLA EAGKGGRARQ PARSLGDHPA DTKPVTIHDG RFGPYVQHGG VRATIPRTAD PATYTLAEAV ELIAAKAAKD GDGKKAPAKK AATKAPVKKA AAEKKAPAKT ATKKAAPKKA ADSAEDAPAK APRKSKAKPA SADPD
|
| |