Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2239 |
Symbol | |
ID | 3973256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 2443522 |
End bp | 2446290 |
Gene Length | 2769 bp |
Protein Length | 922 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637925347 |
Product | DNA topoisomerase I |
Protein accession | YP_532112 |
Protein GI | 90423742 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0515029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0535493 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTGGGC AGTTCCTACG AGGTTCTGGC CTCGTTCGGC CATGTCCGCG ACCTTCCGGC GAAGAACGGC TCGGTCGATC CCGACGCCAA TTTCCAGATG ATCTGGGAAA TCGATCCCAA GGCCGCCGGC CGGCTCAACG ACATCGCCAA GGCGCTGAAG GGCGCCGACA AGCTGATCCT CGCAACCGAC CCTGATCGCG AGGGGGAAGC GATCTCCTGG CACGTGCTGG AGGTGTTGAA AGAGAAGCGC GCGATCAAGG ATCACAAGAT CGAACGCGTG GTGTTCAACG CCATCACCAA GCAGGCGGTC ACCGATGCGA TGAAGAACCC GCGCCAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG CGCCGCGCGC TGGACTATCT GGTCGGCTTT ACACTCTCTC CCGTATTGTG GCGCAAACTG CCCGGCGCCC GCTCCGCCGG CCGCGTCCAA TCGGTGGCGC TGCGGCTGGT CTGCGACCGC GAGCTCGAGA TCGAGAAATT CGTGCCGCGG GAATACTGGT CGCTGGTGGC GACCCTGACC ACGCCGCGCG GCGAGATGTT CGAGGCCCGG CTCACCGGCG CCGATGGCAA GAAGATCCAG CGGCTCGACA TCGGCACCGG CGCCGAGGCC GAGGATTTCA AGCAGGCGAT CGAAGCGGCG CTGTTCAATG TCGCCAGCGT CGAAGCCAAG CCGGCGCGGC GCAACCCCTA CGCGCCGTTC ACCACCTCGA CGCTGCAGCA GGAGGCCAGC CGCAAGCTCG GCTTCGCCCC GGCGCACACC ATGCGGATCG CGCAGCGGTT GTATGAAGGC ATCGACATCG GCGGCGAGAC CACCGGTCTC ATTACTTATA TGCGTACCGA CGGCGTGCAG ATTGATTCCT CCGCCATCAC CCAGGCGCGC CAGGTGATCG GCGAGGACTA CGGCAAGCAA TACGTTCCGG AGGCGCCGCG GCAATACACC GCCAAGGCCA AGAACGCCCA GGAAGCCCAT GAAGCGATCC GGCCGACCGA CCTCAGCCGC CGCCCCGCCA GCTTGCGCGC CCGGCTCGAC CACGATCAGA TCCGGCTCTA CGAGCTGATC TGGATCCGCA CCATCGCCAG CCAGATGGAA TCCGCCGAAT TGGAGCGCAC CACCGTCGAG ATCGCCGCCA AGGCGGGCTC GCGGGTGCTG GAACTGCGCG CCACCGGCCA GGTGGTGAAG TTCGACGGCT TCCTGGCGGT GTATCAGGAA GGCCGCGACG ACGACGGTGA CGACGAGGAT TCCCGCCGAC TGCCGGCAAT GAGCCAAGGC GAAGCCTTGG CTCGCAAGGA CCTCGCCGTC ACCCAGCATT TCACCGAGCC GCCGCCGCGC TTCTCCGAAG CCTCGCTGGT CAAGCGGATG GAAGAGCTCG GCATCGGCCG GCCCTCGACC TACGCCTCGA TCCTGCAGGT GTTGAAGGAC CGCGGCTACG TCAAGCTCGA CAAGAAGCGG CTGCACGCCG AGGACAAGGG CCGCGTCGTG GTCGCGTTCC TGGAGAACTT CTTCGCCCGC TACGTCGAAT ACGACTTCAC CGCGGCGCTG GAGGAAAACC TCGACCGGAT TTCCAACAAC GAAATCTCCT GGCAACAGGT GCTGCGCGAT TTCTGGACCG ACTTCATCGG CGCGGTCAAC GACATCAAGG ATCTGCGCGT CGCGCAGGTG CTGGACGCGC TCGACGACAT GCTCGGCTCG CACATCTATG CGCCACGCGA CGACGGCGGC GATCCGCGGC AATGCCCGAG CTGCGGCACC GGCAAGCTCA ACCTCAAGGC CGGCAAGTTC GGCGCCTTCG TCGGCTGCAG CAACTATCCG GAATGCCGCT ACACCCGCCC GTTGGCGGCT GACGGCGGCG GCGACGGCGA CCGCATTCTC GGCAAGGACC CGGTGTCCGG CCTCGAAGTC GCGGTCAAGG CCGGCCGGTT CGGTCCCTAT ATCCAGCTCG GTGACGCCAA GGACTACGCC GAGGGCGAGA AGCCGAAACG CGCCGGCATT CCGAAAAACT CCTCGCCCGG CGACATGGAG CTCGAGCTCG CGCTGAAGCT GTTGTCGCTG CCGCGCGAAG TCGGCAAACA TCCGGAGACC GGCGAGCCGA TCAAGGCCGG CATCGGCCGC TTCGGTCCCT ATGTGCAGCA TGAGAAGACC TATGCCAGCC TGGAAGCTGG CGACGAGGTG TTCGACATCG GGCTGAACCG CGCGGTGACG CTGATCGCCG AGAAGATCCT CAAAGGCCCG AGCAAGCGAC GGTTTGGTTC GGACCCCGGC AAACCGCTCG GCGAGCATCC CTCGCTCGGC ACCGTGGCGG TGAAGAGCGG ACGTTACGGC GCTTACGTCA CCGCCGGCGG CGTCAACGCC ACGATTCCGA GCGACAAAAC TCAAGAGAGC ATCACCCTGC CCGAGGCCAT CGCGCTGATC GACGAGCGCG CGGCGAAGGG CGGCGGCAAG CCGAAGAAAG CCGCGAAGAA AGCGCCGGCC AAGAAGGCCG CGAAGTCCGA TACCGACGCA GCGGCAGAGA CGAAAAAACC CGCGAAGAAA GCCGCGGCGA AGAAGTCGGT CGCCAAGCCG AAGGCCGACG GCGTCGCCGT AAGTGCCGCG CGCGCGCCGG CGAAAGCCAA ATCCTCGACC AAGACCGCCG CAGCCAAGAA GCCTGCAAAG CCGGCGGCGA AAAAATCCGC GGGCAAAGCC AACGGCTGA
|
Protein sequence | MNIVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKEKR AIKDHKIERV VFNAITKQAV TDAMKNPRQI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR ELEIEKFVPR EYWSLVATLT TPRGEMFEAR LTGADGKKIQ RLDIGTGAEA EDFKQAIEAA LFNVASVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL ITYMRTDGVQ IDSSAITQAR QVIGEDYGKQ YVPEAPRQYT AKAKNAQEAH EAIRPTDLSR RPASLRARLD HDQIRLYELI WIRTIASQME SAELERTTVE IAAKAGSRVL ELRATGQVVK FDGFLAVYQE GRDDDGDDED SRRLPAMSQG EALARKDLAV TQHFTEPPPR FSEASLVKRM EELGIGRPST YASILQVLKD RGYVKLDKKR LHAEDKGRVV VAFLENFFAR YVEYDFTAAL EENLDRISNN EISWQQVLRD FWTDFIGAVN DIKDLRVAQV LDALDDMLGS HIYAPRDDGG DPRQCPSCGT GKLNLKAGKF GAFVGCSNYP ECRYTRPLAA DGGGDGDRIL GKDPVSGLEV AVKAGRFGPY IQLGDAKDYA EGEKPKRAGI PKNSSPGDME LELALKLLSL PREVGKHPET GEPIKAGIGR FGPYVQHEKT YASLEAGDEV FDIGLNRAVT LIAEKILKGP SKRRFGSDPG KPLGEHPSLG TVAVKSGRYG AYVTAGGVNA TIPSDKTQES ITLPEAIALI DERAAKGGGK PKKAAKKAPA KKAAKSDTDA AAETKKPAKK AAAKKSVAKP KADGVAVSAA RAPAKAKSST KTAAAKKPAK PAAKKSAGKA NG
|
| |