Gene Information |
Locus tag | RPD_3033 |
Symbol | |
ID | 4023536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3378832 |
End bp | 3381576 |
Gene Length | 2745 bp |
Protein Length | 914 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637963232 |
Product | DNA topoisomerase I |
Protein accession | YP_570160 |
Protein GI | 91977501 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
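The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a common convention, though this record does not state it) and that the gene length includes the terminal stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the last
    codon is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


# Values taken from the record for RPD_3033.
length_bp = gene_length(3378832, 3381576)
print(length_bp)                  # 2745
print(protein_length(length_bp))  # 914
```

Both results agree with the Gene Length (2745 bp) and Protein Length (914 aa) fields of the record.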
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.298843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.412116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCTCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTCGGC TCCTCCTACG AGGTTCTGGC CTCGTTCGGG CATGTCCGCG ATCTGCCGGC CAAGAACGGG TCGGTCGATC CAGACGCGAA TTTCCAGATG ATTTGGGAGA TCGATCCCAA AGCTGCCGGC CGGCTCAACG ACATCGCCAA GGCCCTCAAA GGCGCCGACA AGCTGATCCT CGCCACCGAC CCTGATCGCG AGGGTGAGGC GATCTCCTGG CACGTGCTGG AAGTGTTGAA GCAGAAGCGC GCGCTGAAAG ACCAGAAGGT CGAGCGCGTG GTGTTCAACG CCATCACCAA GCAGTCGGTC ACCGACGCCA TGAAGCACCC GCGCGAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG CGCCGCGCGC TGGATTATCT GGTCGGCTTC ACGCTCTCCC CGGTGCTGTG GCGCAAGCTG CCCGGCGCGC GTTCCGCCGG GCGGGTGCAA TCGGTGGCGC TGCGGCTTGT GTGCGACCGC GAGATGGAGA TCGAGAAGTT CGTTCCGCGC GAATACTGGT CGCTGATCGC GACCCTGACG ACGCCGCGCG GCGACAGCTT CGAGGCCCGC CTGGTCGGCG CCGACGGCAA GAAGATCCAG CGGCTCGACA TTGGTACCGG CGTCGAGGCC GAGGATTTCA AGCAGGCGAT CGAGCAGGCC AACTTCAAGG TGTCGAGCGT CGAGGCCAAG CCGGCCCGCC GCAACCCCTA CGCCCCCTTC ACCACCTCGA CGCTGCAGCA GGAAGCCAGC CGCAAGCTCG GCTTCGCGCC GGCGCACACG ATGCGGATCG CGCAACGGCT GTATGAAGGC ATCGACATCG GCGGCGAGAC CACCGGTCTC ATTACTTATA TGCGTACCGA CGGCGTCCAG ATCGACCCCT CCGCCATCAC CGAGGCGCGC AAGGTGATCG CCGAGGATTA CGGCAGCGCC TACGTTCCGG ATACGCCGCG GCAATATCAG GCCAAGGCGA AAAACGCCCA GGAAGCGCAT GAGGCGATCC GCCCGACCGA CATGTCGCGC CGCCCGGCCG ACGTCAACGG CAGGCTCGAT TCCGATCAGG CCCGACTCTA CGAACTGATC TGGGTCCGCA CCGTCGCGAG CCAGATGGAA TCGGCCGAGA TGGAGCGCAC CACCGTCGAC ATCGAGGCCA AAGCCGGGTC GCGGGTGCTG GAGCTGCGCG CCACCGGTCA GGTGGTCAAG TTCGACGGCT TTCTCGCCGC CTATCAGGAA GGCCGCGACG ACGACAGCGA GGACGAGGAT TCGCGCCGGC TGCCGGCGAT GAGCGAAGAC GAGGCGCTGA AGCGCGACGC GCTCGCCGTC ACCCAGCATT TCACCGAACC GCCGCCGCGC TTTTCGGAAG CCTCGCTGGT GAAGCGGATG GAAGAGCTCG GCATCGGCCG ACCCTCGACT TACGCCTCGA TCCTGCAGGT GCTGAAGGAT CGCGGCTACG TGAAGCTGGA GAAGAAGCGG CTGCACGGCG AGGACAAAGG TCGCGTCGTG ATCGCGTTCC TGGAGAGCTT TTTCGCGCGC TACGTCGAAT ACGACTTCAC CGCGGCGCTG GAAGAGAAAC TCGACCGCAT CTCCAACAAT GAAATCTCCT GGCAGCAGGT GCTGCGCGAT TTCTGGACCG ATTTCATCGG CGCGGTCGAC GACATTAAAG AGCTGCGGGT CGCGCAGGTG CTCGACGTGC TCGACGAGAT GCTCGGTCCG CATATCTATG CGCCGCGCGA GGATGGCGGC 
GATCCGCGGC AGTGCCCGAG CTGCGGCACC GGCCGGCTCA ACCTCAAGGC CGGCAAGTTC GGCGCCTTTG TCGGCTGCTC GAACTATCCG GAATGCCGCC ACACCCGCCC GCTCGCCGCC GACGGCGGCG GCGGCGATGC CGACCGCGTG CTCGGCATCG ATCCCGACAC CGGCTTCGAA GTGGCCGTCA AATCCGGTCG GTTCGGTCCT TACATCCAGC TTGGCGAAGC CAAGGACTAC GCCGAAGGCG AGAAGCCGAA GCGCGCAGGC ATTCCGAAGG GCACCTCGCC GTCCGACGTG GAGCTCGAGA TGGCGCTGCG GCTGCTCGCG CTGCCGCGCG AAGTCGGCAA GCATCCGGAG ACCGGCGAGC CGATCAAGGC CGGCATCGGC CGTTTCGGGC CCTATGTGCA GCACGAGAAG ACCTATGCCA GCCTCGAGGC CGGCGACGAT GTCCACAACA TCGGGCTGAA CCGTGCGGTG ACCCTGATCG CCGAGAAGAT CGCCAAGGGC CCGAGTAAGC GGCGCTTCGG CGCTGACCCC GGCAAGCCGC TCGGCGATCA CCCGACCCTC GGCGGCGTCG CCGTGAAAGC CGGCCGCTAC GGCGCCTATG TCACCGCCGG CGGCGTCAAC GCCACGATCC CGAACGACAA GACCCAGGAC ACCATCACGC TCGCCGAGGC CATCGCGCTG ATCGACGAGC GCGCCGCCAA GGGCGGCGGC GGCAAGGCCA AGAAGAAGGC TCCGGCGAAG AAGGCCGCAG CCTCCGGCGA GGCCAAGCCG AAGAAAGCCG CGGCCAAGAA GACCAAGCCG AAAGCCGAAA CCGCCGCCGC CAGCAAAGCG CGCGCGCCGG TGACGGCCAA GACGTCCGTG GCCAAGGCCT CCACCGCCAA AGCAACAGCC AAGCCCAAAT CACCCGCCAA AAAGAGCGCG GCCAAGAACG GATAG
Protein sequence | MNLVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKQKR ALKDQKVERV VFNAITKQSV TDAMKHPREI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR EMEIEKFVPR EYWSLIATLT TPRGDSFEAR LVGADGKKIQ RLDIGTGVEA EDFKQAIEQA NFKVSSVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL ITYMRTDGVQ IDPSAITEAR KVIAEDYGSA YVPDTPRQYQ AKAKNAQEAH EAIRPTDMSR RPADVNGRLD SDQARLYELI WVRTVASQME SAEMERTTVD IEAKAGSRVL ELRATGQVVK FDGFLAAYQE GRDDDSEDED SRRLPAMSED EALKRDALAV TQHFTEPPPR FSEASLVKRM EELGIGRPST YASILQVLKD RGYVKLEKKR LHGEDKGRVV IAFLESFFAR YVEYDFTAAL EEKLDRISNN EISWQQVLRD FWTDFIGAVD DIKELRVAQV LDVLDEMLGP HIYAPREDGG DPRQCPSCGT GRLNLKAGKF GAFVGCSNYP ECRHTRPLAA DGGGGDADRV LGIDPDTGFE VAVKSGRFGP YIQLGEAKDY AEGEKPKRAG IPKGTSPSDV ELEMALRLLA LPREVGKHPE TGEPIKAGIG RFGPYVQHEK TYASLEAGDD VHNIGLNRAV TLIAEKIAKG PSKRRFGADP GKPLGDHPTL GGVAVKAGRY GAYVTAGGVN ATIPNDKTQD TITLAEAIAL IDERAAKGGG GKAKKKAPAK KAAASGEAKP KKAAAKKTKP KAETAAASKA RAPVTAKTSV AKASTAKATA KPKSPAKKSA AKNG
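As a quick cross-check of the two sequences above, the following minimal Python sketch translates the first 60 bases of the gene sequence and computes a GC fraction. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted). The names `translate` and `gc_fraction` are illustrative and not part of the source database.

```python
# Codon table in the conventional TCAG ordering. The amino acid
# assignments below are those of the standard code, which is assumed
# here to match translation table 11 for every codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases of the gene sequence, copied from the record.
prefix = ("ATGAATCTCG TCATTGTCGA GTCGCCTGCG "
          "AAGGCCAAGA CGATCAACAA ATATCTCGGC")

print(translate(prefix))  # MNLVIVESPAKAKTINKYLG
```

The translated prefix matches the first 20 residues of the protein sequence in the record. Applying `gc_fraction` to the full gene sequence should give a value close to the reported GC content of 66%; the short prefix alone is not representative of the whole gene.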