Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3535 |
Symbol | |
ID | 6411209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3782801 |
End bp | 3785536 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713413 |
Product | DNA topoisomerase I |
Protein accession | YP_001992510 |
Protein GI | 192291905 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCTCG TCATTGTCGA GTCGCCCGCG AAGGCCAAGA CGATCAACAA ATATCTCGGC TCGTCCTACG AGGTTCTGGC CTCGTTCGGG CACGTGCGCG ACTTGCCGGC GAAGAACGGA TCGGTCGACC CGGACGCCGA CTTCCAGATG ATCTGGGAAG TCGATCCGAA AGCCGCGAGC CGGCTCAACG ACATCGCCAA AGCTCTGAAG GGTGCCGACA AACTGATCCT CGCCACCGAC CCTGATCGCG AGGGCGAGGC GATCTCCTGG CATGTGCTGG AGGTGATGAA GCAGAAGCGT GCGCTGAAAG ACCAACAGGT CGAGCGCGTC GTGTTCAACG CCATCACCAA GCAGTCGGTC ACCGACGCGA TGAAGCACCC GCGCCAGATC GATGGTGCGC TGGTCGACGC CTATATGGCG CGCCGCGCTC TCGACTACCT GGTCGGGTTC ACCCTCTCCC CGGTGCTGTG GCGCAAGCTG CCCGGCGCCC GCTCCGCCGG CCGGGTGCAG TCGGTGGCGC TGCGGCTGGT GTGCGACCGC GAGACGGAAA TCGAGAAATT CGTCCCCCGC GAGTACTGGT CGCTGGTCGC CACCATGGCG ACACCACGCG GCGAGACCTT CGAGGCCCGC CTGGTCGGCG CCGACGGCAA GAAGATCCAG CGGCTCGACA TCGGCACCGG CGCCGAGGCG GAAGACTTCA AAAAGGCGAT CGAAGCGGCG AACTTCCGCG TCGCCACCGT CGAAGCCAAG CCGGCCCGCC GCAACCCCTA CGCCCCCTTC ACCACCTCGA CGCTGCAGCA GGAGGCCAGC CGTAAGCTCG GCTTCGCGCC GGCGCACACC ATGCGGGTGG CGCAGCGGCT GTATGAAGGT GTCGACATCG GCGGCGAGAC CGTCGGACTC ATTACATATA TGCGTACCGA CGGCGTGCAG ATCGACTCGT CCGCCATCAC CCAGGCGCGC AAGGTGATCG CCGAGGATTT CGGCAACGCC TATGTGCCGG ACGCACCGCG GCAATACACC GCAAAGGCCA AGAACGCGCA GGAAGCCCAC GAAGCGATCC GCCCGACCGA CCTGTCGCGC AGGCCCTCCG ACGTCAGCCG GAACCTCGAT TCCGACCAGG CCCGGCTGTA CGAGCTGATC TGGATCCGCA CCGTCGCCAG CCAGATGGAA TCCGCCGAGC TCGAGCGCAC CACCGTCGAC ATCGAGGCGA AAGCCGGGTC GCGGGTGCTG GAGCTGCGCG CCACCGGCCA GGTGGTGAAA TTCGACGGCT TCCTCGCCGC CTATCAGGAA GGCCGCGACG ACGATTCCGA GGACGAGGAT TCCCGCCGCC TCCCCGCCAT GAGCGAGAAC GAGGCGCTGA AGCGCGAGGC CCTCGCGGTC ACGCAGCATT TCACCGAGCC GCCGCCGCGG TTCTCCGAAG CGTCTCTGGT GAAGCGGATG GAAGAGCTCG GCATCGGCCG GCCCTCGACC TACGCCTCGA TCCTGCAGGT ACTGAAGGAT CGCGGCTACG TCCGGCTGGA AAAGAAGCGG CTGCACGGCG AGGACAAGGG CCGCGTCGTC GTCGCGTTTC TCGAGAGCTT CTTCACCCGC TACGTCGAAT ACGACTTCAC CGCGGGACTC GAAGAAGACC TCGACCGCAT CTCCAATAAT GAAGTGTCCT GGAAACAGGT CCTGAGCGAC TTCTGGCGCG ACTTCATCGG CGCGGTCGAC GAGATCAAGG ACTTGCGCGT CGCCCAGGTG CTCGACGTGC TCGACGAGAT GCTCGGCCCG CACATCTATC CGGCCCGCGA GGACGGCGGC GATCCGCGGC AGTGCCCGAG CTGCGGCAAC GGCCGGCTCA ACCTGAAGGC CGGCAAGTTC GGCGCCTTCG TCGGCTGCAC CAACTATCCG GAATGCCGCT ACACCAGGCC GCTCGCTGCC GATGGCGGCG CCGATGCCGA TCGCGTGCTC GGCACCGATC CGGACACCGG CCTCGAGGTC GCGGTGAAGT CCGGCCGGTT CGGGCCCTAC ATCCAACTCG GCGAGGCCAA GGATTATGGC GAGGGCGAGA AGCCCAAGCG CGCCGGCATC CCGAAGGGCA CCTCGCCGTC CGATGTCGAA CTCGACGTTG CGCTGAAGCT GCTGGCGCTG CCGCGCGAAG TCGGCAAACA TCCGGAGAGC GGCCAACCGA TCAAGGCCGG CATCGGCCGG TTCGGGCCTT ACGTGCAGCA CGAGAAGACC TATGCCAGCC TCGAAGCCGG CGACGACGTG CACACCATCG GTCTGAATCG CGCGGTGACG CTGATCGCCG AGAAGGTCGC CAAGGGTCCG AGCAAGGGCC GGTTCGGCGC CGATCCCGGC AAGGCGCTGG GCGATCACCC GACGCTCGGC GCGGTCGCGG TCAAGAAGGG CCGCTACGGC GCCTATGTCA CCGCCGGCGG CGTCAACGCC ACGATCCCGA ACGACAAGAC CGACGAGACC ATCACGTTGC CCGAGGCGAT CGTGCTGCTC GACGAGCGCG CCGCCAAGGG CGGCGGCAAA GCCAAGAAGG CGCCGGCCAA GAAGTCTGCT GCCAAGAAGG CTTCCGCCGA CGGTGAGGCC AAGCCGGTGA AGAAGGCCGC CGCCAAGAAG GCCAAGCCGA AGGCCGAAGG CGCCGCCGCC AGCAAGGCGC GCGCACCGGT GGCGGCAAAG ACCGCGGCGA AAAAGGCCGC CAAGCCTAAA GACGCGGCCA AGAGCAGCGC CGCCAAGAAC GGATAG
|
Protein sequence | MNLVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDADFQM IWEVDPKAAS RLNDIAKALK GADKLILATD PDREGEAISW HVLEVMKQKR ALKDQQVERV VFNAITKQSV TDAMKHPRQI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR ETEIEKFVPR EYWSLVATMA TPRGETFEAR LVGADGKKIQ RLDIGTGAEA EDFKKAIEAA NFRVATVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRVAQRLYEG VDIGGETVGL ITYMRTDGVQ IDSSAITQAR KVIAEDFGNA YVPDAPRQYT AKAKNAQEAH EAIRPTDLSR RPSDVSRNLD SDQARLYELI WIRTVASQME SAELERTTVD IEAKAGSRVL ELRATGQVVK FDGFLAAYQE GRDDDSEDED SRRLPAMSEN EALKREALAV TQHFTEPPPR FSEASLVKRM EELGIGRPST YASILQVLKD RGYVRLEKKR LHGEDKGRVV VAFLESFFTR YVEYDFTAGL EEDLDRISNN EVSWKQVLSD FWRDFIGAVD EIKDLRVAQV LDVLDEMLGP HIYPAREDGG DPRQCPSCGN GRLNLKAGKF GAFVGCTNYP ECRYTRPLAA DGGADADRVL GTDPDTGLEV AVKSGRFGPY IQLGEAKDYG EGEKPKRAGI PKGTSPSDVE LDVALKLLAL PREVGKHPES GQPIKAGIGR FGPYVQHEKT YASLEAGDDV HTIGLNRAVT LIAEKVAKGP SKGRFGADPG KALGDHPTLG AVAVKKGRYG AYVTAGGVNA TIPNDKTDET ITLPEAIVLL DERAAKGGGK AKKAPAKKSA AKKASADGEA KPVKKAAAKK AKPKAEGAAA SKARAPVAAK TAAKKAAKPK DAAKSSAAKN G
|
| |