Gene Information |
Locus tag | RPB_2419 |
Symbol | |
ID | 3909553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2772217 |
End bp | 2774952 |
Gene Length | 2736 bp |
Protein Length | 911 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637884318 |
Product | DNA topoisomerase I |
Protein accession | YP_486035 |
Protein GI | 86749539 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0550] Topoisomerase IA |
TIGRFAM ID | [TIGR01051] DNA topoisomerase I, bacterial |
|
|
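The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes a single stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(2772217, 2774952)
print(gene_len, protein_length_aa(gene_len))  # prints: 2736 911
```

Under these assumptions the result agrees with the Gene Length (2736 bp) and Protein Length (911 aa) fields.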
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0527527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.116125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTGGGC TCGTCCTACG AGGTTCTGGC CTCGTTCGGG CATGTGCGCG ACCTGCCGGC CAAGAATGGC TCGGTCGATC CCGACGCCAA TTTCCAGATG ATCTGGGAGA TCGATCCGAA AGCCGCCGGA CGGCTCAACG ACATCGCCAA GGCGCTCAAG GGCGCCGACA AGCTGATCCT CGCCACCGAC CCTGATCGCG AGGGCGAGGC GATCTCCTGG CACGTGCTGG AGGTGCTCAA ACAAAAGCGT GCGCTGAAGG ACCAGAAGGT CGAGCGCGTG GTGTTCAACG CCATCACCAA GCAGTCCGTC ACCGAGGCGA TGCAGCATCC GCGCGAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG CGCCGGGCGC TGGATTATCT GGTCGGCTTC ACTCTTTCTC CTGTGCTGTG GCGCAAGCTG CCGGGCGCCC GCTCCGCCGG GCGCGTGCAG TCGGTCGCGC TGCGGCTGGT GTGCGACCGC GAGATGGAGA TCGAGAAGTT CGTCGCGCGC GAATACTGGT CGCTGATCGC GACGCTGACG ACGCCGCGCG GCGACGCCTT CGAGGCCCGC CTGGTCGGCG CCGACGGCAA GAAGATCCAG CGGCTCGACA TCGGCACCGG CGCCGAGGCC GAGGACTTCA AGCAGGCGAT CGAGACCGCC AATTTCAACG TGTCGAGCGT CGAGGCCAAG CCCGCCCGCC GCAATCCCTA CGCCCCGTTC ACCACCTCGA CGCTGCAGCA GGAAGCCAGC CGCAAGCTCG GCTTTGCGCC GGCGCACACC ATGCGGATCG CGCAGCGGCT GTATGAAGGC ATCGACATCG GCGGCGAAAC CACCGGTCTC ATTACTTATA TGCGAACCGA CGGCGTCCAG ATCGACCCGT CCGCCATTAC GCAAGCGCGC AAGGTGATCG CCGAGGATTA CGGCAGCGCC TATGTGCCGG ACTCGCCACG GCAATATCAG GCCAAGGCCA AGAACGCCCA GGAAGCGCAC GAAGCAATCC GCCCGACCGA CCTGTCGCGC CGCCCGTCCG AGGTCAACAA GCGGCTCGAT TCCGACCAGG CCCGGCTCTA CGAGCTGATC TGGGTCCGCA CCGTCGCCAG CCAGATGGAA TCGGCCGAGA TGGAGCGCAC CACCGTCGAC ATCGAGGCGA AGGCCGGATC GCGCGTGCTG GAGCTGCGCG CCACCGGCCA GGTTGTGAAG TTCGACGGCT TCCTCGCCGC CTATCAGGAA GGCCGCGACG ACGATTCCGA AGACGAGGAT TCGCGACGGC TGCCGGCGAT GAGCGAGAAC GAGGCGCTGA AGCGCGAAGC GCTCGCGGTG ACGCAGCATT TCACCGAACC GCCGCCGCGC TTCTCGGAAG CCTCATTGGT GAAGCGGATG GAAGAGCTCG GCATCGGCCG GCCCTCGACC TATGCGTCGA TCCTGCAGGT GCTGAAGGAT CGCGGCTATG TGAAGCTCGA AAAGAAGCGG CTGCACGGCG AGGACAAGGG CCGCGTCGTG ATCGCGTTCC TGGAGAGCTT CTTCGCCCGC TATGTCGAAT ACGACTTCAC CGCGGCGCTG GAAGAGAAGC TCGACCGCAT CTCCAACAAC GAAATCTCCT GGCAGCAGGT GCTGCGCGAT TTCTGGACCG ACTTCATCGG CGCGGTCAAT GACATCAAGG AACTGCGCGT CGCGCAGGTG CTCGACGTGC TCGACGAGAT GCTCGGCCCG CACATCTATG CACCCCGCGA GGACGGCGGC 
GATCCGCGGC AGTGCCCGAG CTGCGGCACT GGCCGGCTCA ACCTCAAGGC CGGCAAGTTC GGCGCCTTCG TCGGCTGCTC GAACTATCCG GAATGCCGCC ACACCCGCCC GCTTGCTGCA GATGGCGGCG GCGCCGATGC CGATCGCGTG CTCGGCCTCG ATCCCGACAC CGGCTTCGAA GTCGCGGTCA AATCCGGCCG GTTCGGCCCC TATATCCAGC TCGGCGACGC CAAGGACTAC GCGGAGGGCG AAAAGCCCAA GCGCGCCGGC ATCCCGAAGG GCACCTCGCC GTCCGACGTC GAGCTCGACG TCGCGCTGCG GCTCCTGGCG CTGCCGCGTG AAGTCGGCAA GCACCCCGAG ACCGGCGAGC CGATCAAGGC CGGCATCGGC CGGTTCGGGC CCTATGTGCA GCACGAGAAG ACCTACGCCA GCCTCGAGGC CGGCGATGAC GTCCACAACA TTGGGCTCAA TCGCGCGGTC ACGCTGATCG CCGAGAAGAT CGCCAAGGGT CCGAGCAAGC GCCGGTTTGG CGCCGATCCC GGCAAGCCGC TCGGCGATCA TCCGTCGCTC GGCCCGGTCG CCGTCAAGGC CGGCCGCTAC GGCGCCTATG TCACCGCCGG CGGCGTCAAT GCCACGATCC CGAACGACAA GACCCAGGAC ACCATCACGC TCCCCGAAGC GATCGCGCTG ATCGACGAGC GCGCCGCCAA GGGCGGTGGG GCTAAGGCCA AGAAGAAGGC GCCGGCCAAG AAAGCCGCAG CCAAGAGCGA CGCCAAGCCG GCGAAGAAAG CCGCGGCCAA GAAGCCGAAA GCCGAGGGCG CCGCCGCAAG CCCGGCGCGC GCGCCGGTGA AAGCCAAGAC GTCAACGACC AAGCCTAAAG CCGCGGCAGC CAAGCCGAAA TCACCCGCCA AAAAGAGCGC GGCCAAGAAC GGATAG
|
Protein sequence | MNLVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKQKR ALKDQKVERV VFNAITKQSV TEAMQHPREI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR EMEIEKFVAR EYWSLIATLT TPRGDAFEAR LVGADGKKIQ RLDIGTGAEA EDFKQAIETA NFNVSSVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL ITYMRTDGVQ IDPSAITQAR KVIAEDYGSA YVPDSPRQYQ AKAKNAQEAH EAIRPTDLSR RPSEVNKRLD SDQARLYELI WVRTVASQME SAEMERTTVD IEAKAGSRVL ELRATGQVVK FDGFLAAYQE GRDDDSEDED SRRLPAMSEN EALKREALAV TQHFTEPPPR FSEASLVKRM EELGIGRPST YASILQVLKD RGYVKLEKKR LHGEDKGRVV IAFLESFFAR YVEYDFTAAL EEKLDRISNN EISWQQVLRD FWTDFIGAVN DIKELRVAQV LDVLDEMLGP HIYAPREDGG DPRQCPSCGT GRLNLKAGKF GAFVGCSNYP ECRHTRPLAA DGGGADADRV LGLDPDTGFE VAVKSGRFGP YIQLGDAKDY AEGEKPKRAG IPKGTSPSDV ELDVALRLLA LPREVGKHPE TGEPIKAGIG RFGPYVQHEK TYASLEAGDD VHNIGLNRAV TLIAEKIAKG PSKRRFGADP GKPLGDHPSL GPVAVKAGRY GAYVTAGGVN ATIPNDKTQD TITLPEAIAL IDERAAKGGG AKAKKKAPAK KAAAKSDAKP AKKAAAKKPK AEGAAASPAR APVKAKTSTT KPKAAAAKPK SPAKKSAAKN G
|
| |
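The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The sketch below is illustrative only. It shows one way to strip the grouping, compute a GC fraction, and check that a coding sequence ends in one of the three stop codons of translation table 11 (TAA, TAG, TGA). The helper names are hypothetical, and the sample strings are just the first two blocks and the last two blocks of the gene sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons of translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace used for block grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases of the cleaned sequence form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


first_blocks = "ATGAATCTCG TCATTGTCGA"  # first two blocks of the gene sequence
last_blocks = "GGCCAAGAAC GGATAG"       # last two blocks of the gene sequence

print(clean_sequence(first_blocks))  # ATGAATCTCGTCATTGTCGA
print(gc_fraction(first_blocks))     # GC fraction of these 20 bases only
print(ends_with_stop(last_blocks))   # True
```

Applying `gc_fraction` to the full gene sequence should give a value comparable to the GC content field, although the database may round or compute it differently.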