Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2620 |
Symbol | |
ID | 3910412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3001661 |
End bp | 3003922 |
Gene Length | 2262 bp |
Protein Length | 753 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637884519 |
Product | DNA topoisomerase IV subunit A |
Protein accession | YP_486233 |
Protein GI | 86749737 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit |
TIGRFAM ID | [TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.180269 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAA GACTGATTCC GCCGGAGCCG GCCGAGATTC ACGAAGTGCA GCTTCGTGAA GCGCTGGAAG AGCGTTACCT CGCTTATGCG CTCTCGACCA TCATGCATCG CGCCTTGCCT GACGCTCGCG ACGGGCTGAA GCCGGTGCAT CGGCGTATTC TTTATGGCAT GCGGCTGTTG CGGCTCGACC CCGGCACGCC GTTCAAGAAG TCGGCCAAGA TCGTCGGCGA CGTGATGGGC TCGTTCCATC CGCACGGCGA CCAGTCGATC TACGACGCGC TGGTGCGCCT CGCGCAGGAC TTCTCCTCGC GCTATCCGCT GGTCGACGGC CAGGGAAACT TCGGCAATAT CGACGGCGAT AATCCGGCCG CCTATCGCTA CACCGAAGCG CGCATGACCG ATGTCGCGCG GCTCTTGCTC GATGGTATCG ACGAGGACGG CGTCGCGTTT CGGCCCAACT ACGACGGCCA GGCCAAAGAG CCGGTGGTGC TGCCCGGCGG CTTTCCGAAC CTGCTTGCCA ACGGCGCGCA GGGCATCGCG GTCGGCATGG CCACCGCGAT CCCGCCGCAC AACGCCGCCG AACTCTGCGA CGCCGCGCTG CATCTGATCG ACAAGCCGGA CGCCAAGACG AAGGCGCTGC TGCGCTTCGT CAAGGGCCCG GACTTCCCGA CTGGCGGCAT CGTCATCGAT TCCAAGGAGA GCATCGCCGA GGCCTATACG ACGGGCCGCG GCGCGTTCCG CACCCGCGCC AAATGGATGC AGGAGGAGGG CGCACGCGGC ACCTGGGTCG TGGTCGTCAC CGAAATTCCG TGGCTGGTGC AGAAGTCCCG GCTGATCGAG AAGATCGCCG AACTGTTGAA CGAGAAGAAG CTGCCGCTGG TCGGCGACAT CCGCGACGAA TCCGCCGAAG ACGTGCGCGT CGTGATCGAG CCGAAGTCGC GCGCCGTCGA TCCGGCGCTG ATGATGGAAT CGCTGTTCCG GCTGACCGAG CTGGAAAGCC GGATCCCGCT CAATCTCAAC GTGCTGGTGA AGGGTCGCAT CCCCAAGGTG CTCGGCCTCG CCGAATGTCT GCGCGAATGG CTCGACCATC TGCGCGACGT GCTGCTGCGG CGCTCGAACT ACCGCAAGGC GCAGATCGAG CATCGGCTCG AAGTGCTCGG CGGCTATCTG ATCGCGTATC TAAACATCGA CAAGGTGATC AAGATCATCC GCACCGAGGA CGAGCCGAAA CCCGTCCTGA TGAAGGCCTT CAACCTCACC GATGTGCAGG CCGAAGCGAT CCTCAACATG CGGCTGCGCA GCTTGCGCAA GCTCGAGGAA TTCGAGATCC GCACCGAGGA CAAGAATCTG CGCGCCGAGC TGAAGGGCAT CAACGCGATT CTGAAATCCG AGACCGAGCA GTGGGCCAAG GTCGGCGAGC AGGTGCGCAA GGTGCGCGAG ATGTTCGGGC CGAAGACACC GCTCGGCAAG CGCCGCACCC AATTCGCCGA CGCGCCCGAG CACGATCTCG CCGCGATCGA GGAAGCCTTC GTCGAGCGCG AGCCGGTCAC CGTGGTGATC TCCGACAAGG GCTGGGTGCG CACCCTGAAG GGCCACGTCG CCGATCTGTC CGGTCTGAAC TTCAAGCAGG ACGACAAGCT CGATAGGGCG TTCTTCGCCG AGACCACGTC GAAGCTGCTG CTGCTCGCCA CCAATGGGCG GTTCTACTCG CTCGAGGTCG CCAAGCTGCC CGGCGGCCGC GGCCATGGCG AGCCGATCCG CATGTTCATC GACATGGAGC AGGACGCCGC CATCGTCGCG ATGTTCGTGC ACAAGGGCGG TCGCAAATTC CTGATCGCCA GTCATGACGG CCAGGGATTC GTCGTCGGCG AGGACGATTG CGTCGGCACC ACCCGCAAGG GCAAGCAGAT CATCAATGTC GAGATGCCGA ACGAGGCGAG GGCGCTGACC GTCGTCGGCG ACGGCTCGGA CAACGTCGCG GTGATCGGCG ACAACCGCAA GATGCTGATC TTCCCGCTCG ACCAGGTGCC GGAGATGGCG CGCGGCCGCG GCGTGCGGCT GCAGAAATAC AAGGACGGCG GGCTGTCCGA CATCGTCACC TTCGTGGCCA AGGAAGGTCT GAGCTGGCGC GACTCCGCCG GCCGCGAGTT CAGCGCGACG ATGAAGGAAC TGGCCGAATG GCGCGGCAAT CGCGCCGATG CCGGCCGAAT GCCGCCGAAG GGTTTTCCGA AGTCGAACAA ATTCGGCCGC GGCATCGAGT GA
|
Protein sequence | MGKRLIPPEP AEIHEVQLRE ALEERYLAYA LSTIMHRALP DARDGLKPVH RRILYGMRLL RLDPGTPFKK SAKIVGDVMG SFHPHGDQSI YDALVRLAQD FSSRYPLVDG QGNFGNIDGD NPAAYRYTEA RMTDVARLLL DGIDEDGVAF RPNYDGQAKE PVVLPGGFPN LLANGAQGIA VGMATAIPPH NAAELCDAAL HLIDKPDAKT KALLRFVKGP DFPTGGIVID SKESIAEAYT TGRGAFRTRA KWMQEEGARG TWVVVVTEIP WLVQKSRLIE KIAELLNEKK LPLVGDIRDE SAEDVRVVIE PKSRAVDPAL MMESLFRLTE LESRIPLNLN VLVKGRIPKV LGLAECLREW LDHLRDVLLR RSNYRKAQIE HRLEVLGGYL IAYLNIDKVI KIIRTEDEPK PVLMKAFNLT DVQAEAILNM RLRSLRKLEE FEIRTEDKNL RAELKGINAI LKSETEQWAK VGEQVRKVRE MFGPKTPLGK RRTQFADAPE HDLAAIEEAF VEREPVTVVI SDKGWVRTLK GHVADLSGLN FKQDDKLDRA FFAETTSKLL LLATNGRFYS LEVAKLPGGR GHGEPIRMFI DMEQDAAIVA MFVHKGGRKF LIASHDGQGF VVGEDDCVGT TRKGKQIINV EMPNEARALT VVGDGSDNVA VIGDNRKMLI FPLDQVPEMA RGRGVRLQKY KDGGLSDIVT FVAKEGLSWR DSAGREFSAT MKELAEWRGN RADAGRMPPK GFPKSNKFGR GIE
|
| |