Gene RPB_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2620 
Symbol 
ID3910412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3001661 
End bp3003922 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content64% 
IMG OID637884519 
ProductDNA topoisomerase IV subunit A 
Protein accessionYP_486233 
Protein GI86749737 
COG category[L] Replication, recombination and repair 
COG ID[COG0188] Type IIA topoisomerase (DNA gyrase/topo II, topoisomerase IV), A subunit 
TIGRFAM ID[TIGR01062] DNA topoisomerase IV, A subunit, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.180269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAAA GACTGATTCC GCCGGAGCCG GCCGAGATTC ACGAAGTGCA GCTTCGTGAA 
GCGCTGGAAG AGCGTTACCT CGCTTATGCG CTCTCGACCA TCATGCATCG CGCCTTGCCT
GACGCTCGCG ACGGGCTGAA GCCGGTGCAT CGGCGTATTC TTTATGGCAT GCGGCTGTTG
CGGCTCGACC CCGGCACGCC GTTCAAGAAG TCGGCCAAGA TCGTCGGCGA CGTGATGGGC
TCGTTCCATC CGCACGGCGA CCAGTCGATC TACGACGCGC TGGTGCGCCT CGCGCAGGAC
TTCTCCTCGC GCTATCCGCT GGTCGACGGC CAGGGAAACT TCGGCAATAT CGACGGCGAT
AATCCGGCCG CCTATCGCTA CACCGAAGCG CGCATGACCG ATGTCGCGCG GCTCTTGCTC
GATGGTATCG ACGAGGACGG CGTCGCGTTT CGGCCCAACT ACGACGGCCA GGCCAAAGAG
CCGGTGGTGC TGCCCGGCGG CTTTCCGAAC CTGCTTGCCA ACGGCGCGCA GGGCATCGCG
GTCGGCATGG CCACCGCGAT CCCGCCGCAC AACGCCGCCG AACTCTGCGA CGCCGCGCTG
CATCTGATCG ACAAGCCGGA CGCCAAGACG AAGGCGCTGC TGCGCTTCGT CAAGGGCCCG
GACTTCCCGA CTGGCGGCAT CGTCATCGAT TCCAAGGAGA GCATCGCCGA GGCCTATACG
ACGGGCCGCG GCGCGTTCCG CACCCGCGCC AAATGGATGC AGGAGGAGGG CGCACGCGGC
ACCTGGGTCG TGGTCGTCAC CGAAATTCCG TGGCTGGTGC AGAAGTCCCG GCTGATCGAG
AAGATCGCCG AACTGTTGAA CGAGAAGAAG CTGCCGCTGG TCGGCGACAT CCGCGACGAA
TCCGCCGAAG ACGTGCGCGT CGTGATCGAG CCGAAGTCGC GCGCCGTCGA TCCGGCGCTG
ATGATGGAAT CGCTGTTCCG GCTGACCGAG CTGGAAAGCC GGATCCCGCT CAATCTCAAC
GTGCTGGTGA AGGGTCGCAT CCCCAAGGTG CTCGGCCTCG CCGAATGTCT GCGCGAATGG
CTCGACCATC TGCGCGACGT GCTGCTGCGG CGCTCGAACT ACCGCAAGGC GCAGATCGAG
CATCGGCTCG AAGTGCTCGG CGGCTATCTG ATCGCGTATC TAAACATCGA CAAGGTGATC
AAGATCATCC GCACCGAGGA CGAGCCGAAA CCCGTCCTGA TGAAGGCCTT CAACCTCACC
GATGTGCAGG CCGAAGCGAT CCTCAACATG CGGCTGCGCA GCTTGCGCAA GCTCGAGGAA
TTCGAGATCC GCACCGAGGA CAAGAATCTG CGCGCCGAGC TGAAGGGCAT CAACGCGATT
CTGAAATCCG AGACCGAGCA GTGGGCCAAG GTCGGCGAGC AGGTGCGCAA GGTGCGCGAG
ATGTTCGGGC CGAAGACACC GCTCGGCAAG CGCCGCACCC AATTCGCCGA CGCGCCCGAG
CACGATCTCG CCGCGATCGA GGAAGCCTTC GTCGAGCGCG AGCCGGTCAC CGTGGTGATC
TCCGACAAGG GCTGGGTGCG CACCCTGAAG GGCCACGTCG CCGATCTGTC CGGTCTGAAC
TTCAAGCAGG ACGACAAGCT CGATAGGGCG TTCTTCGCCG AGACCACGTC GAAGCTGCTG
CTGCTCGCCA CCAATGGGCG GTTCTACTCG CTCGAGGTCG CCAAGCTGCC CGGCGGCCGC
GGCCATGGCG AGCCGATCCG CATGTTCATC GACATGGAGC AGGACGCCGC CATCGTCGCG
ATGTTCGTGC ACAAGGGCGG TCGCAAATTC CTGATCGCCA GTCATGACGG CCAGGGATTC
GTCGTCGGCG AGGACGATTG CGTCGGCACC ACCCGCAAGG GCAAGCAGAT CATCAATGTC
GAGATGCCGA ACGAGGCGAG GGCGCTGACC GTCGTCGGCG ACGGCTCGGA CAACGTCGCG
GTGATCGGCG ACAACCGCAA GATGCTGATC TTCCCGCTCG ACCAGGTGCC GGAGATGGCG
CGCGGCCGCG GCGTGCGGCT GCAGAAATAC AAGGACGGCG GGCTGTCCGA CATCGTCACC
TTCGTGGCCA AGGAAGGTCT GAGCTGGCGC GACTCCGCCG GCCGCGAGTT CAGCGCGACG
ATGAAGGAAC TGGCCGAATG GCGCGGCAAT CGCGCCGATG CCGGCCGAAT GCCGCCGAAG
GGTTTTCCGA AGTCGAACAA ATTCGGCCGC GGCATCGAGT GA
 
Protein sequence
MGKRLIPPEP AEIHEVQLRE ALEERYLAYA LSTIMHRALP DARDGLKPVH RRILYGMRLL 
RLDPGTPFKK SAKIVGDVMG SFHPHGDQSI YDALVRLAQD FSSRYPLVDG QGNFGNIDGD
NPAAYRYTEA RMTDVARLLL DGIDEDGVAF RPNYDGQAKE PVVLPGGFPN LLANGAQGIA
VGMATAIPPH NAAELCDAAL HLIDKPDAKT KALLRFVKGP DFPTGGIVID SKESIAEAYT
TGRGAFRTRA KWMQEEGARG TWVVVVTEIP WLVQKSRLIE KIAELLNEKK LPLVGDIRDE
SAEDVRVVIE PKSRAVDPAL MMESLFRLTE LESRIPLNLN VLVKGRIPKV LGLAECLREW
LDHLRDVLLR RSNYRKAQIE HRLEVLGGYL IAYLNIDKVI KIIRTEDEPK PVLMKAFNLT
DVQAEAILNM RLRSLRKLEE FEIRTEDKNL RAELKGINAI LKSETEQWAK VGEQVRKVRE
MFGPKTPLGK RRTQFADAPE HDLAAIEEAF VEREPVTVVI SDKGWVRTLK GHVADLSGLN
FKQDDKLDRA FFAETTSKLL LLATNGRFYS LEVAKLPGGR GHGEPIRMFI DMEQDAAIVA
MFVHKGGRKF LIASHDGQGF VVGEDDCVGT TRKGKQIINV EMPNEARALT VVGDGSDNVA
VIGDNRKMLI FPLDQVPEMA RGRGVRLQKY KDGGLSDIVT FVAKEGLSWR DSAGREFSAT
MKELAEWRGN RADAGRMPPK GFPKSNKFGR GIE