Gene RPB_2419 details

Gene Information

Locus tag: RPB_2419
Symbol:
ID: 3909553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2772217
End bp: 2774952
Gene Length: 2736 bp
Protein Length: 911 aa
Translation table: 11
GC content: 66%
IMG OID: 637884318
Product: DNA topoisomerase I
Protein accession: YP_486035
Protein GI: 86749539
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
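
The start, end, gene length and protein length fields in this record agree with one another, and a short sketch can check that. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2772217, 2774952)
print(length)                     # 2736
print(protein_length_aa(length))  # 911
```

Both results match the Gene Length and Protein Length values listed in the record.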


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0527527
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.116125
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTGGGC 
TCGTCCTACG AGGTTCTGGC CTCGTTCGGG CATGTGCGCG ACCTGCCGGC CAAGAATGGC
TCGGTCGATC CCGACGCCAA TTTCCAGATG ATCTGGGAGA TCGATCCGAA AGCCGCCGGA
CGGCTCAACG ACATCGCCAA GGCGCTCAAG GGCGCCGACA AGCTGATCCT CGCCACCGAC
CCTGATCGCG AGGGCGAGGC GATCTCCTGG CACGTGCTGG AGGTGCTCAA ACAAAAGCGT
GCGCTGAAGG ACCAGAAGGT CGAGCGCGTG GTGTTCAACG CCATCACCAA GCAGTCCGTC
ACCGAGGCGA TGCAGCATCC GCGCGAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG
CGCCGGGCGC TGGATTATCT GGTCGGCTTC ACTCTTTCTC CTGTGCTGTG GCGCAAGCTG
CCGGGCGCCC GCTCCGCCGG GCGCGTGCAG TCGGTCGCGC TGCGGCTGGT GTGCGACCGC
GAGATGGAGA TCGAGAAGTT CGTCGCGCGC GAATACTGGT CGCTGATCGC GACGCTGACG
ACGCCGCGCG GCGACGCCTT CGAGGCCCGC CTGGTCGGCG CCGACGGCAA GAAGATCCAG
CGGCTCGACA TCGGCACCGG CGCCGAGGCC GAGGACTTCA AGCAGGCGAT CGAGACCGCC
AATTTCAACG TGTCGAGCGT CGAGGCCAAG CCCGCCCGCC GCAATCCCTA CGCCCCGTTC
ACCACCTCGA CGCTGCAGCA GGAAGCCAGC CGCAAGCTCG GCTTTGCGCC GGCGCACACC
ATGCGGATCG CGCAGCGGCT GTATGAAGGC ATCGACATCG GCGGCGAAAC CACCGGTCTC
ATTACTTATA TGCGAACCGA CGGCGTCCAG ATCGACCCGT CCGCCATTAC GCAAGCGCGC
AAGGTGATCG CCGAGGATTA CGGCAGCGCC TATGTGCCGG ACTCGCCACG GCAATATCAG
GCCAAGGCCA AGAACGCCCA GGAAGCGCAC GAAGCAATCC GCCCGACCGA CCTGTCGCGC
CGCCCGTCCG AGGTCAACAA GCGGCTCGAT TCCGACCAGG CCCGGCTCTA CGAGCTGATC
TGGGTCCGCA CCGTCGCCAG CCAGATGGAA TCGGCCGAGA TGGAGCGCAC CACCGTCGAC
ATCGAGGCGA AGGCCGGATC GCGCGTGCTG GAGCTGCGCG CCACCGGCCA GGTTGTGAAG
TTCGACGGCT TCCTCGCCGC CTATCAGGAA GGCCGCGACG ACGATTCCGA AGACGAGGAT
TCGCGACGGC TGCCGGCGAT GAGCGAGAAC GAGGCGCTGA AGCGCGAAGC GCTCGCGGTG
ACGCAGCATT TCACCGAACC GCCGCCGCGC TTCTCGGAAG CCTCATTGGT GAAGCGGATG
GAAGAGCTCG GCATCGGCCG GCCCTCGACC TATGCGTCGA TCCTGCAGGT GCTGAAGGAT
CGCGGCTATG TGAAGCTCGA AAAGAAGCGG CTGCACGGCG AGGACAAGGG CCGCGTCGTG
ATCGCGTTCC TGGAGAGCTT CTTCGCCCGC TATGTCGAAT ACGACTTCAC CGCGGCGCTG
GAAGAGAAGC TCGACCGCAT CTCCAACAAC GAAATCTCCT GGCAGCAGGT GCTGCGCGAT
TTCTGGACCG ACTTCATCGG CGCGGTCAAT GACATCAAGG AACTGCGCGT CGCGCAGGTG
CTCGACGTGC TCGACGAGAT GCTCGGCCCG CACATCTATG CACCCCGCGA GGACGGCGGC
GATCCGCGGC AGTGCCCGAG CTGCGGCACT GGCCGGCTCA ACCTCAAGGC CGGCAAGTTC
GGCGCCTTCG TCGGCTGCTC GAACTATCCG GAATGCCGCC ACACCCGCCC GCTTGCTGCA
GATGGCGGCG GCGCCGATGC CGATCGCGTG CTCGGCCTCG ATCCCGACAC CGGCTTCGAA
GTCGCGGTCA AATCCGGCCG GTTCGGCCCC TATATCCAGC TCGGCGACGC CAAGGACTAC
GCGGAGGGCG AAAAGCCCAA GCGCGCCGGC ATCCCGAAGG GCACCTCGCC GTCCGACGTC
GAGCTCGACG TCGCGCTGCG GCTCCTGGCG CTGCCGCGTG AAGTCGGCAA GCACCCCGAG
ACCGGCGAGC CGATCAAGGC CGGCATCGGC CGGTTCGGGC CCTATGTGCA GCACGAGAAG
ACCTACGCCA GCCTCGAGGC CGGCGATGAC GTCCACAACA TTGGGCTCAA TCGCGCGGTC
ACGCTGATCG CCGAGAAGAT CGCCAAGGGT CCGAGCAAGC GCCGGTTTGG CGCCGATCCC
GGCAAGCCGC TCGGCGATCA TCCGTCGCTC GGCCCGGTCG CCGTCAAGGC CGGCCGCTAC
GGCGCCTATG TCACCGCCGG CGGCGTCAAT GCCACGATCC CGAACGACAA GACCCAGGAC
ACCATCACGC TCCCCGAAGC GATCGCGCTG ATCGACGAGC GCGCCGCCAA GGGCGGTGGG
GCTAAGGCCA AGAAGAAGGC GCCGGCCAAG AAAGCCGCAG CCAAGAGCGA CGCCAAGCCG
GCGAAGAAAG CCGCGGCCAA GAAGCCGAAA GCCGAGGGCG CCGCCGCAAG CCCGGCGCGC
GCGCCGGTGA AAGCCAAGAC GTCAACGACC AAGCCTAAAG CCGCGGCAGC CAAGCCGAAA
TCACCCGCCA AAAAGAGCGC GGCCAAGAAC GGATAG
 
Protein sequence
MNLVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG 
RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKQKR ALKDQKVERV VFNAITKQSV
TEAMQHPREI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR
EMEIEKFVAR EYWSLIATLT TPRGDAFEAR LVGADGKKIQ RLDIGTGAEA EDFKQAIETA
NFNVSSVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL
ITYMRTDGVQ IDPSAITQAR KVIAEDYGSA YVPDSPRQYQ AKAKNAQEAH EAIRPTDLSR
RPSEVNKRLD SDQARLYELI WVRTVASQME SAEMERTTVD IEAKAGSRVL ELRATGQVVK
FDGFLAAYQE GRDDDSEDED SRRLPAMSEN EALKREALAV TQHFTEPPPR FSEASLVKRM
EELGIGRPST YASILQVLKD RGYVKLEKKR LHGEDKGRVV IAFLESFFAR YVEYDFTAAL
EEKLDRISNN EISWQQVLRD FWTDFIGAVN DIKELRVAQV LDVLDEMLGP HIYAPREDGG
DPRQCPSCGT GRLNLKAGKF GAFVGCSNYP ECRHTRPLAA DGGGADADRV LGLDPDTGFE
VAVKSGRFGP YIQLGDAKDY AEGEKPKRAG IPKGTSPSDV ELDVALRLLA LPREVGKHPE
TGEPIKAGIG RFGPYVQHEK TYASLEAGDD VHNIGLNRAV TLIAEKIAKG PSKRRFGADP
GKPLGDHPSL GPVAVKAGRY GAYVTAGGVN ATIPNDKTQD TITLPEAIAL IDERAAKGGG
AKAKKKAPAK KAAAKSDAKP AKKAAAKKPK AEGAAASPAR APVKAKTSTT KPKAAAAKPK
SPAKKSAAKN G
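
The sequences in this record are laid out in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is one possible way to do that and to compute the GC fraction of a nucleotide string. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def join_blocks(lines):
    """Remove all whitespace from blocked sequence lines and join them."""
    return "".join("".join(line.split()) for line in lines).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as it appears in the record.
first_line = ["ATGAATCTCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTGGGC "]
seq = join_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing every line of the gene sequence to `join_blocks` would give the full string, whose length and GC fraction can then be compared with the Gene Length and GC content fields.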