Gene RPD_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3033 
Symbol 
ID4023536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3378832 
End bp3381576 
Gene Length2745 bp 
Protein Length914 aa 
Translation table11 
GC content66% 
IMG OID637963232 
ProductDNA topoisomerase I 
Protein accessionYP_570160 
Protein GI91977501 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.298843 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.412116 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTCGGC 
TCCTCCTACG AGGTTCTGGC CTCGTTCGGG CATGTCCGCG ATCTGCCGGC CAAGAACGGG
TCGGTCGATC CAGACGCGAA TTTCCAGATG ATTTGGGAGA TCGATCCCAA AGCTGCCGGC
CGGCTCAACG ACATCGCCAA GGCCCTCAAA GGCGCCGACA AGCTGATCCT CGCCACCGAC
CCTGATCGCG AGGGTGAGGC GATCTCCTGG CACGTGCTGG AAGTGTTGAA GCAGAAGCGC
GCGCTGAAAG ACCAGAAGGT CGAGCGCGTG GTGTTCAACG CCATCACCAA GCAGTCGGTC
ACCGACGCCA TGAAGCACCC GCGCGAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG
CGCCGCGCGC TGGATTATCT GGTCGGCTTC ACGCTCTCCC CGGTGCTGTG GCGCAAGCTG
CCCGGCGCGC GTTCCGCCGG GCGGGTGCAA TCGGTGGCGC TGCGGCTTGT GTGCGACCGC
GAGATGGAGA TCGAGAAGTT CGTTCCGCGC GAATACTGGT CGCTGATCGC GACCCTGACG
ACGCCGCGCG GCGACAGCTT CGAGGCCCGC CTGGTCGGCG CCGACGGCAA GAAGATCCAG
CGGCTCGACA TTGGTACCGG CGTCGAGGCC GAGGATTTCA AGCAGGCGAT CGAGCAGGCC
AACTTCAAGG TGTCGAGCGT CGAGGCCAAG CCGGCCCGCC GCAACCCCTA CGCCCCCTTC
ACCACCTCGA CGCTGCAGCA GGAAGCCAGC CGCAAGCTCG GCTTCGCGCC GGCGCACACG
ATGCGGATCG CGCAACGGCT GTATGAAGGC ATCGACATCG GCGGCGAGAC CACCGGTCTC
ATTACTTATA TGCGTACCGA CGGCGTCCAG ATCGACCCCT CCGCCATCAC CGAGGCGCGC
AAGGTGATCG CCGAGGATTA CGGCAGCGCC TACGTTCCGG ATACGCCGCG GCAATATCAG
GCCAAGGCGA AAAACGCCCA GGAAGCGCAT GAGGCGATCC GCCCGACCGA CATGTCGCGC
CGCCCGGCCG ACGTCAACGG CAGGCTCGAT TCCGATCAGG CCCGACTCTA CGAACTGATC
TGGGTCCGCA CCGTCGCGAG CCAGATGGAA TCGGCCGAGA TGGAGCGCAC CACCGTCGAC
ATCGAGGCCA AAGCCGGGTC GCGGGTGCTG GAGCTGCGCG CCACCGGTCA GGTGGTCAAG
TTCGACGGCT TTCTCGCCGC CTATCAGGAA GGCCGCGACG ACGACAGCGA GGACGAGGAT
TCGCGCCGGC TGCCGGCGAT GAGCGAAGAC GAGGCGCTGA AGCGCGACGC GCTCGCCGTC
ACCCAGCATT TCACCGAACC GCCGCCGCGC TTTTCGGAAG CCTCGCTGGT GAAGCGGATG
GAAGAGCTCG GCATCGGCCG ACCCTCGACT TACGCCTCGA TCCTGCAGGT GCTGAAGGAT
CGCGGCTACG TGAAGCTGGA GAAGAAGCGG CTGCACGGCG AGGACAAAGG TCGCGTCGTG
ATCGCGTTCC TGGAGAGCTT TTTCGCGCGC TACGTCGAAT ACGACTTCAC CGCGGCGCTG
GAAGAGAAAC TCGACCGCAT CTCCAACAAT GAAATCTCCT GGCAGCAGGT GCTGCGCGAT
TTCTGGACCG ATTTCATCGG CGCGGTCGAC GACATTAAAG AGCTGCGGGT CGCGCAGGTG
CTCGACGTGC TCGACGAGAT GCTCGGTCCG CATATCTATG CGCCGCGCGA GGATGGCGGC
GATCCGCGGC AGTGCCCGAG CTGCGGCACC GGCCGGCTCA ACCTCAAGGC CGGCAAGTTC
GGCGCCTTTG TCGGCTGCTC GAACTATCCG GAATGCCGCC ACACCCGCCC GCTCGCCGCC
GACGGCGGCG GCGGCGATGC CGACCGCGTG CTCGGCATCG ATCCCGACAC CGGCTTCGAA
GTGGCCGTCA AATCCGGTCG GTTCGGTCCT TACATCCAGC TTGGCGAAGC CAAGGACTAC
GCCGAAGGCG AGAAGCCGAA GCGCGCAGGC ATTCCGAAGG GCACCTCGCC GTCCGACGTG
GAGCTCGAGA TGGCGCTGCG GCTGCTCGCG CTGCCGCGCG AAGTCGGCAA GCATCCGGAG
ACCGGCGAGC CGATCAAGGC CGGCATCGGC CGTTTCGGGC CCTATGTGCA GCACGAGAAG
ACCTATGCCA GCCTCGAGGC CGGCGACGAT GTCCACAACA TCGGGCTGAA CCGTGCGGTG
ACCCTGATCG CCGAGAAGAT CGCCAAGGGC CCGAGTAAGC GGCGCTTCGG CGCTGACCCC
GGCAAGCCGC TCGGCGATCA CCCGACCCTC GGCGGCGTCG CCGTGAAAGC CGGCCGCTAC
GGCGCCTATG TCACCGCCGG CGGCGTCAAC GCCACGATCC CGAACGACAA GACCCAGGAC
ACCATCACGC TCGCCGAGGC CATCGCGCTG ATCGACGAGC GCGCCGCCAA GGGCGGCGGC
GGCAAGGCCA AGAAGAAGGC TCCGGCGAAG AAGGCCGCAG CCTCCGGCGA GGCCAAGCCG
AAGAAAGCCG CGGCCAAGAA GACCAAGCCG AAAGCCGAAA CCGCCGCCGC CAGCAAAGCG
CGCGCGCCGG TGACGGCCAA GACGTCCGTG GCCAAGGCCT CCACCGCCAA AGCAACAGCC
AAGCCCAAAT CACCCGCCAA AAAGAGCGCG GCCAAGAACG GATAG
 
Protein sequence
MNLVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG 
RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKQKR ALKDQKVERV VFNAITKQSV
TDAMKHPREI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR
EMEIEKFVPR EYWSLIATLT TPRGDSFEAR LVGADGKKIQ RLDIGTGVEA EDFKQAIEQA
NFKVSSVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL
ITYMRTDGVQ IDPSAITEAR KVIAEDYGSA YVPDTPRQYQ AKAKNAQEAH EAIRPTDMSR
RPADVNGRLD SDQARLYELI WVRTVASQME SAEMERTTVD IEAKAGSRVL ELRATGQVVK
FDGFLAAYQE GRDDDSEDED SRRLPAMSED EALKRDALAV TQHFTEPPPR FSEASLVKRM
EELGIGRPST YASILQVLKD RGYVKLEKKR LHGEDKGRVV IAFLESFFAR YVEYDFTAAL
EEKLDRISNN EISWQQVLRD FWTDFIGAVD DIKELRVAQV LDVLDEMLGP HIYAPREDGG
DPRQCPSCGT GRLNLKAGKF GAFVGCSNYP ECRHTRPLAA DGGGGDADRV LGIDPDTGFE
VAVKSGRFGP YIQLGEAKDY AEGEKPKRAG IPKGTSPSDV ELEMALRLLA LPREVGKHPE
TGEPIKAGIG RFGPYVQHEK TYASLEAGDD VHNIGLNRAV TLIAEKIAKG PSKRRFGADP
GKPLGDHPTL GGVAVKAGRY GAYVTAGGVN ATIPNDKTQD TITLAEAIAL IDERAAKGGG
GKAKKKAPAK KAAASGEAKP KKAAAKKTKP KAETAAASKA RAPVTAKTSV AKASTAKATA
KPKSPAKKSA AKNG