Gene RPC_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2239 
Symbol 
ID3973256 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2443522 
End bp2446290 
Gene Length2769 bp 
Protein Length922 aa 
Translation table11 
GC content65% 
IMG OID637925347 
ProductDNA topoisomerase I 
Protein accessionYP_532112 
Protein GI90423742 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01051] DNA topoisomerase I, bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0515029 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0535493 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCG TCATTGTCGA GTCGCCTGCG AAGGCCAAGA CGATCAACAA ATATCTGGGC 
AGTTCCTACG AGGTTCTGGC CTCGTTCGGC CATGTCCGCG ACCTTCCGGC GAAGAACGGC
TCGGTCGATC CCGACGCCAA TTTCCAGATG ATCTGGGAAA TCGATCCCAA GGCCGCCGGC
CGGCTCAACG ACATCGCCAA GGCGCTGAAG GGCGCCGACA AGCTGATCCT CGCAACCGAC
CCTGATCGCG AGGGGGAAGC GATCTCCTGG CACGTGCTGG AGGTGTTGAA AGAGAAGCGC
GCGATCAAGG ATCACAAGAT CGAACGCGTG GTGTTCAACG CCATCACCAA GCAGGCGGTC
ACCGATGCGA TGAAGAACCC GCGCCAGATC GACGGCGCGC TGGTCGACGC CTATATGGCG
CGCCGCGCGC TGGACTATCT GGTCGGCTTT ACACTCTCTC CCGTATTGTG GCGCAAACTG
CCCGGCGCCC GCTCCGCCGG CCGCGTCCAA TCGGTGGCGC TGCGGCTGGT CTGCGACCGC
GAGCTCGAGA TCGAGAAATT CGTGCCGCGG GAATACTGGT CGCTGGTGGC GACCCTGACC
ACGCCGCGCG GCGAGATGTT CGAGGCCCGG CTCACCGGCG CCGATGGCAA GAAGATCCAG
CGGCTCGACA TCGGCACCGG CGCCGAGGCC GAGGATTTCA AGCAGGCGAT CGAAGCGGCG
CTGTTCAATG TCGCCAGCGT CGAAGCCAAG CCGGCGCGGC GCAACCCCTA CGCGCCGTTC
ACCACCTCGA CGCTGCAGCA GGAGGCCAGC CGCAAGCTCG GCTTCGCCCC GGCGCACACC
ATGCGGATCG CGCAGCGGTT GTATGAAGGC ATCGACATCG GCGGCGAGAC CACCGGTCTC
ATTACTTATA TGCGTACCGA CGGCGTGCAG ATTGATTCCT CCGCCATCAC CCAGGCGCGC
CAGGTGATCG GCGAGGACTA CGGCAAGCAA TACGTTCCGG AGGCGCCGCG GCAATACACC
GCCAAGGCCA AGAACGCCCA GGAAGCCCAT GAAGCGATCC GGCCGACCGA CCTCAGCCGC
CGCCCCGCCA GCTTGCGCGC CCGGCTCGAC CACGATCAGA TCCGGCTCTA CGAGCTGATC
TGGATCCGCA CCATCGCCAG CCAGATGGAA TCCGCCGAAT TGGAGCGCAC CACCGTCGAG
ATCGCCGCCA AGGCGGGCTC GCGGGTGCTG GAACTGCGCG CCACCGGCCA GGTGGTGAAG
TTCGACGGCT TCCTGGCGGT GTATCAGGAA GGCCGCGACG ACGACGGTGA CGACGAGGAT
TCCCGCCGAC TGCCGGCAAT GAGCCAAGGC GAAGCCTTGG CTCGCAAGGA CCTCGCCGTC
ACCCAGCATT TCACCGAGCC GCCGCCGCGC TTCTCCGAAG CCTCGCTGGT CAAGCGGATG
GAAGAGCTCG GCATCGGCCG GCCCTCGACC TACGCCTCGA TCCTGCAGGT GTTGAAGGAC
CGCGGCTACG TCAAGCTCGA CAAGAAGCGG CTGCACGCCG AGGACAAGGG CCGCGTCGTG
GTCGCGTTCC TGGAGAACTT CTTCGCCCGC TACGTCGAAT ACGACTTCAC CGCGGCGCTG
GAGGAAAACC TCGACCGGAT TTCCAACAAC GAAATCTCCT GGCAACAGGT GCTGCGCGAT
TTCTGGACCG ACTTCATCGG CGCGGTCAAC GACATCAAGG ATCTGCGCGT CGCGCAGGTG
CTGGACGCGC TCGACGACAT GCTCGGCTCG CACATCTATG CGCCACGCGA CGACGGCGGC
GATCCGCGGC AATGCCCGAG CTGCGGCACC GGCAAGCTCA ACCTCAAGGC CGGCAAGTTC
GGCGCCTTCG TCGGCTGCAG CAACTATCCG GAATGCCGCT ACACCCGCCC GTTGGCGGCT
GACGGCGGCG GCGACGGCGA CCGCATTCTC GGCAAGGACC CGGTGTCCGG CCTCGAAGTC
GCGGTCAAGG CCGGCCGGTT CGGTCCCTAT ATCCAGCTCG GTGACGCCAA GGACTACGCC
GAGGGCGAGA AGCCGAAACG CGCCGGCATT CCGAAAAACT CCTCGCCCGG CGACATGGAG
CTCGAGCTCG CGCTGAAGCT GTTGTCGCTG CCGCGCGAAG TCGGCAAACA TCCGGAGACC
GGCGAGCCGA TCAAGGCCGG CATCGGCCGC TTCGGTCCCT ATGTGCAGCA TGAGAAGACC
TATGCCAGCC TGGAAGCTGG CGACGAGGTG TTCGACATCG GGCTGAACCG CGCGGTGACG
CTGATCGCCG AGAAGATCCT CAAAGGCCCG AGCAAGCGAC GGTTTGGTTC GGACCCCGGC
AAACCGCTCG GCGAGCATCC CTCGCTCGGC ACCGTGGCGG TGAAGAGCGG ACGTTACGGC
GCTTACGTCA CCGCCGGCGG CGTCAACGCC ACGATTCCGA GCGACAAAAC TCAAGAGAGC
ATCACCCTGC CCGAGGCCAT CGCGCTGATC GACGAGCGCG CGGCGAAGGG CGGCGGCAAG
CCGAAGAAAG CCGCGAAGAA AGCGCCGGCC AAGAAGGCCG CGAAGTCCGA TACCGACGCA
GCGGCAGAGA CGAAAAAACC CGCGAAGAAA GCCGCGGCGA AGAAGTCGGT CGCCAAGCCG
AAGGCCGACG GCGTCGCCGT AAGTGCCGCG CGCGCGCCGG CGAAAGCCAA ATCCTCGACC
AAGACCGCCG CAGCCAAGAA GCCTGCAAAG CCGGCGGCGA AAAAATCCGC GGGCAAAGCC
AACGGCTGA
 
Protein sequence
MNIVIVESPA KAKTINKYLG SSYEVLASFG HVRDLPAKNG SVDPDANFQM IWEIDPKAAG 
RLNDIAKALK GADKLILATD PDREGEAISW HVLEVLKEKR AIKDHKIERV VFNAITKQAV
TDAMKNPRQI DGALVDAYMA RRALDYLVGF TLSPVLWRKL PGARSAGRVQ SVALRLVCDR
ELEIEKFVPR EYWSLVATLT TPRGEMFEAR LTGADGKKIQ RLDIGTGAEA EDFKQAIEAA
LFNVASVEAK PARRNPYAPF TTSTLQQEAS RKLGFAPAHT MRIAQRLYEG IDIGGETTGL
ITYMRTDGVQ IDSSAITQAR QVIGEDYGKQ YVPEAPRQYT AKAKNAQEAH EAIRPTDLSR
RPASLRARLD HDQIRLYELI WIRTIASQME SAELERTTVE IAAKAGSRVL ELRATGQVVK
FDGFLAVYQE GRDDDGDDED SRRLPAMSQG EALARKDLAV TQHFTEPPPR FSEASLVKRM
EELGIGRPST YASILQVLKD RGYVKLDKKR LHAEDKGRVV VAFLENFFAR YVEYDFTAAL
EENLDRISNN EISWQQVLRD FWTDFIGAVN DIKDLRVAQV LDALDDMLGS HIYAPRDDGG
DPRQCPSCGT GKLNLKAGKF GAFVGCSNYP ECRYTRPLAA DGGGDGDRIL GKDPVSGLEV
AVKAGRFGPY IQLGDAKDYA EGEKPKRAGI PKNSSPGDME LELALKLLSL PREVGKHPET
GEPIKAGIGR FGPYVQHEKT YASLEAGDEV FDIGLNRAVT LIAEKILKGP SKRRFGSDPG
KPLGEHPSLG TVAVKSGRYG AYVTAGGVNA TIPSDKTQES ITLPEAIALI DERAAKGGGK
PKKAAKKAPA KKAAKSDTDA AAETKKPAKK AAAKKSVAKP KADGVAVSAA RAPAKAKSST
KTAAAKKPAK PAAKKSAGKA NG