Gene Information |
Locus tag | RPD_0203 |
Symbol | |
ID | 4020661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 229283 |
End bp | 232432 |
Gene Length | 3150 bp |
Protein Length | 1049 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637960382 |
Product | hypothetical protein |
Protein accession | YP_567344 |
Protein GI | 91974685 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3893] Inactivated superfamily I helicase |
TIGRFAM ID | [TIGR02786] double-strand break repair protein AddB, alphaproteobacterial type |
| |
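The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names `span_length` and `expected_protein_length` are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 229283
END_BP = 232432
GENE_LENGTH_BP = 3150
PROTEIN_LENGTH_AA = 1049


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The same check applies only to unspliced genes such as this one (Is gene spliced: No); a spliced gene would need its exon lengths summed first.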
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.587798 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTTT TCAGTGTTCC GCCCTCCGCG CCGTTTCTGC GCACGACGAT CAAGGCTCTG GTCGATGGCG AACTGATCGC GGGGTTCGAC GCGCGCGCAA AGCCCGAGCG CCTTGCCGAA GCCACGCTGT ATCTGCCGAC CCGCCGCGCC GGTCGCCTCG CGCGCGACAT CTTCCTCGAT GTCCTGGGCG CCGATGCGGT GTTGCTGCCG CGCATCGTCC CACTCGGCGA TGTCGACGAG GACGAACTGG CGTTCGCACA GGCCGCAACG GGCGCGGCCG ATCTCGACAT TCCGCCGGCG CTCGAAGGGC TGCCACGGCG CCTGCAGCTG GCGCAATTGA TCGCGGTCTG GGCAAAAGGG CTGCGGTCCG GCGATCCGCA GCAGTCGCCA TTGGTGGTCG GCGGCCCGGC GTCGACATTG GCGCTCGCCG ACGATCTGGC GCGGCTGATC GACGACATGG CGACGCGCGG CGTCGACTGG GCCGCGCTCG ACTCGCTGGT GCCGGATGCG TTCGATCGCT ACTGGCGACT GACGCTGGAT TTCCTCAAGA TCGCCGGCCA GTGGTGGCCG CAGCATCTGC GCGAGAGCGA CCAGATCGAA CCGGCGGCGC GGCGCGACTT GCTGATCGAA GCGGAAGCAG CGCGGCTTGC GGCGCATCGC GGCGGTCCGG TGATCGCGGC AGGCTCGACC GGCTCGATGC CGGCGACCGC GAAGCTGCTG CACGCCATCG CCCGATTGCC GAACGGCGCC GTTATCCTGC CGGGGCTGGA CACCGAACTC GACGAACAGG CGTGGCGACT GATCGGCGCC GTGCGCGACA AGCAGGGCCA ATTGATCTCG CCGCCGTCGC CGAATCATCC GCAATTCGCG ATGCACGGCC TGCTGACCCG GATGGGACTC GAGCGGCGCG AAATTGTCCG GCTCGGCGAA GCCGTGCGCC ATGGCCGCGA AGTGCTGGCC TCCGAGGCGA TGCGGCCGTC GGCGGCGACG GCGTCGTGGC ACGAGCGGCT CGCCGATCCC GAGGTCGATC GACTGATCGA ACAAGGCGTC AATGGCCTGA CGGTGATCGA GGCGCCGAAT TCCGAGATCG AGGCGCTGGC GATCGCGGTG GCGCTGCGCG AGGCGCGCGA ACGCGGCCAA TCCGCCGCAT TGGTGACGCC GGACCGCGCG CTGGCCCGGC GCGTGGTCGC CGCGCTCGGC CGCTGGAATC TGCCGGTCGA TGATTCCGGC GGCGATTCGC TGATGGAGAC CCAGGCCGGC ATCTTCGCCC GGCTCGCCGC TGAAGCGGCC CTGCATGGCT GCGAGCCGGC GACGCTGTTG GCGCTGCTGA AGCATCCGTT GCTGCGGCTC GGCCGCGCCG CGGGCGGCTG GCGGCACGCC ATCGAGACGC TGGAACTGGC GCTGCTGCGC GGAACGCGTC CGGCTGCCGG CAGCGAAGGC CTGGTCAAGG AGTTCGCGAA ATATCGCGCC GAACTGACCA GGCTGAAGCG CGGGGAACTC AGCGCGCTGC ATCCCTCGGA GCCGCGCGCG CGGCTCGGCG ACGAGAGCCT CGACGCCGCG CAGGAGCTGA TCGAGGCATT GCGCGCGGCG CTGGCGCCGT TGGAGACGGT GGGCGCAGAG CCGCTCGATC TGTGCGCCTT CGGGCGTCGA CACCGCGATG TGCTGATCGC GCTGTCGATC GATCACGACG AGATCGCGGT CGCTTTCGAA GGATCGCAGG GATCGGCCCT GCTTAAAGCG TTCGACGATC TCGCGGCGGT CGAGCCGCTG AGCGGCGTGC TGGTGCCGCC ACACGACTAC 
GCGGACGTGT TCGAAACCGC GTTCAGCGAC CGCATCGTGC GACGGCCCGA ACTCGCCGGC GCTGCGCTGC GCATCTACGG CCCGCTCGAA GCGCGGTTGA CGCAGCATGA TCGCGTCATT CTCGGTGGCC TGGTCGAAGG CGTCTGGCCG CCGGCGCCGC GGATCGATCC GTGGCTGTCG CGGCCGATGC GCCATGACCT CGGCCTCGAC CTGCCGGAGC GGCGGATCGG CCTGTCGGCG CACGACTTCG CGCAACTGCT CGGCGCCGAT GAGGTGATCC TCACTTACGC CAACAAGGTC GGCGGCGCGC CGGCGGTGGT GTCGCGCTTC CTGCACCGGC TCGAAGCCGT GACCGGCAAG GCGCGCTGGA GCGCCGTCAA GGCGCGCGGG CAAAGCTATC TCGACTACGC GCAGGCGCTC GATCGTCCCG AACAGGTCAC GCCGATCGCC CAGCCGGCGC CGAGGCCGCC GCGCGAGGCG CGGCCGCTGA AGTTGTCGGT CACCGCGATC GAGGACTGGC TGCGTGATCC GTACACGATC TACGCCAAGT TCATTCTCGG CCTGTCGGCG ATCGATCCGG TCGACATGCC GCTGTCCGCG GCGGATCGCG GCTCGGCGAT CCACGAAGCG CTCGGCGAAT TCACCGAGCT GTTCCCCGAC GAGCTGCCCG ACGATCCGGC GCAGGTGCTT CGTGAGATCG GCGAAAAGCA CTTCGCGCCG CTGATGGCTC ATCCGGAAGC GCGCGCGCTG TGGTGGCCTC GCTTCGCCCG CATCGCCGCG TGGTTCGGCA ATTGGGAGCA GGCGAGGCGC GCCGACGGAC TCCGCGTGTT TGCCGAGCGC GACGGTAGCC TCTCCATCCC GCTCGACGGC GGTCGCAACT TCATCCTGTC CGCGCGCGCC GATCGCATCG AGCATCGCGC CGACGGCAGC TTCGCGATTT TGGACTACAA GACCGGAAAT CCGCCGACCG GCAAGCAGGT GCGGATGGGG CTGTCGCCGC AACTCACGCT GGAAGCCGCA ATCCTGCGCG ACGGCGGCTT CGAGGGCATC GACGCCGGTT CGTCGGTGAG CGAACTTACT TACGTCAAGC TCAGCGGCAA CTCGCCGCCC GGCGACGAAT GCGTGCTGGA ATTGAGGATC GAGCGCAAGG ACGAGCGGCA GTCTCCCGAC GACGCGGCCG CCGAAGCGCG TAGCAAGCTC GAAACCCTGA TCCGGCGCTT CGACGACGAG GCGCAGCCGT ATCACGCGCT GGTGCTGTCG ATGTGGTCGC GTCGCTATGG CCGCTACGAC GATCTGGCGC GGATCAAGGA ATGGTCGGCC GCCGGCGGCG GTGCGGAGGA TCGGCTGTGA
|
Protein sequence | MRVFSVPPSA PFLRTTIKAL VDGELIAGFD ARAKPERLAE ATLYLPTRRA GRLARDIFLD VLGADAVLLP RIVPLGDVDE DELAFAQAAT GAADLDIPPA LEGLPRRLQL AQLIAVWAKG LRSGDPQQSP LVVGGPASTL ALADDLARLI DDMATRGVDW AALDSLVPDA FDRYWRLTLD FLKIAGQWWP QHLRESDQIE PAARRDLLIE AEAARLAAHR GGPVIAAGST GSMPATAKLL HAIARLPNGA VILPGLDTEL DEQAWRLIGA VRDKQGQLIS PPSPNHPQFA MHGLLTRMGL ERREIVRLGE AVRHGREVLA SEAMRPSAAT ASWHERLADP EVDRLIEQGV NGLTVIEAPN SEIEALAIAV ALREARERGQ SAALVTPDRA LARRVVAALG RWNLPVDDSG GDSLMETQAG IFARLAAEAA LHGCEPATLL ALLKHPLLRL GRAAGGWRHA IETLELALLR GTRPAAGSEG LVKEFAKYRA ELTRLKRGEL SALHPSEPRA RLGDESLDAA QELIEALRAA LAPLETVGAE PLDLCAFGRR HRDVLIALSI DHDEIAVAFE GSQGSALLKA FDDLAAVEPL SGVLVPPHDY ADVFETAFSD RIVRRPELAG AALRIYGPLE ARLTQHDRVI LGGLVEGVWP PAPRIDPWLS RPMRHDLGLD LPERRIGLSA HDFAQLLGAD EVILTYANKV GGAPAVVSRF LHRLEAVTGK ARWSAVKARG QSYLDYAQAL DRPEQVTPIA QPAPRPPREA RPLKLSVTAI EDWLRDPYTI YAKFILGLSA IDPVDMPLSA ADRGSAIHEA LGEFTELFPD ELPDDPAQVL REIGEKHFAP LMAHPEARAL WWPRFARIAA WFGNWEQARR ADGLRVFAER DGSLSIPLDG GRNFILSARA DRIEHRADGS FAILDYKTGN PPTGKQVRMG LSPQLTLEAA ILRDGGFEGI DAGSSVSELT YVKLSGNSPP GDECVLELRI ERKDERQSPD DAAAEARSKL ETLIRRFDDE AQPYHALVLS MWSRRYGRYD DLARIKEWSA AGGGAEDRL
|
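The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names `clean_sequence`, `gc_fraction` and `reverse_complement` are hypothetical. The gene sequence as displayed begins with ATG and ends with TGA, so it is already in coding orientation; `reverse_complement` is included only for mapping it back to the other strand of the replicon.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-character grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Translation map for plain A/C/G/T only; ambiguity codes are not handled.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a cleaned DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # First two blocks of the gene sequence, copied from the record.
    head = clean_sequence("ATGCGCGTTT TCAGTGTTCC")
    print(head)
    print(gc_fraction(head))
    print(reverse_complement(head))
```

Pasting the full Gene sequence field into `clean_sequence` and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.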