Gene Information |
Locus tag | RPC_4137 |
Symbol | |
ID | 3970330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4597003 |
End bp | 4600164 |
Gene Length | 3162 bp |
Protein Length | 1053 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637927241 |
Product | excinuclease ABC subunit B |
Protein accession | YP_533982 |
Protein GI | 90425612 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
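The coordinate, gene-length, and protein-length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAG). The variable and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4597003
END_BP = 4600164
GENE_LENGTH_BP = 3162
PROTEIN_LENGTH_AA = 1053


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the untranslated stop codon.
    return cds_length_bp // 3 - 1


span = span_length(START_BP, END_BP)
print("span matches gene length:", span == GENE_LENGTH_BP)
print("protein length consistent:",
      expected_protein_length(span) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: the span is 3162 bp, which is 1054 codons, giving 1053 residues once the stop codon is excluded.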
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAAGA CTCCCGACAA ACCCAACAAG CCGATCAAGA CCCCGAAATC CAAGGCGCAT CGGCCCGACG TGAAGCCGAT CGGGCCGGCG CTGGCGGAAC TGCTCAATCC CGCGATCAAT CGCGGCGACG CCGGCATGGG TTCCGGCACC GGGCTACAGC CGCCGCCGGA CAATTCCCGC GACCGCCGCA CCGGCGGCGA GGCCGCGATC CATCGCGCGC GAGCGTCGAC GACACAAGTG CCCCCACCCA ACCCTCCCCC GCAAGCGGGA GAGGGCTCCG CGCGGCGAGC GCCGCACACC GTCGCTCGCG TTGAATCCGT GGAGGGGCAG CCATCGCCCC CTCTCCCGCT TGCGGGGGAG GGCCGGGGTG GGGGCGCCAC GCGCACGACC GCGCCCGGCT TCGCCGAAGC CCCGCAATCC GACTTCGCCA CGCCGAATTA CGGCACCACA GCCACCATCC CGACGCTCGA TCCGGAACTC GCCAAGCAGC TCGGCTTCAC CACCGAGGAA GAGGATGAAG CCGCCCTCGC CCGCCCGCCG CGCAACAAGA TGGAAGCGCT CGGCGTGCAG GCCACCGCCG ACGCGCTGGA GAACCTGATC CGCGACGGCC GCCCGGAATT CAAGGGCGAG GACGGCAACG TCAAGCTATG GACGCCGCAT CGCCCGCCGC GCCCGGAGAA GACCGAAGGC GGCGTGCGCT TCGTCATCAA ATCCGAATAT GAACCGAAGG GCGACCAGCC CACCGCGATC AAGGAACTGG TCGAAGGCAT TTCGCGCAAC GATCGCACCC AGGTGCTGCT CGGCGTCACC GGCTCCGGCA AGACCTACAC CATGGCCAAG GTGATCGAGG CGACGCAGCG CCCGGCGATC ATTCTGGCGC CGAACAAGAC GCTGGCGGCG CAGCTCTATG GCGAGTTCAA GAGTTTTTTC CCCGACAACG CCGTCGAGTA CTTTGTCAGC TATTACGATT ACTACCAGCC CGAGGCCTAC GTCCCCCGCA CCGACACCTA CATCGAAAAA GATTCATCGA TCAACGAACA GATCGACCGG ATGCGCCACG CCGCCACCCG CGCGCTGCTG GAGCGCGACG ACGTCATCAT CGTCGCCTCG GTGAGCTGCA TCTACGGTAT CGGCTCGGTC GAGACCTATA CCGCGATGAC CTTTGCGCTG AAGAAGGGCG AGCGGATCGA CCAGCGCGCG CTGATCGCCG ATCTGGTGGC GCTGCAATAC AAGCGCACCC AGGCCGATTT TACTCGCGGC ACCTTCCGGG TGCGCGGTGA CGTCATCGAC ATCTTCCCGG CGCACTATGA AGACCGCGCC TGGCGCGTCA ATCTGTTCGG CGACACCGTC GAGACCATCG AGGAATTCGA CCCGCTCACC GGCCACAAGC AGGACGAGCT GGAATTCATC AAGATCTACG CCAATTCGCA TTACGTCACG CCGCGGCCGA CGCTGTTGCA GGCGATCAAA TCGATCAAGG CCGAGTTGAA ATGGCGGCTC GATCAGCTCA ACGATCAGGG CCGATTGCTG GAGGCGCAGC GGCTCGAGCA GCGCACCACC TTCGACATCG AGATGATGGA GGCCACCGGA AGCTGCGCCG GCATCGAGAA CTACTCGCGC TATCTCACCG GCCGCCGCCC CGGCGAGCCG CCGCCGACGC TGTTCGAATA CGTCCCCGAC AACGCGCTGG TGTTCGCCGA CGAGAGCCAC GTCACCGTGC CGCAGATCGG CGGCATGTTC AAAGGCGATT TTCGACGCAA GGCGACGCTG GCCGAGTATG GATTCCGCTT GCCGTCCTGC 
ATGGACAACC GGCCGCTGCG GTTCGAGGAA TGGGACATGA TGCGCCCGCA ATCGGTCGCG GTGTCGGCGA CGCCGTCGGC GTGGGAGCTG AACGAGAGCG GCGGCGTGTT CGTCGAACAG GTCATCCGCC CGACCGGGCT GATCGACCCG CCGGTCAACA TCCGTCCGGC CCGCACCCAG GTCGACGACC TGGTCGGCGA GGTCCGCGCC ACCGCGCAGG CCGGCTATCG CTCGCTGATC ACCGTCTTGA CCAAGCGGAT GGCCGAAGAC CTCACCGAAT ATCTGCACGA ACAGGGCATT CGGGTGCGCT ATATGCACAG CGACATCGAC ACCATCGAAC GCATCGAGAT CATCCGCGAT TTACGCTTGG GGGCTTTTGA CGCCCTGGTC GGCATCAACC TGTTGCGCGA AGGACTCGAC ATTCCGGAAT GCGCGCTGGT GGCGATCCTC GACGCCGACA AGGAAGGCTT TCTGCGCAGC GAGACCTCCC TGATCCAGAC CATCGGCCGC GCCGCGCGCA ACGTCGACGG CAAGGTGATC CTCTACGCCG ACCGCATCAC CGGATCGATG GAGCGCGCCA TCGCCGAGAC CGACCGCCGC CGCGAGAAGC AGGTCGAGTA CAACACCGAA CACAACATCA CCCCGGAGAG CATCAAGAAA TCGATCGGCG ACATTCTCAA CAGCGTCTAT GAGCGCGACC ACGTGCTGGT CGAAATCGGC GGCGGCGGCC AAGGCGGCAG CTGGTCCGAC GACGTCGGCG CGATCGGGCA TAATTTCGAG ACGGTGCTGG CCGATCTCGA AACCAGGATG CGCGAGGCCG CGGCCGACCT GAACTTCGAA GAGGCCGCGC GGCTGCGCGA CGAGGTCAAG CGGCTGCGCG CCACCGAATT GGCGGTGGTC GACGATCCCA CCGCCAAGCA ACGCACCGTG CAGGGCAAGG CGGGGTCCTA CGCCGGCGCC AAGAAATACG GCGCTGCTGC CAACCTGCCG CAGCAATCCA AGGAACGCGG CGGCAACAAC ACGCCGAAGG TGCGGGGGGC GACGGGCGCC GCCACGCAAC GCGACGGTTC TTCTCCCTCC CCCCTTGCGG GGGAGGGTCG GGGTGGGGGG TCTAAAGCGG CAGCCTCTCG CATCCACAAG CCCGATCTCG ACGAGATGGG TATCGCCAGC TGGCACGAGG TCATGCCGGA TCGCAAAGGC CGCGCCAAGC CGCGCAAACC GACGCTCGAC GAGATGGGCC CCGGCACCGA GAGCAGGATT TTTCAGCCGA AGACGTCGCG CGAATCCGGC CCGGAATTCG GCCCCTCGCC GCGCTCATCC GGCGGCGCGC CGGGGCACAG GGGAGGATGG AAGAAGAGGT AG
|
Protein sequence | MAKTPDKPNK PIKTPKSKAH RPDVKPIGPA LAELLNPAIN RGDAGMGSGT GLQPPPDNSR DRRTGGEAAI HRARASTTQV PPPNPPPQAG EGSARRAPHT VARVESVEGQ PSPPLPLAGE GRGGGATRTT APGFAEAPQS DFATPNYGTT ATIPTLDPEL AKQLGFTTEE EDEAALARPP RNKMEALGVQ ATADALENLI RDGRPEFKGE DGNVKLWTPH RPPRPEKTEG GVRFVIKSEY EPKGDQPTAI KELVEGISRN DRTQVLLGVT GSGKTYTMAK VIEATQRPAI ILAPNKTLAA QLYGEFKSFF PDNAVEYFVS YYDYYQPEAY VPRTDTYIEK DSSINEQIDR MRHAATRALL ERDDVIIVAS VSCIYGIGSV ETYTAMTFAL KKGERIDQRA LIADLVALQY KRTQADFTRG TFRVRGDVID IFPAHYEDRA WRVNLFGDTV ETIEEFDPLT GHKQDELEFI KIYANSHYVT PRPTLLQAIK SIKAELKWRL DQLNDQGRLL EAQRLEQRTT FDIEMMEATG SCAGIENYSR YLTGRRPGEP PPTLFEYVPD NALVFADESH VTVPQIGGMF KGDFRRKATL AEYGFRLPSC MDNRPLRFEE WDMMRPQSVA VSATPSAWEL NESGGVFVEQ VIRPTGLIDP PVNIRPARTQ VDDLVGEVRA TAQAGYRSLI TVLTKRMAED LTEYLHEQGI RVRYMHSDID TIERIEIIRD LRLGAFDALV GINLLREGLD IPECALVAIL DADKEGFLRS ETSLIQTIGR AARNVDGKVI LYADRITGSM ERAIAETDRR REKQVEYNTE HNITPESIKK SIGDILNSVY ERDHVLVEIG GGGQGGSWSD DVGAIGHNFE TVLADLETRM REAAADLNFE EAARLRDEVK RLRATELAVV DDPTAKQRTV QGKAGSYAGA KKYGAAANLP QQSKERGGNN TPKVRGATGA ATQRDGSSPS PLAGEGRGGG SKAAASRIHK PDLDEMGIAS WHEVMPDRKG RAKPRKPTLD EMGPGTESRI FQPKTSRESG PEFGPSPRSS GGAPGHRGGW KKR