Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3032 |
Symbol | |
ID | 3973485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3331379 |
End bp | 3337033 |
Gene Length | 5655 bp |
Protein Length | 1884 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637926143 |
Product | amino acid adenylation |
Protein accession | YP_532896 |
Protein GI | 90424526 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins [COG3319] Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGCAAG TGAATGTAGC TTCTTCGCTC AACGAGGTCG AGCGCCGGCT TCTGCAACAG GTCGCCTCGA CCCTTCTCCC GGAGGGGGGC GGCTGCACGT TCCGGGTCCT GGAGATTGCC GCCCCGAGCC TGGATGTATT CGATTGCCTC GAGACGATCG CCACCGACGT GGACGTCAAG CACCTCGTGA CGCATGCTTC GGTGCTGCAT CTGGCAAATC TGCGACTGCG CGCGCCGAAC TGCGCTCGCC TGCGTTTCGC CGCCTACGAT CCGACCTCTT GCGCGCCACC GGACAAAGGA CCGTTCGACC TCATCTTCGC GGCTCGTGAC GCGATCTTGC CGTCCCGCGC ACGCCAGGAA TTGGCTGCTT TGCTGACTCC GACGGGAAGG TTCGTGACTT GGCAAACCGC AGCTGCGCAG GAGCGATCCG TGATCGCCGA AGCAGACGGC GTGCGGCTGG TCGAGGTTGC GCCGTTCGTC GAAAAGGGAG TGACGCCGCC TGAGCCGACC GCCGGTGTCG ACGGCGGCGA GGCGAGTCCG AAGTTTCCGC TGACCGACAT CCAGCATGCG TATTGGGTCG GGCGCGGTGA CGTGCTCGAT CTGGGCGGCG TCGCTTGCCA CGTCTATTTC GAATGGGAGA TCCCGAATCT CGATGCTGCG AAGCTGGAAC GGGCCTGGAA CGATCTGATT CAGCGTCACG GGATGTTGCG TGCGGTGGTG ACGCCCGATG GCGAGCAATG TATCCTCCCC GACGTCCCGC ACTATACGAT CGCCATCGAG GACGTGTGCG GTCTGCCGGC GGCTGTGGCT GAATCCAGAC GACTCGAGAA GCGGGCGCGG ATGAGTGCTC AGGTTCTCGA CGCGCATCAG TGGCCGCTGT TCGAAATCTG CGTGACCCGC CTGTCCGCAC AGACATCCTT GTTACATCTC GATCTCGATC TCCTGATCGT CGATGTTCAG AGCTTCCATA TCCTGCTTTC CGAACTGGAG ATGCTGTATC GGGATTCGGC CTCTCTTCTT CCCGCTCCTT CGCTGACTTA TCGCGATTAT CTGCTGGCGA TCGTCCGCGA TCGTGAGAGC CCGGACTTCC AGAATGACCG CGCATGGTGG TTCGGCCGCA TTGCTGATCT CCCGGGACCG CCCCGCCTTC CTGCCGCAAA CCGCGGCACT GTGACTGCGA CGACGTTTCG GCGGTTGCGG CGGCGTTTGC CCACAGAGCA GTGGCAAACG CTGGAACGGA GGGCGCGGGC GCACGGCATC ACATGTTCAT CGTTGCTGCT GGCCGCGTTC TCGGAAGTGC TGGCGCGGTG GGTTTCCGAG CCGCGTTTCC TCCTGAACCT GACGCAGTTC GACCGTCGCC CGCTGCATCC CGACGTGATG AGGCTGGTGG GCGATTTCAC GTCGGTTCTG CTGATCGATA TCGATGCGGC AGGCCGTGTC GGCTTCGTCG AGCGCGCTCG CGTGGTTCAG GCGACATTGT GGCGCGCTCT TGCGCACGCG CGCTTCAGCG GTATCGAGGC GCTTCGCGAG ATCGCGCATC GACGCGGTGA AAGGGCGGAT CGGCTCATGC CGATCGTTTT CACCAGCCTG CTCGGCATCG ATATCGATTC GCTCGTTCAT CGAAATGGGG GAGGGGGCAT GCTGGGCGAG CCGTGTCATC TCTACACCTG CACGCCGCAA GTGTCGCTCG ATCATCAGGC GATGATCCGC AACGGCAGCC TGGAATACAA TTGGATCGTC ATCGACGACG TATTTCCGCC GGGTGTCGCC GACGCCTTGT TCGAAGTATA TGGCGCGTTT CTCGATCACC TCGCCAGCGA GAACGCGGAC TGGACCGAGC CGTCGCCGGA CGTCCTGTTG TCCGGCGTGG AAGTAGAAGA GCGGGCCCGC GCGAACGGGA CCGCGGTGCC GATGAGACTG GCGCGGCTCG AGCGCCTGTT TCTGGAGCAG GCGTCGAGAT CCCCCGATGC GATCGCAGTG ATCGCGCCCG ACGCTGAAAT GACCTACGGC GCTCTGCGCG AGCGTATGGA GAGGTTCGCC GGGGCGCTCA CGGCGATGGG GGGTGGTCCG GGAGAGCCGA TCGGCGTCGC GCTGCCGAAG GGCGCCGATC AGATCGCCGC AGTACTCGCG ATTTTGCATG TCGGTGCGTT CTATGTGCCG ATCTCTCACG ACATGCCGAA TGAACGGATC GGGCTTGTCG TGGCGGGCGC CGGCATGAAC AAGTCGTTCG GCAATCCCGA CGCCCGCCGA TGGCCGAACA CGCTTCATGT GATCGATCCG AAGCGAGCAG CGGCTGCCGA CCGAGTGGCG CCTCGGTGCG AAGCGTCGCT CGACGATCCT GCCTACGTCA TCTACACGTC GGGGTCGACC GGCGTGCCAA AGGGTGTGAC GGTGACGCAT CGCGCTGCGG CCAACACTAT CGTTGACGTC AATAACCGCA TCGGCGCCTC CGCCGGAGAT CGTGTTTTCG GGATCTCGGC ACTCGGTTTC GATCTGTCAG TCTACGATAT TTTCGGGACG TTGGTGGCCG GCGCAGCGCT CGTGCTCCCG GCAGAGGAAG ATCGGCGTGA GCCGGACGCA TGGCTCGGCC GCCTGGTCGA CACGGGCGTG ACGATCTGGA ATTCGGTGCC GGCACTGATG CAGATGCTGG TCGAGCATGT CGAGGCGAAA CGTAGTTTGC TTCCGCAGCT TCGCTGGACC TTGTTGTCGG GCGACTGGGT TCCTCTCGGA TTGCCGGACC GGATACGCGC GGTGGCGCCG GGGAGCCGGA TGGCTGCACT CGGCGGCGCG ACCGAAGCCG CGATCTGGTC GAACTGGTAT GAGATTGGCG AATTGAGTCG CGATTGGCCG TCGATCCCGT ACGGCTTTCC ATTGGCCAAT CAGCGCTACC ATATACTCGA CGACGAATTG CGCCCACGTC CGAATTGGGT CGAGGGCGAT CTCTTCATCG CAGGTGATGG ACTTGCATCT GGTTACTACG GCGATCCGAA GCAGACTGCT CGCGCGTTCT TCGAGCATCC TCGGACGGGT GAGCGGCTCT ACCGTACCGG AGATCGCGCA CGCTATCGAC CGGGGGGCAT CATCGAATTT CTCGGCCGCC GTGACCATCA GGTCAAGATC AACGGCATGA GGATCGAACT CGGTGAAGTC GAAGCCTGCC TGGTCTCCCA CCCGGACATC GAGGCCGCCG TGGTCGAGGC CGTCGACATC GGCAGAGCGC GCAAGCTGGT GAGCTATGTC GTGCCGTCCG CCGCGCCTCG TGAATGTTTC ACGCAACGGT CGGTCGATGC CGCGGATGCC GTCGAACATT GGGACGTGGC GACCACCGCC ATGGCGGAGG CCGCTGGTGA ACAGGTCGAG AGTACCGTGC TCGCGTCGCT GCGCGATTTC GAGCGGATCG GGGAAGAGCT GTCGAATGCG GCGATCCGCC GGGCTCTCGT CGCGATCGGT CTGGGCGACC GAACGAGCCT CGATCCGAAG CGAATCGCGG ACGACGTACC GCATTACGCC GGCCTCGTGA GGCAATGGCT GGATGCCTTA ACACAGGAAG GCACCTACCG CCGGGAGGGC GATCGTTTCG TCCGGTCGGG GCTGCTCGGG GATCCGGACG AACTGGATCG TCGTATCGAG CGCCGGATCG CGGAGTTGAG AACCAGGCTG TCGTGGACAA GGCAGGGCGG GGCGCTGGTC GACTGGATAA GCACGTGCGT GCGTCGTTTG CCGGACATCG TGCGGCGGCA ACCGGCGGAG GCGCTGGAAT TGCTGTTCCC GGACGGGGAT TTTTCGCGCA GCGAAGCATT GTACCAGGAC AACGTGATCG CGTCCTGCCT TGGCCGAGCC GTCGCAGAGG GCGTCGCGGC GCGCGCTGCG GCGTGCCGTG GTCGCCTGTT GCGGATCATG GAGATCGGTG CGGGGGTCGG CGGGTTGACG TCCTATGTCT TGCCGCGTTT GGCGGGCGTC GATTGCGAGT ACGTCTACAC CGATGTCAGT CCCGGCTTCC TGAAGATGGC GAAGGAAAAA TTCGGTGCGT ATGGGTTTGT CCGCTACGAT CTCTACGACA TCCAGCGAGC GCCGGAAGAG CAGGTCGAAG CCCTGCACGG GTTCGATATC GTGCTCGCGG CCAACGTGCT GCATAACGGT CGGGACGTCG TGCGCACCCT GTCTGACGTG AAGCGGCTTT TGTGCCCCGG AGGTATGCTT GCTGTCATCG AGGCGACGCG CAACAAGACA CTGCAACTGG TCACCGGCGG GCTGATCGAA GGGCTTCATG CCGAATTCGA CGATGATCGG CGCGACAGCG GCCTGCCGAT GTTGAACGCG GAGTCATGGT GCCGCGCGTT GCGCGAGGCC GGATATCCGG ACGTCGCCGT GCCCGCGGAA GGACAGTCGA TCGCAGCTTT CGGTCAGCAG GTGATCGTTG CACGTGGACC GGAGACGATC GTTACCGTCG AAACTGCACG CATCGACGCG TATCTGCGTG GCAAGCTGCC AGCCTATATG GTTCCGAGCC GGCATGTACG AATCGATCGG CTGCCCTTGT CCCTTAATGG AAAGATCGAT CGAAGCCGCT TGCCGCGGCC GGTCGTGCCT GAGGCGTGGC CTGGACAACT GACGAGCGAG CCGCCACGGG AGGGTATCGA AGCTGCGATT GCAGGGGTGT GGCGAGAGCT CCTGGGATGC AAGGAAGTGT CCCGTACCGA CAGTTTCTTC AATCTCGGCG GTGATAGCCT GATCGCGACA CGGTTGGCGA GCAGACTCCG GAAGGAGCTC GGCCGGGACG TCGCATTGCG GCTTCTGTTC GATCATCCGG TTCTGGCGGA CCAGGCGGTC GCGATCTCCG AGCAGGGGAT GTATCGTCGT AGCGCACCGG TCCTGTTGAC GTCGGGACCG AAAGGACCTT TGGTTTGCCT TCATGCCAGC GATGGATATG CCGCGGCCTA CCGTCCGCTG GCCGAGCGCC TGCAGGGCGT TTGCGTCCTG TCGGCGGTCG ATGCGCCGGG ACTTCTCGCC GACGAGAGGC CGCTCGAATC CTTGAGTGCT CTTGCTGCTT ATCATCGCGG TGCGTTGGGA CAGATGCCTG GATCGGGTTG GCAGCTTCTC GGCTGGTCGA TGGGAGCCCA CACAGCGTGG AAGCTTGCAG GAGATCTGAT CGCGGCTGGC GAGCGAGTGA TGCGGCTGGT GTTGATTGAT CCCTCTCCAA GAGGGCCTTT CGAGGCGGCG ATACGTTCGC CGGGGGCACT GCTCGAAAGC TGCGCGAGTG ACGTTCTACG CTCCGAACTG CTCGCACAAG GCCTCACGAC CGAGGCGCTC GACGGAATGC CCGATGCGGA CCGTGTCGCG GTGTGGCGAA GAGTGCTGAG CGGACGTGGC CTGCCGGATT CGCTTCTCTC CGATGATGAT GCCCTCGGCA GGATGATCGC CGTCATGGCC GCCAATCTCG CCGCGATGGT ACAAGCGAGG CTGGCACCGC TTCCTCCTGG TCCCGAAGTG GTGGTCTATA CGGCGACACG CCGGATGCCG AATTGGGGTG AGCCATTGAT GGATTGGTCG ACGCTTTTCC CTCGGTCCAC GCATCATGTG GCGATCGATG CCGATCATTG GTCGATTCTG GCGTCTGATG TTTTGACGAA GGATCTGGCA GGGCGCGTTA CATAA
|
Protein sequence | MPQVNVASSL NEVERRLLQQ VASTLLPEGG GCTFRVLEIA APSLDVFDCL ETIATDVDVK HLVTHASVLH LANLRLRAPN CARLRFAAYD PTSCAPPDKG PFDLIFAARD AILPSRARQE LAALLTPTGR FVTWQTAAAQ ERSVIAEADG VRLVEVAPFV EKGVTPPEPT AGVDGGEASP KFPLTDIQHA YWVGRGDVLD LGGVACHVYF EWEIPNLDAA KLERAWNDLI QRHGMLRAVV TPDGEQCILP DVPHYTIAIE DVCGLPAAVA ESRRLEKRAR MSAQVLDAHQ WPLFEICVTR LSAQTSLLHL DLDLLIVDVQ SFHILLSELE MLYRDSASLL PAPSLTYRDY LLAIVRDRES PDFQNDRAWW FGRIADLPGP PRLPAANRGT VTATTFRRLR RRLPTEQWQT LERRARAHGI TCSSLLLAAF SEVLARWVSE PRFLLNLTQF DRRPLHPDVM RLVGDFTSVL LIDIDAAGRV GFVERARVVQ ATLWRALAHA RFSGIEALRE IAHRRGERAD RLMPIVFTSL LGIDIDSLVH RNGGGGMLGE PCHLYTCTPQ VSLDHQAMIR NGSLEYNWIV IDDVFPPGVA DALFEVYGAF LDHLASENAD WTEPSPDVLL SGVEVEERAR ANGTAVPMRL ARLERLFLEQ ASRSPDAIAV IAPDAEMTYG ALRERMERFA GALTAMGGGP GEPIGVALPK GADQIAAVLA ILHVGAFYVP ISHDMPNERI GLVVAGAGMN KSFGNPDARR WPNTLHVIDP KRAAAADRVA PRCEASLDDP AYVIYTSGST GVPKGVTVTH RAAANTIVDV NNRIGASAGD RVFGISALGF DLSVYDIFGT LVAGAALVLP AEEDRREPDA WLGRLVDTGV TIWNSVPALM QMLVEHVEAK RSLLPQLRWT LLSGDWVPLG LPDRIRAVAP GSRMAALGGA TEAAIWSNWY EIGELSRDWP SIPYGFPLAN QRYHILDDEL RPRPNWVEGD LFIAGDGLAS GYYGDPKQTA RAFFEHPRTG ERLYRTGDRA RYRPGGIIEF LGRRDHQVKI NGMRIELGEV EACLVSHPDI EAAVVEAVDI GRARKLVSYV VPSAAPRECF TQRSVDAADA VEHWDVATTA MAEAAGEQVE STVLASLRDF ERIGEELSNA AIRRALVAIG LGDRTSLDPK RIADDVPHYA GLVRQWLDAL TQEGTYRREG DRFVRSGLLG DPDELDRRIE RRIAELRTRL SWTRQGGALV DWISTCVRRL PDIVRRQPAE ALELLFPDGD FSRSEALYQD NVIASCLGRA VAEGVAARAA ACRGRLLRIM EIGAGVGGLT SYVLPRLAGV DCEYVYTDVS PGFLKMAKEK FGAYGFVRYD LYDIQRAPEE QVEALHGFDI VLAANVLHNG RDVVRTLSDV KRLLCPGGML AVIEATRNKT LQLVTGGLIE GLHAEFDDDR RDSGLPMLNA ESWCRALREA GYPDVAVPAE GQSIAAFGQQ VIVARGPETI VTVETARIDA YLRGKLPAYM VPSRHVRIDR LPLSLNGKID RSRLPRPVVP EAWPGQLTSE PPREGIEAAI AGVWRELLGC KEVSRTDSFF NLGGDSLIAT RLASRLRKEL GRDVALRLLF DHPVLADQAV AISEQGMYRR SAPVLLTSGP KGPLVCLHAS DGYAAAYRPL AERLQGVCVL SAVDAPGLLA DERPLESLSA LAAYHRGALG QMPGSGWQLL GWSMGAHTAW KLAGDLIAAG ERVMRLVLID PSPRGPFEAA IRSPGALLES CASDVLRSEL LAQGLTTEAL DGMPDADRVA VWRRVLSGRG LPDSLLSDDD ALGRMIAVMA ANLAAMVQAR LAPLPPGPEV VVYTATRRMP NWGEPLMDWS TLFPRSTHHV AIDADHWSIL ASDVLTKDLA GRVT
|
| |