Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3780 |
Symbol | |
ID | 3911583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4314012 |
End bp | 4317932 |
Gene Length | 3921 bp |
Protein Length | 1306 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885681 |
Product | hypothetical protein |
Protein accession | YP_487385 |
Protein GI | 86750889 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAGC CATCGTTAAC GGCTTCCGTG GTGGAGTGCG ATGCGGATGG CCGCGCCGTG TCGGCGCGTG GCGACGTGCG AGCAATGCGA GAAGGCGGGG CGCGGCCCGC CCGGCAGAAC AGGGTCGTTG ACCGGAAGAT ACGAATGGCG CGGATGGCTG CCGCTGCAGT TTGGTCGCAG GCGCTCTGTC GGCGCCTCGC ACGAGTCTGC CTGGCGTTAG GGCTGGTGCT CGCCGTCGCG CTGTCGCCCC GGCCGGCGTC CGCGCAGGCG GTTCGCGGCG AAGCGATTTT CGAGTCGGGC GGAGGCTACG GGCGGCTGCT GTTCAAGCTC GCCGAGGACG TCGAGTCCGA CGTCGTCATG GCCGGCCTGA TCGTGGTGAT CCGGTTCAAG CGGCCGGTGG ATATTTCGAT CGACAAGCTC GCGGAGTCCG CGCCGAACTA CATCGGCTCC GCGCGGCTGG ATCCCGACGG CTACGCGGTT CGGTTGGCGC TGCTGCGCAA GTTCAGCGTC AATCCGATGT CCGCGGGCGA GCGGCTGTAT ATCGATTTTC TGCCGGACAA TTGGGTCGGC GCGCCGCCCG GCCTGCCTCC GGACGTCGTC AAGGCGCTGG CCGAACGCGC CCGCGCGGCG GAGCGCGCGC TACGCGCCCA ACAGGCCAAG GCGGATGTCA AGAAGAAGCC GCCGATCCGC GTTCGGGCGT CGATGCAGCC GACTTTCGTG CGCTACATCT TCGAAGTGCC GCCCGGCGTG CAGGTGTCGT CGACGCTGAC CGACAAGAAG CTGACGGTGC TGTTCAACAC CGGACTGAGT TTCGATCTCG CCGATGCACA ACTCGCCGGC GCGCCGAATG TCGGATCGAT CGGCCAGAAG ATCGACGGTG ACGGCTCGAG CGTGGACTTC GCGCTGATTG GCGGCGCCGA CGTGCGTTCG TTTCGTGAGG ACAAGAACTT CATCGTCGAC GTCAGTTTTC AGCCGCAGGA CGCCGAGCCG TCGAAGAAGA CGTCGCAGGC GCCCATCCTG CCCGAAATCG CCCGGCTCGA ACGCGAAGCC GCGCCCGCCG CACCGCGCGC CCCGGATGTC GCCCGAGCCG AGGCGAAGCC TGAGACCAAA TCCGATGGCA AATCTGAAGC GAAGTCCGGT ACCAAGCCGG AGGTGAAGCC GGCTGCCGCC GCAGCGACGC CGCCGGTGAT CGCGCCGCCC CAGGCCGTCG CATCACCTGC CGGCCCGGCG GCGCGCAGCG ACGCTCCAGC CCCTGCCGCG GTCGCGGCAC CGGCTGTCGC GGCGCCTGTG GTCACGGCCC CGGCCGCGCC GGCTCCGGCC ACGGTGGCGC CGGTTGCGGC GCCGCCTCCC GCAGCGAGCG AGACAACTCG GCCGTCGACG GAGACGGCCG TCCGCGCCGG CGCTCGCGCC GCGGCGGTCG AGGCTCCGAA GCTGCCACCT GCGCCCGAAC CGCCGGCGGC GGCGGCGAAC CCGGCTTTAT CCAGCGCAGC ACGACCGGTC GAGGCGCGCC GCAATACCGA TGGCCTGCAC CTCGCATTCG CGTTCGCGGC GCCGACGCCG GCCGCGCTGT TCCGGCGCGG CGACATCATC TGGCTGGTAC TCGACAACAC GACACCGTTC GACCTCACTT CGATCCGCCG CGAAGGTGGC GGCATCGTCG GAGATGTCAG CCGCGTCGAG CTCCCGAACG GGCAAGCGAT CCGCCTGCGT CTCGATCGTC CGCAACTCGC GACGCTGAGC GACGACGACG GCTCCGGCAA GAACTGGTCG ATCACGCTCG CGGATTCGGG GCGCGGCGCC GCGCGGCCGT TGACCGCGGT GCGCAACATC GCCGATCCGG CCCGCGCCAC CGTTGCCGTG GCGCTCGCCG GCCTCGGCCA GATGCATCGC CTGACCGATC CGGAGGCCGG CGACGCGCTG ACGGTGATCA CCGCGCTGCC GCCGCCGCGC GGCTTCATCA AGCGGCAGAG TTTCGTCGAG TTCAGCCTGC TGGAATCGCT GCACGGTGTG GTCATCGAAC TGAAATCCGA CGACGTCACC GTCGAGACCG TCGCCGATGC GGTGGTGCTG TCGCGGCCCG GTGGACTGAC GCTGTCGTCG GCCGAGCCGG CCGGGCAGGC CGGCTCTTCG GCAGAGCGGC CGTTCTTCGA CATCACCCAA TGGGCCAAAG ACCAGGAGGG GCGTTTCTCC GACGCGCTCG ACGCGCGGAT CAGGACGGCG TCGACCGCCA CCGGCGACGA CCGGCTGCCG GCGCGGCTCG ATCTGGCGCG GTTCTTCATG GCGCGCGGTC TGTATCACGA AGCCAAGGGC GCGCTCGACC TGGCGCTGCT CGGCGTCAAG CCCGGACAGG AAGACGTCGC GACGATGATC GGCCACGCCG CGGCGAGCGC GCTGATGCAG CGGCCGGAGC AGACACTGAA GGATGTCGCC AATCCGGTGA TCGCCTCGAC CTACGACGCG CAATTGTGGA AAGGCGTCGC GCTGGCGGGC CAGGGCAAAT GGCCGGAGGC GCGCGAGAAG CTGAAATCGG TGCAGTTCGC GATTACCGCG CTGCCGCTCG ACATGCAGCG CGAGGTGCTG GCCACCGCGA TGCGCGCCTC GCTGGAAGTC CGCGACTACG CCGGCGCCGC CAAGATCAGC AGCGATTTCG ATCTGGTCGG CATTCCGCCG GAGATGAAGC CCCCGCTCGC GGTGATGCGC GGCTGGCTCG ACGAGGCGCT CGGGCGGGAT CCGGAGGCGC TCAAGAGGTA CAAGGAAGCG ATGGCGTCGG CCGATCGCCA GGCCGCCAGC GAAGCCAAGT TCCGCGACGT CGTGCTGCGC AGCAAGCGCG GTGAGATGAC GCCGGAGGAA GCGCTGCCCG AGCTCGAACG GCTGTCGACG ACGTGGCGCG GCGACGATCT CGAAGTCCGC ACCCAGCAAT TGCTGTCGAA GCTCTACGCC AATGCCGGCC GCTACCGCGA TTCACTGGCG GCGGCGCGGA CCGCGACGCA ACTCGCGCCG AATTCCGAAT ATGCCCGCCA GGCGCAGGAC GACAGCCGGG CATTGTTCTC GCAGCTGTTC CTCGGCAACA AGGGCGACGA CATTCCGCCG ATCGAGGCGC TGGCGACGTT CTACGAATTT CGCGAGCTGA CGCCGATCGG CCGCCGCGGC GACGAGATGA TCCGCAGGCT CGCCGACCGC CTGGTCGCGG TCGATCTGCT GGATCAGGCG AGCGAGCTGT TGCAGTACCA GGTCGACAAG CGGCTCGAAG GCGCCGCGCG CGCCCAGGTC GCGGCGCGGC TGGCGATGGT CTATCTGATG AACCGCAAGC CGGATCGCGC GATCGCCGCG CTGCATTCGT CGCGAATCGC CGATCTCGCC GGTGAATTGC GCCAGCAGCG GCTGCTGCTC GAGGGGCGGG CGCAGAGCGA CATCGGCCGC CACGATCTCG CGCTCGACAT CATCACCAAT ATCAGTGGCC GCGAGGCGAT CCGGCTGCGC TCCGACATCT ACTGGGCGTC GCGGCGCTGG CGCGAATCCT CCGAACAGAT CGAACTGTAT CTCGGCGACC GCTGGCGCGA TTTCACGCCG CTGTCGCAGG CCGAGAAAAG CGACGTCATC CGCGCCGTGG TCGGCTACGC GCTGGCCGAG GATGCGCTCG GTCTCGACCG TTTCCGAGAG AAGTTCGCGC CGTTGATGAC CGACCCGGCC GACCGCGCCG CGTTCGACAT CGCCAGCAGG CCGGCCGCGG GCGATACCGC GGCGTTTGCC GCGATCGCCA AGATGGCGGC GAGCGTCGAT ACGCTGGAGG GCTTCCTGCG CGAAATGAAG CAGCGCTTCC CCGACGCCAG CGCCCGCGCC ACGCCGCCCG GCGCCGACAT GACGTCGACC GGCGCGCTGC CGGAGATCCC GAAGATCCGC GTCATCAAGA TGACGCGGTA G
|
Protein sequence | MVKPSLTASV VECDADGRAV SARGDVRAMR EGGARPARQN RVVDRKIRMA RMAAAAVWSQ ALCRRLARVC LALGLVLAVA LSPRPASAQA VRGEAIFESG GGYGRLLFKL AEDVESDVVM AGLIVVIRFK RPVDISIDKL AESAPNYIGS ARLDPDGYAV RLALLRKFSV NPMSAGERLY IDFLPDNWVG APPGLPPDVV KALAERARAA ERALRAQQAK ADVKKKPPIR VRASMQPTFV RYIFEVPPGV QVSSTLTDKK LTVLFNTGLS FDLADAQLAG APNVGSIGQK IDGDGSSVDF ALIGGADVRS FREDKNFIVD VSFQPQDAEP SKKTSQAPIL PEIARLEREA APAAPRAPDV ARAEAKPETK SDGKSEAKSG TKPEVKPAAA AATPPVIAPP QAVASPAGPA ARSDAPAPAA VAAPAVAAPV VTAPAAPAPA TVAPVAAPPP AASETTRPST ETAVRAGARA AAVEAPKLPP APEPPAAAAN PALSSAARPV EARRNTDGLH LAFAFAAPTP AALFRRGDII WLVLDNTTPF DLTSIRREGG GIVGDVSRVE LPNGQAIRLR LDRPQLATLS DDDGSGKNWS ITLADSGRGA ARPLTAVRNI ADPARATVAV ALAGLGQMHR LTDPEAGDAL TVITALPPPR GFIKRQSFVE FSLLESLHGV VIELKSDDVT VETVADAVVL SRPGGLTLSS AEPAGQAGSS AERPFFDITQ WAKDQEGRFS DALDARIRTA STATGDDRLP ARLDLARFFM ARGLYHEAKG ALDLALLGVK PGQEDVATMI GHAAASALMQ RPEQTLKDVA NPVIASTYDA QLWKGVALAG QGKWPEAREK LKSVQFAITA LPLDMQREVL ATAMRASLEV RDYAGAAKIS SDFDLVGIPP EMKPPLAVMR GWLDEALGRD PEALKRYKEA MASADRQAAS EAKFRDVVLR SKRGEMTPEE ALPELERLST TWRGDDLEVR TQQLLSKLYA NAGRYRDSLA AARTATQLAP NSEYARQAQD DSRALFSQLF LGNKGDDIPP IEALATFYEF RELTPIGRRG DEMIRRLADR LVAVDLLDQA SELLQYQVDK RLEGAARAQV AARLAMVYLM NRKPDRAIAA LHSSRIADLA GELRQQRLLL EGRAQSDIGR HDLALDIITN ISGREAIRLR SDIYWASRRW RESSEQIELY LGDRWRDFTP LSQAEKSDVI RAVVGYALAE DALGLDRFRE KFAPLMTDPA DRAAFDIASR PAAGDTAAFA AIAKMAASVD TLEGFLREMK QRFPDASARA TPPGADMTST GALPEIPKIR VIKMTR
|
| |