Gene Information |
Locus tag | RPB_1637 |
Symbol | |
ID | 3909914 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1845713 |
End bp | 1852096 |
Gene Length | 6384 bp |
Protein Length | 2127 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637883531 |
Product | VCBS |
Protein accession | YP_485256 |
Protein GI | 86748760 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01965] VCBS repeat |
|
|
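The coordinate and length fields in the gene information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are illustrative and not part of the source database.

```python
# Values copied from the gene information table.
START_BP = 1845713
END_BP = 1852096
GENE_LENGTH_BP = 6384
PROTEIN_LENGTH_AA = 2127


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length are consistent")
```

Under these assumptions the listed values agree: 1852096 - 1845713 + 1 = 6384 bp, and 6384 / 3 - 1 = 2127 aa.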
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGC TGATCAAGAT GGCCCAACTG CAATTGTCGG AAGCGACGAC CAGCGCGGTT GGTCTGAAGA CGTTCAAGGT CTCCAAGCCC GAAACGGGGC AAGCTCTCGC GATTCAACTC GATGGAAAAT CCGTCGTCGA TCTGACGAGC TTCGCCGACG AGAAGATCAC GCTTGTGCGG CTGGGCAATC GGCTGGTCAT CCTGTTCGAC AACCAGTCGA CCGTCACGAT CAATCCATTT TATGACGCGC TCGGCCGACC GATCGCTGAT GTCTCGCTTC AGGTGTCGCA GGGGCAGCCG ATCAGCGGCG CCGAATTTGT GGCTTCGTTC CCGATCTCGA CGGACCAGTC CATTCTGCCG GCCGCGGGAG CTGGTGGTCC CGCGCCGGTG GCGTCGAGTG CAAGCTTCTC TGACATCACT ATCGATCAGC TCAACCCGTC AGGAAATGGA ATTGGTTCGC GCGCTCCGGT CGAGACCGGA GTGGTGACAA CCACGATCCG TGACGTCTTT GAGCCAGGCA GTGGATCTCC GTTAGGTGAT ACCAACGACG CAGCTGTCAT CACCGGAACC AGCACGGCGT CGCTGCTCGA GGCCGACGTG GTGCTGGGCA CCGGCGGCAC GCTGGTGGCG AGCGATCCCG ACAGTTCGAA CGCCTTCGTG CCGCAGGCCG CGGTGGCCGG TAGCAACGGT TACGGCAACT TCAGTATCGA CGCTGCGGGG GTGTGGACCT ACGCCATGAC CACCCCTCAC AACGAGTTCG TAGGCGGCGT CGATTACACC GACAGCGTGA CGGTGGCGAC AGTCGACGGC ACCACCCAGG TCGTCACGGT GACCATCACC GGCACCGACG ACGCGACGGT GATCACGGGA GCGAGCACGG CGACGCTGAT CGAGGCCGAC GTGGTGCTGA GCACCGGCGG CACGCTGGTG GCCAGCGATC CCGACAGTTC GAACGCCTTC GTGCCGCAGG CCGCGGTGGC CGGCAGCAAC GGTTACGGCA ACTTCAGTAT CGACGCTGCG GGGGTGTGGA CCTACGCCAT GACCACCCCT CACAATGAAT TTGTGGGTGG CGTCGACTAC ACCGACAGCG TGACGGTGGC GACCGTCGAC GGCACCACCC AGCTCCTCAC CGTGACCATC ACCGGCACCG ACGACGCGAC GGTGATCGCG GGGTTCAGCA CGGCGACGCT GACCGAGGCC GACGTGGTGC TGAGCACCGG TGGCACGCTG GTCGCCAGCG ATCCCGACAG TTCGAACGCC TTCGTGCCGC AGGCCGGGGT CGCCGGTAGC AACGGTTACG GCAGCTTCAG CATCGACGCC GCCGGGGTGT GGACCTACAG CCTGACCACC GCCCATGACG AGTTCGTGGG CGGCTTCGAC TACACCGACA GCATGACGGT TGCGACGGTG GATGGCACCA CGCAGCTCGT CACGGTGACC ATCACCGGCA CCAACGACGC GGCGGTGATC ACGGGGACCA GCACGGCTTC GCTGGTCGAG GCCGACGCCG TGCTGACTAC TGGCGGCACC CTGGTCGCGA GCGATATCGA CAACTTGAAC GCATTCGTGG TGCAGACCGG GGTGGCCGGC AGCAATGGCT ACGGCACCTT CAGCATCGAC GCTTCGGGCG CGTGGACCTA CAGCATGACC ACCGCCCACG ACGAGTTTGT GGAGGGCGTC GACTACACCG ACAGCATCAC GGTTACGACG GTCGACGGGA CCACGCAGCT CGTCACGGTG ACCATCACCG GTACCGACGA CGCGACGGTC GTGTCGTCGG CCGATGTGTC GGTTGCTGAG 
GGCAATGCGC CGATTTCGAC CGGCGGCACG CTCACGATCA GCGACGTCGA CAGCGCGGTG ACCTTCGTGG CGCAGCCCGG CACGCCCGGA GCGCACGGCA TTTTCTCGAT CGGGACGAAT GGCGCCTGGA CCTACGTGGC GAACGACGCG TTCGACGGCC TGAATGTTGG GCAGAGCGTG AGTGACACCT TCTCGGTGTC GGCAGCGGAC GGCACGCTGA CGTCGGTGCA GGTGACGATC ACCGGGACCA ACGACCTGGC GTTCGTGTCG TCAGCCGATG TGTCGGTTGC GGAGGACAAT GCTCCGATTT CGACCGGTGG CACGCTGACG ATCAGTGACG TCGACAGCGC GGCGACCTTC GTGGAACAGT CCGGCACGCC CGGAGTGCAC GGCATTTTCT CGATCGGGAC GAATGGCGCC TGGACCTACG TGGCGAACGA CGCGTTCGAG AGCCTGAGTG TCGGGCAGAG CGTGAGCGAC ACCTTCTCGG TGTCGGCAGC GGACGGCACG CTGACGTCGG TGCAGGTGAC GATCACCGGG ACCAACGACG CGCCGGTGAT CACGGTTGCG AGTCCCGACA GCTCGTCCGC ATCGCTGACG GAAAGCGACG CTGCATTAAC TGCCTCAGGC ACACTGACAG TGACGGACGT CGACGTCAGC AACACCGTCA CCGCCTCGGT GCAGAGCGTC GTGGTGGCCG GCCCAAGCGG TGGCCTTAGC AGTGCGGCTC TGCTGGCGAT GCTGGCAGTG CCGGGCGCCG CGATCGCGGC GGACCCGACC GACGCGCATA ATCTGGCCTG GACGTTCAAC TCGGCGCCGC AGGCGTTTGA CTTCCTTGCG CAGGGTGAGA CCCTGAGCCT GACCTATACG GTCGGAGTGC AGGACGGCGC TGGCGGCTCC GACACGCAGA CTGTGACCGT AACCGTTGCG GGCACTAATG ACACGCCGGT TTTGATCACG GGTCCTGGCT ACAGCGAACC GGCTTCGCAA ATGACCGGGG CGGTCACCGA AGATGCGGCG GCCAATCAGG TCCGCGGCAT CGTCAATGTT TTCGATGCCG ATAACGGCCA TGGTCTGACT GCCGTCATCG ATAATCCGAA CGGCACTTAC GGCACGCTCG CCCGGGACAC CTGGGTGGAC GGCAATGGTG ACACCCAGGA CGGCTGGGTC TACACGCTCG ACAATACGCG GAGTGCGACG CAGGCCCTGA AGGAAGGCGA GGTTCAGACA GAGACCTTCA CCTTCAAGGT GATGGACGAG CACGGCGCGA TTGACACGGA GACCGTGACC ATCACGGTCA CCGGCAATAA CGACGCGCCG ATAGTCGCGC ATGCCATCAC CAATCAGACG CTGCTGGAGG ATCACGCCTG GACGTTCCAG GTTCCGGCAG ACACTTTTGC CGACGTCGAC GGTCCGCCGC TGGAATATTC GGCGTTCCTC GCGAATGGTG ATCCGCTGCC CGATGGCATG CATTTCGATC CGGCGACTTT GACGTTCACC GGCACGCCTC CGCTGAACGA GACCCATTCG CCAGATCTGA AGGTTGTGGC GTGGGACGGC GAGTTTTCGG CCGAGGCGTA CTTCACGCTG AATATCACGC CGGTTAACGA TGCGCCGGCG GGCACCAACG GTGCGGTGAC GACGCTGGAA GACATCGCCT ACACCCTCAC GCTTGCGGAT TTCGGGTACA CCGACGCGAA CGACGATCCG GCGAATACTC TGCTGGGCGT GAAGATCACC ACGCTGCCGG GCGCCGGAAC GCTGGCGCTC GACGGCCATC CGGTGACGGC GGGAGCAACG GCGTCGGCTG 
CGGATATCTT GGCCGGTAAG CTGCAATTCG TGCCGGCGGC GAATGCCAAC GGCAGCAACT ATGCGAGCTT CACGTTCCAG GTGCAGGACG ATGGCGGAAC CGCCAATGGC GGCGTTGACC TCGATCCGTT AGCCAATACG CTGACCATCA ACGTCACACC GGTCAACGAT GCGCCGGTGC TGTCGGGTCA CGAAAATCCG AGCGCGATTG CAGAAGACAG CGGCGCTGCT GGTGCTGCCA CGACGGTGGC GCACCTTCTC GGCCTCGACG ACGTCAATCC GGCGAACGAC CATGCCATCG ATGTCGATGC CGCGTCTCTC GGCATCGTCA TCACGGCGGT CGATGACTCC CACGGAGTCT GGCAGTTTCA ATCCGGCGGC GGCACGTGGA CCGATGTTGA TCTCGCCCCG GGCGAGGGGC TTCATCTGGC GCAGACTGAT GCTCTTCGCT TTATCCCTTC GCGGAACGTC CAGACAACCG ATGCCACGCT GTTCGATGCG TCGGGCAACC CGGTCCTGGG GTCGACCAAC CCCTCTTTTG TTGCCGAGCC CCCCGGCCTC ACCTTCCGTG CCTGGGATAT GTCCAATGGT ATCGCCGCAG GAATCACCGC GCTGGTTGTG GGCTTCGGCG GGACCTCGGC CTATAGCGCC ACGGCCGAGA CGGCATCGCT GGTTGTCACC GGAACGCTGG ATCGTGTCTT CACGAACGGA GACGATACGG TCGACCTGAC GGGTCTGTCG AATGATCCGG CGGTCGACGC TGTCTGGTTC GAAGACCAGA ACTTCTTCGA TGCAGGGGAG GGTGACGATA ACATCGTGCT GCCGCGCATG GCGGGTCCGC CGGGTCCAGA TCCGCTCGCT CCCGTGTACC AAAACCAGAC GTTCCATTTG GGGGCAGGCG ACGATACCGC AGACGCGAGC GCGACCATCG GCATGACCAT TCTCGGCGGC GACGGTGCGG ATCGTACGAT CGCTTCGGCC GAGACCACCG ATCTTGCCTT TGCGGGCGGG GGCGACGAAT CGACCGATGT CTTGCAAGTC GACGCGATTG CCGACGCGAT CGTCGACAAG ACCGGCACAC TGAGCGGCAA CTACAATGCA TTCGATGTCG GTTGGAACGG TTCGCGTCAT GTGGTCGCGA CGGAGATCGA GCATATCGAA ATCTCCGGCA CCGGCGCGGG TTCGACCCTG ACGGTGTTGG CCGATGATTT CGATCCCGAT CCCGTACAGC CGCATGACGA ACGAATCGCG GTCACCGACC AGAGCGTCGA CGGCTTTACC GTGGAGTTCG CCGAGAGCCT GGAACAGGTG CGGGGCAGCG GTTACGGCCG GCTGCTGATC GACGGCCGCC GTGGCGACGA CAGCATTGAT TTCACCGGGC TGAAAGCCGG CGGCACCAAT GGTCTCAATG ACGACATCAG CGCCCTCGAA GTGCGCGGCG GGGAGGGCGA GGACCAGTTC GTGTTCGGCG TCGACAGCGT CGCCGCGACG GTGGTCGGCG GTCTCGAATC GGATCTGTTG CGGTTCGGTG GCGATACGGT GATCGCCAAG CAGGTCGGCG ACGCCGTCGT CGACAAGACC GGCACATTGA GCGGCAACTA CAACGCGTTC GACGTCAGTT GGAACGGCTC GCGTCATGTG ACCGCAACGG AGATCGAGCG TGTCGAAATC TCCGGCACCG GCGCGGGCTC GACCCTGACG GTGCTGGCCG ATGATTTCGA TCCCGATCCC GCACAGCCGC AGGACGAACG AATCGAGGTC ACCGACCAGA GCGTCAACGG CTTCACCGTG GAGTTCGCCG AGAGCCTGGA 
GCAGGTGCAG GGCAGCGGCT ACGGCCGGCT GGTGATCGAC GGACGCCGTG GTGACGACAG CATCGATCTC ACCGGGCTGA AAGCCGGCGG CCTCAATCGC CTCAACGACA GCATCGACGC CGTCGAATTG CGCGGCGGTG AGGGCGAGGA TCGGTTCGTG TTCGGCGTCG ACAGCGTCGC CGCGACGGTG ATCGGCGGCG TCGAAACCGA TCTGCTGCGG TTCGGTGGCG ACACCGTCGT CGCTCAACAT GTCGGCGACG CCATCGTCGA CAAGACCGGC ATGCTGAGCG GCAACTACAA TGCATTCGAC GTCAGCTGGG GCGGTTCGCG CCAGATCACC GCGACGGAGA TCGAGCGTAT CGAGATCTCC GGCACCGGCG CGGGATCGAC CCTGACGGTC CTGGCCGACG ATTTCGATCC CGATCCAGGA CAGCCAGGCG ACGAACGAAT CGAGGTCGCT GACGACGGCC TGAGCCGCTT CACCATCGGG TTCGCCGAGA GTCTCGAGCA GGTGCAGGGA AGTGACTACG ATTACCTGGT GATGGACGGT CGCGGTGGCG ACGACATGTT CGATCTGTCC AAGCTGACGG CGGCGTCGGG GCTCACCTCG GTCGAGCTGA CCGGCGGTGA CGGCAGGGAC GAATTCATCC TCGGTTCGGC GAATCCGTTG ATCAAGATCA CCGATTTCGG CGTCGGCGCA ACGCCCAACG ATCTGCTCGA TCTTTCACAG CTCCGCGCCG CGGGCGTCGA TGCTGACGAC ATCATCTTCG ACGACGGAAG TAACGAGATC AGTCTCGGCG ATCTACTGGG AGGCAGTACG ACCAATTTGT CCGGCACCGT CCAGGTGCTC GACGCCAGCG GCGGCAGCAA CGTCGTCGTC GCAGAGTTCA ATATGGTGGG TAGTACCATC AATCAGGACA TGATGCAGCT GGTGTGGCAG CACGTCGAAA CAATCACCGT CTAG
|
Protein sequence | MNALIKMAQL QLSEATTSAV GLKTFKVSKP ETGQALAIQL DGKSVVDLTS FADEKITLVR LGNRLVILFD NQSTVTINPF YDALGRPIAD VSLQVSQGQP ISGAEFVASF PISTDQSILP AAGAGGPAPV ASSASFSDIT IDQLNPSGNG IGSRAPVETG VVTTTIRDVF EPGSGSPLGD TNDAAVITGT STASLLEADV VLGTGGTLVA SDPDSSNAFV PQAAVAGSNG YGNFSIDAAG VWTYAMTTPH NEFVGGVDYT DSVTVATVDG TTQVVTVTIT GTDDATVITG ASTATLIEAD VVLSTGGTLV ASDPDSSNAF VPQAAVAGSN GYGNFSIDAA GVWTYAMTTP HNEFVGGVDY TDSVTVATVD GTTQLLTVTI TGTDDATVIA GFSTATLTEA DVVLSTGGTL VASDPDSSNA FVPQAGVAGS NGYGSFSIDA AGVWTYSLTT AHDEFVGGFD YTDSMTVATV DGTTQLVTVT ITGTNDAAVI TGTSTASLVE ADAVLTTGGT LVASDIDNLN AFVVQTGVAG SNGYGTFSID ASGAWTYSMT TAHDEFVEGV DYTDSITVTT VDGTTQLVTV TITGTDDATV VSSADVSVAE GNAPISTGGT LTISDVDSAV TFVAQPGTPG AHGIFSIGTN GAWTYVANDA FDGLNVGQSV SDTFSVSAAD GTLTSVQVTI TGTNDLAFVS SADVSVAEDN APISTGGTLT ISDVDSAATF VEQSGTPGVH GIFSIGTNGA WTYVANDAFE SLSVGQSVSD TFSVSAADGT LTSVQVTITG TNDAPVITVA SPDSSSASLT ESDAALTASG TLTVTDVDVS NTVTASVQSV VVAGPSGGLS SAALLAMLAV PGAAIAADPT DAHNLAWTFN SAPQAFDFLA QGETLSLTYT VGVQDGAGGS DTQTVTVTVA GTNDTPVLIT GPGYSEPASQ MTGAVTEDAA ANQVRGIVNV FDADNGHGLT AVIDNPNGTY GTLARDTWVD GNGDTQDGWV YTLDNTRSAT QALKEGEVQT ETFTFKVMDE HGAIDTETVT ITVTGNNDAP IVAHAITNQT LLEDHAWTFQ VPADTFADVD GPPLEYSAFL ANGDPLPDGM HFDPATLTFT GTPPLNETHS PDLKVVAWDG EFSAEAYFTL NITPVNDAPA GTNGAVTTLE DIAYTLTLAD FGYTDANDDP ANTLLGVKIT TLPGAGTLAL DGHPVTAGAT ASAADILAGK LQFVPAANAN GSNYASFTFQ VQDDGGTANG GVDLDPLANT LTINVTPVND APVLSGHENP SAIAEDSGAA GAATTVAHLL GLDDVNPAND HAIDVDAASL GIVITAVDDS HGVWQFQSGG GTWTDVDLAP GEGLHLAQTD ALRFIPSRNV QTTDATLFDA SGNPVLGSTN PSFVAEPPGL TFRAWDMSNG IAAGITALVV GFGGTSAYSA TAETASLVVT GTLDRVFTNG DDTVDLTGLS NDPAVDAVWF EDQNFFDAGE GDDNIVLPRM AGPPGPDPLA PVYQNQTFHL GAGDDTADAS ATIGMTILGG DGADRTIASA ETTDLAFAGG GDESTDVLQV DAIADAIVDK TGTLSGNYNA FDVGWNGSRH VVATEIEHIE ISGTGAGSTL TVLADDFDPD PVQPHDERIA VTDQSVDGFT VEFAESLEQV RGSGYGRLLI DGRRGDDSID FTGLKAGGTN GLNDDISALE VRGGEGEDQF VFGVDSVAAT VVGGLESDLL RFGGDTVIAK QVGDAVVDKT GTLSGNYNAF DVSWNGSRHV TATEIERVEI SGTGAGSTLT VLADDFDPDP AQPQDERIEV TDQSVNGFTV 
EFAESLEQVQ GSGYGRLVID GRRGDDSIDL TGLKAGGLNR LNDSIDAVEL RGGEGEDRFV FGVDSVAATV IGGVETDLLR FGGDTVVAQH VGDAIVDKTG MLSGNYNAFD VSWGGSRQIT ATEIERIEIS GTGAGSTLTV LADDFDPDPG QPGDERIEVA DDGLSRFTIG FAESLEQVQG SDYDYLVMDG RGGDDMFDLS KLTAASGLTS VELTGGDGRD EFILGSANPL IKITDFGVGA TPNDLLDLSQ LRAAGVDADD IIFDDGSNEI SLGDLLGGST TNLSGTVQVL DASGGSNVVV AEFNMVGSTI NQDMMQLVWQ HVETITV
|
| |
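The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of how that could be done; the helper names are hypothetical, and the start-codon test is a simplification, since translation table 11 also permits start codons other than ATG.

```python
def clean_sequence(raw):
    """Strip the spaces and line breaks that group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq, stop_codons=("TAA", "TAG", "TGA")):
    """Rough check: whole number of codons, ATG start, terminal stop codon."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in stop_codons
    )


if __name__ == "__main__":
    # First two blocks of the gene sequence, used only as a short demo input.
    fragment = clean_sequence("ATGAACGCGC TGATCAAGAT")
    print(len(fragment), gc_fraction(fragment))
    # A 20-base fragment is not a whole number of codons, so this is False.
    print(looks_like_complete_cds(fragment))
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared with the listed gene length, and `gc_fraction` with the listed GC content.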