Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3316 |
Symbol | |
ID | 3911117 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3792806 |
End bp | 3795811 |
Gene Length | 3006 bp |
Protein Length | 1001 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637885218 |
Product | hypothetical protein |
Protein accession | YP_486923 |
Protein GI | 86750427 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.104809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.215386 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAGGCCCC CATTGAAATC CCGAACGCCC TACCGCACAT TCCGCGATAA CTCAAAAGCC AAGGCCGCCG TTTACAAGGC AATCAGAACG ATTCGCGACC AACGCTCCCG CAACCTAGTA CGGGGAGCCA TCGCGAGGAA CACCGATTCG TCGCCCTCGC TTTCGATCAA CACTCTCCGT CGCCAAGCAG ATCCGGGGGA CCTTTGCCTC GCAACAACTC TCCTCCTAGG TCAAAACGGC CCATTTCCGT TGCCGCAGCT CGGAGGCACC TACCGTAACC TCATTATCAA GCACACGATG CTGGCGACAG AACCCGACTC CGAGGTCGCT TTTGTTGTCG GTCACGTGAA CGGCTGGGCG GACAACGCCA GACGCATCCT CGCCGACATG TCGGAACTAG CGCGATTGCC GCAGTACCAA CCATCCCTTG CAATGGAAGC GCTCGCCGCG TTCGCGGAGT CGTACGGCTC ATCCTTCTAT ATCCTGCGGA AGACTGCATA CCTAATGTCC CGCTACGCTG ACGACTCAAC ACTCCAGTCA GCATTCCAAC GGATTGCAAA AGCGTTCAAC CAATCCGCTT ATCCCGAGCC ATACTTTTCT GCCCTTCAAT TGATGGAAAC CGACTCCAGC TACTGTTCCA CCGCGACCAC CCGCGTCCGC CTCTTTCAAA AGTATGTTGA AGACGACTAT CGCCAATATC TCCCCTTGAA CGAACTCGTT CCGACCCCGC TCTCTCGGAC TGATCTTGCC GCGCTTCTTC GTCGCTCCCA CTCAAGTTCT CTTGTCGATG AACTCGCCGC ACTCGCTATA ACCGCACATC TCAGATCGTT CTGGCCAGCC TTATTTGACC AGACCATCGC CCTTCTCGAT CCGTCTATTG CACACCTCTT TCTTGAGTTC TTGTCGATTC CGTTCGACCC ATCCGCTCTG TATTCGGACG TCGCTCGCAT GGACGCGGAC GTGGTATATT ACCAGCGCGC AGCCGCCTTT TGCGAGTTCA AAGATTGCGC CATCTTCCGT CGTTCAGTGG ACGCAATTCT TGTACCACGA TTGATGGAAG ACATCTCACC ACCCGTCGAC ACAGACACCT CGCCGCACTT CCATACCAAG ATGCCAGACC TCATCAAGGC TACACAAGGC TTCGTTTCTC CCACCGACTA CGAACGAGTA CAGAGCTGCG GGACATTCCT CCGAACCGTG CAATTCCTAC AGTTCCTGAA CACTCAACGA AACTACTCAC CCCTGTCCGC CCATGACTTC CGATTCATCT GCGAACATAC CGTCGCCCTC GACATACTAT TGTGCGACTC AGAGATCGAG ATGCTCTACG CATCATCGGA CGACGACTCA CGCCCACTGA TAACCGTTCT CGCGCTCGCA CTACATAAAG CAAAAAGTCG CGACGATGAC ATCGATTTCA AATTTCGCCA CTCGCTCTGC AATACAGTAA TCACTCAATT CAATGGGAGC CTCGAAGCCT TCATTGCTTG GCTACTTCCG AATACCCCCT CCATTGCCGA CTACCTCCTA ACAATACTCG ACAGACCAAC ACTCCAGAAA CTCTACTGGA TAGTCAGATC AGCAGACGAA GCGGATCGCA CGAGACAATC GCTACTGCGC ACCGTCGGAA AACAACGAAA TCAGATCGCA CATCTGATAG AGGCTGACGC AATCGAAGCG CGGCGCCAAG TCGCAAAACT TCGAAAGTTC TTCGATGACA GCCGCATCTA TGTCGACGGC CCAGCGATGA AAGAGTGGCT GGTCGCAAAC CCCAGCACCT ACGCTCAACA ATACGTAAAG ATGATTGAGC ACGAGTTCAG CTCGTTGACA TTCCTGTCAA CATCAACCAG CGGAAAACTA GTAATCACGG AGTGGAGCGA TCTCGACTAC GTCTTAGTCG AAGCCGGAAA AGCCGCGTTT GAGCAGTTCT GCACAAACAA GCAGTTCGGG ATTGAGTCGT ACCTTGGTCG GCGCATTCGC CACAACACTA TGAGCGGGAT GATGCGAGGC GGCATCGACG ACCTAATCGA GTCACCCACT TATGATTTGC TAACATATGA TAGCGCTTTT GTTGACGCCA ACAAGCGATG GGTTGCTTCT TATCACCGAA CGATTGAGCA CTTGCGGAAG GATCTACTGC AGTTTCGGTC CGACGCAAAG CCATTGGGAA TCTTCAACTC AACTCTGAAG CGCGACGACA ATACCAACCT CGCCATCGCA TCCCTTCGAA ACATGCTATT GAATGGACGC AATCCGTTAT TGATGAACGA CCTGCTTATA CGCTTCTGCT GGCAAGAGAT TAACCCTCAG CTTCAAACTG CCTCACGCAT GATTTCCATC GACCTCGTGA AAGAGGCCAC GAGAGAAATC GAACAGCATT TCTTTCATTT CGATTCCGAC GATCTCCAGC GACGATACCG ACAACAACTC AGAGCGCTAG TCCACGAACG CTTCATGCGA CTCGGTAGTT GGTTTCGACA ACCCGAGGAT GGCTTTGTGA GCGCACGCAC TCGTCAACTC TGCGAACTCG TTCTTGTCGA GGCCACAGAT AGTAACTTAT TTGGCACACC AACGGTCGAG TGGTCCGGTG ACGCTCTCGA CCTGGAAATC GATGGTCTAT CCGTCCACCG AATGTACGAT TGCCTCTTCG TTATTCTTCA CAATGCGCTT ACTCACGGAC AAGAGAACGG CCTCATAACA ATTCAGGTCT CACAGGAGGC CATGGCCTTC GAGAGCGTCG GCCACCTCAA GGCCACCGTT TCCTCTCGTT TCTCCAGCAC AAGCGACAGG TCAAAACACA TAGCGCGATT GGCAGAGAGC TTCGGATATG GAGATCCAGA GTCAGCCATG GTAACTGAAG GATATTCAGG AATTAAAAAG CTACGGTATA TAACACGAAC TGGCGACAAC TATTCGAATG CCGGATACAC TATTGATGCC GACACCTGCT CTGTCTTCTT CACACTTGCG GTCGAATTAG CTGATCTCGA AAGGCCAGAT GTATGA
|
Protein sequence | MRPPLKSRTP YRTFRDNSKA KAAVYKAIRT IRDQRSRNLV RGAIARNTDS SPSLSINTLR RQADPGDLCL ATTLLLGQNG PFPLPQLGGT YRNLIIKHTM LATEPDSEVA FVVGHVNGWA DNARRILADM SELARLPQYQ PSLAMEALAA FAESYGSSFY ILRKTAYLMS RYADDSTLQS AFQRIAKAFN QSAYPEPYFS ALQLMETDSS YCSTATTRVR LFQKYVEDDY RQYLPLNELV PTPLSRTDLA ALLRRSHSSS LVDELAALAI TAHLRSFWPA LFDQTIALLD PSIAHLFLEF LSIPFDPSAL YSDVARMDAD VVYYQRAAAF CEFKDCAIFR RSVDAILVPR LMEDISPPVD TDTSPHFHTK MPDLIKATQG FVSPTDYERV QSCGTFLRTV QFLQFLNTQR NYSPLSAHDF RFICEHTVAL DILLCDSEIE MLYASSDDDS RPLITVLALA LHKAKSRDDD IDFKFRHSLC NTVITQFNGS LEAFIAWLLP NTPSIADYLL TILDRPTLQK LYWIVRSADE ADRTRQSLLR TVGKQRNQIA HLIEADAIEA RRQVAKLRKF FDDSRIYVDG PAMKEWLVAN PSTYAQQYVK MIEHEFSSLT FLSTSTSGKL VITEWSDLDY VLVEAGKAAF EQFCTNKQFG IESYLGRRIR HNTMSGMMRG GIDDLIESPT YDLLTYDSAF VDANKRWVAS YHRTIEHLRK DLLQFRSDAK PLGIFNSTLK RDDNTNLAIA SLRNMLLNGR NPLLMNDLLI RFCWQEINPQ LQTASRMISI DLVKEATREI EQHFFHFDSD DLQRRYRQQL RALVHERFMR LGSWFRQPED GFVSARTRQL CELVLVEATD SNLFGTPTVE WSGDALDLEI DGLSVHRMYD CLFVILHNAL THGQENGLIT IQVSQEAMAF ESVGHLKATV SSRFSSTSDR SKHIARLAES FGYGDPESAM VTEGYSGIKK LRYITRTGDN YSNAGYTIDA DTCSVFFTLA VELADLERPD V
|
| |