Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0768 |
Symbol | |
ID | 3969935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 850533 |
End bp | 852224 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637923883 |
Product | hypothetical protein |
Protein accession | YP_530658 |
Protein GI | 90422288 |
COG category | [S] Function unknown |
COG ID | [COG5338] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.8087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGATCC TGGGGCCAGC ATCGGGCCGC CGCAAGCGCG CAGAGGGATT GTGCGCGGTC TCGGCATGCC TGCTGCTCGC CCTGCTCGGC GCGACCGACG CCGCCGCGCA AAGCCTGACC CCGGAATTGC TCAGCCCGGA ACCCGGCGGA TTCGTCGCGC CGCAGGACCT GCCGCTGCGC CGCACCGCGC AAGCCCGGCC CGACCTCAAT GCGCTTCCCG ACGCCGAGCC GATGGCGCCA TCGCGGATCG GCAAGATTCC CAGCTACGGC ATCCCGGCCG CCGCCGGCGC CTCCGACAGC GGCTATGATT CGCTGAACCG CCGCCGCCGC GCGCCACGCT TCTATCCCGG CGCGCCGCGG CCGAAACGCG CCGGCCCCGG CAGCCCGGCG GTGATCGCGC CGCCGCCGCC GCGGCGGGTG GCGCCGCCCT CGGCGAGCGC GCATCGCACA CCGCTTGCGC CGTCCATGGC AGGAACGGTG ATCGGCCAGC CGCCGCGCCG CCAGCTCAAG GTCGATAACG ATCCGTTCGG CGCGGTCGGC GACTACGCCG GCAGCTTCCT GGTGAAATCC GCGGTCGAAA CCTCCGGCGG CTACGACAGC AATCCGGGCC GGGTGTCGGG CGGCAGCAAA GGCTCATCGT TCTACGTCGT GGCGCCGGAG CTGATGGTGA CCTCGGATTG GGAGCGCCAC GCGCTGGTCG CCGACCTGCG CGGCTCCTAC ACCGGGTATG GCAGCACGCG ACCCGACGAC GCCGGTTCGG CGTCCTCGGC GCCGGCCAAT CTCGACCGCC CGAGCTTCGA CGGCCATGTC GATGGCCGGC TCGACGTGTC GCGCGACACC CGCCTGCTCG GCCAGTTGCG GATGCGGCTG TTCACCGACA ATCCGGGCAG CCCGAACCTC GACTTTGGCC TGGCGAAATA TCCGCTGGCG CTGAACACCG GCGCCTCGGC CGGCTTCGAC CAGAATTTCA ACCGGCTGCA GGTCTCCGGC GTCGGGACGC TCGACCGCAC CACCTATCAG CAATCGGTGC TCACCGACGG CTCCACCGAG CCTAACAACG ACCGCAATTT CAACCAGTAC GGCGGCGTCG CCCGGGTCAG CTACGAGTTG ATGCCGGGGT TGAAGCCGTT CGGCGAGGTC GAATACGACA GCCGGGTGCA CGATCAGGCG GTCGACCGCG GCGGCTATCA GCGCGACTCC ACCGGCGGCT ACGTCAAGGC CGGCACCAGC TTCGAATTCA CCCGGCTTTT GACCGGCGAA ATGTCGATCG GCTACACCGC GCGCAGCTAT ACCGACCCGC GGCTGGAGAA ACTGACCGGG CTGTTGACCA GCGGATCGCT GATATGGGCG GCGACGCCGC TGACCACGGC GAAGTTCATC ACCAATACCT CGGTCGACGA ATCGACGCTG CCGGGGGTTT CCGGTGTGCT GACCCGCAGC TACACCGCCG AGGTCGACCA CGATTTCCGG CGCTGGCTGA CCGCGATCGG CAAATTCACC TATGGCAGCC AGGACTATCA GGGCTCGTCC CGGCTCGACC ATTTCGGCGC GGTGGCCGGC AATCTGGTCT ACAAGCTGAA CCGCTCGTTC CAGCTCAAGG CCGAGCTGCG CCGCGAATGG CTGGATTCCA ACCTGCCCGG CAACAGCTAT TCCGCCAATG TGGTGATGCT CGGGGTGCGG CTGCAGAACT GA
|
Protein sequence | MVILGPASGR RKRAEGLCAV SACLLLALLG ATDAAAQSLT PELLSPEPGG FVAPQDLPLR RTAQARPDLN ALPDAEPMAP SRIGKIPSYG IPAAAGASDS GYDSLNRRRR APRFYPGAPR PKRAGPGSPA VIAPPPPRRV APPSASAHRT PLAPSMAGTV IGQPPRRQLK VDNDPFGAVG DYAGSFLVKS AVETSGGYDS NPGRVSGGSK GSSFYVVAPE LMVTSDWERH ALVADLRGSY TGYGSTRPDD AGSASSAPAN LDRPSFDGHV DGRLDVSRDT RLLGQLRMRL FTDNPGSPNL DFGLAKYPLA LNTGASAGFD QNFNRLQVSG VGTLDRTTYQ QSVLTDGSTE PNNDRNFNQY GGVARVSYEL MPGLKPFGEV EYDSRVHDQA VDRGGYQRDS TGGYVKAGTS FEFTRLLTGE MSIGYTARSY TDPRLEKLTG LLTSGSLIWA ATPLTTAKFI TNTSVDESTL PGVSGVLTRS YTAEVDHDFR RWLTAIGKFT YGSQDYQGSS RLDHFGAVAG NLVYKLNRSF QLKAELRREW LDSNLPGNSY SANVVMLGVR LQN
|
| |