Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1789 |
Symbol | |
ID | 3972054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1942447 |
End bp | 1944192 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637924902 |
Product | hypothetical protein |
Protein accession | YP_531667 |
Protein GI | 90423297 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0254074 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGC TCGGCATGGC GATCTACGAT CCGCGCCGTC TCGACGACGA GACGTTCCTG GCCGGCTTCG TCGCGCGCGG CGATTTCGTC GCGTTCCTGC TCGACAAGCT GCGCGATATG CCGGAGCTGG CCGAGCATCA CATCATCGTC GGACCGCGCG GCATGGGCAA GACCAGCCTA CTGCGGCGTT TGGCGATCGG CATTTCCAGC GAGCCCGCGC TGCGGCAGCG CTTCATCCCG CTGACGTTTC GCGAGGAGCA ATACAATGTC CGCTCGCTGG ATGCGTTCTG GCGCAATTGC GCGGAGTCGC TCGCCGAATG GTGCGAGAGC GAAGCCAAGG CGGATGTCGC GGCCGAGATC GATCGCAGCC TGCTGACGCC GCAGTGGCGC GACTCGAACT CCGCCATGGA GGCGTTCCTG GCGCTGTCGA AGCGGCTCGG CGGCCGGCCG GTGCTGTTCG TGGATAATCT CGATCTGATC CTCGATGCTT TGTCGGCCGA ACAGAACTGG GAGTTGCGGC GGATTCTGCA AGCGCCCGGC GGCCCGGTGC TGTTCGGCGC GGCCACGCAG CTGCTGCGCC AGTCGGGCGA GCGGGATGCC GCGTTCTACG AATTCTTTCA TCCGCACATG CTGGATCCGC TCTCGGAAGC CGAACTGCTG CAGTGCATGA ACCGGCTGGC CCAAGCCCGC GGCGAAGCCG GCAAGGCGGT CAGCGAAATT CTTGCGCGGG AGCCCGAACG AATCCGCACG CTGCACACGT TGACCGGGGG CAACCCGCGG GTGCTGACTC TGGTCTATCA ATTGCTCGAG CGCAGCGAAA GCGACACGGT GTTTTCCGAT CTCGAAGTGC TGCTCGATCA ATTGACGCCG TTCTACAAGG CGCGCGTCGA GGAGTATCAG ACCGATCTGC AACGCGCGGT GATCGACGCC ATCGCGCTGC ACTGGCACCC GATCACCTCG CACGATCTCA GCCTGGCGAC CGCGGTCGAC GTTACGACCA TTTCGCCGCA GCTCAACAGG CTGAAGCGCG ACGGCCTGGT CGAAGAGGTG GAGACGTCGG GCGCGCGCGC CGGCTATCAG CTGGTCGAGC GGTTCTTCAA CATCTGGTAC CTGATGCGCC ACGGCACGAG GCGCACGCGG CAGAAGATCT ATTGGCTGAC GGCGTTCCTG AGCAGCTTCT GCGCGGCGGC CGAACTCGCG AAAATGAAAT CGCAGGCGAT GTCCGAGGGC CGCGCAAATT GGCATCCGCT GTATCTGGAA GCGCTCGAGG CGGTGGGCGA GGGCCGGATG GCGGAACTCG CGCCGCTCCG TGCTGCGCTT GGCGAACTCC CTCCGGTCGG ACGGGAACTG CTGGACGCCG CCGTTGAAAT CTCCAGCGAC AATTTCGGCG CCGCCATGCT ACATCTTCAA GAAGCGCTCG CCGGCGATCA AGACCAGCTC CGGTCGACGT TTTTCGAGGA TCTTCTGCGC CTTCTGCGCA TTGCCGAAGC CCGCGGCTAT GGCGAAAAGC TGATCGGCTG GTTCGTCGAG TCCGGTCAGG CGGAACGCCA GGCGCCGCTC TATGCGGCTT TTGTCGGCTA TGTGCGCGGC CAGCAGTTTC TGCTGGACTT CAGCCCGGAG ATTCGTAAAC CGGCCGAGCC GATCTTGCGC TGGCTGAGCA GCCGCCGGAA CAGCCAGCCG GTGCAGAACC CGCCCAAACC CAAACGTCGT CGTGGGCGCC CACCTCGGAA ACATTCAACG GCATGA
|
Protein sequence | MTTLGMAIYD PRRLDDETFL AGFVARGDFV AFLLDKLRDM PELAEHHIIV GPRGMGKTSL LRRLAIGISS EPALRQRFIP LTFREEQYNV RSLDAFWRNC AESLAEWCES EAKADVAAEI DRSLLTPQWR DSNSAMEAFL ALSKRLGGRP VLFVDNLDLI LDALSAEQNW ELRRILQAPG GPVLFGAATQ LLRQSGERDA AFYEFFHPHM LDPLSEAELL QCMNRLAQAR GEAGKAVSEI LAREPERIRT LHTLTGGNPR VLTLVYQLLE RSESDTVFSD LEVLLDQLTP FYKARVEEYQ TDLQRAVIDA IALHWHPITS HDLSLATAVD VTTISPQLNR LKRDGLVEEV ETSGARAGYQ LVERFFNIWY LMRHGTRRTR QKIYWLTAFL SSFCAAAELA KMKSQAMSEG RANWHPLYLE ALEAVGEGRM AELAPLRAAL GELPPVGREL LDAAVEISSD NFGAAMLHLQ EALAGDQDQL RSTFFEDLLR LLRIAEARGY GEKLIGWFVE SGQAERQAPL YAAFVGYVRG QQFLLDFSPE IRKPAEPILR WLSSRRNSQP VQNPPKPKRR RGRPPRKHST A
|
| |