Gene Information |
Locus tag | RPB_2344 |
Symbol | |
ID | 3909342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2693067 |
End bp | 2694125 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637884241 |
Product | hypothetical protein |
Protein accession | YP_485960 |
Protein GI | 86749464 |
COG category | |
COG ID | |
TIGRFAM ID | |
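
The coordinate and length fields above are related by simple arithmetic, which gives a quick sanity check on the record. A minimal sketch, assuming 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2693067, 2694125)
print(length)                     # 1059
print(protein_length_aa(length))  # 352
```

Both values agree with the Gene Length and Protein Length fields of this record.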
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.163785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.669867 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTGCCC TGGTGATACT TCTTTTGATC GCCGCGGCGG CCTTGGCGGG CTGGCTGTAT TTTCACGGCT CGTTCGACAG CTACCGGCAG AAGACCGGCG AAGCCTTCAT CCGGGTCGGC GATTTTCGCG CGGCGCATCT GCCGTCCGCG AACTATGCCG GGCATTATCT GCCTTACGCG CTGTTCGCGC TGCGGGCCTA TGATCCGGCC GGACCCGGCG GTTCGCCGAA GCTGAATGCG TTGCTGGAGC GGCCGGCACC GGACGATCCT GCGCGGAGCC GGATCACCGC CGAACTGCGC GGCGGCTGGC TCGGCGACTG GGCGTTGATC GAGCAGGGCG AGGGGCCGTT GCCGTGTCCC GACGGCGACG GGCAGTGCGG CGGCCGGGCG CTCGGCGGGC TCGGCTACAA GGTGTTCAGC TCGCAGGCGC GCAACGAGAT CGTCGTCGCC TTCCGCGGCA CCGACTTCAA GGAGGCGCAT GACTGGATCG CCAATCTGCG CTGGGTGACG CGGTTTCTGC CGTACTATGA TCAATACGCG CAGGTGCAGC GCCACATCGG GCCGATCCTC GACCGGGCGC TGGCGGCGCA CAAGCTGGCT GATCCGACGA TCGTCGCCAC CGGGCATTCG CTCGGCGGCG GGCTGGCGCA GCAGGCGGCC TACAAGGACG GCCGCATCCG CGCGGTCTAT GCGTTCGATC CCTCGACCGT CACCGGCTAT TACGATCCCG GCGTTCCGGG CAAGGAGAAT TCGGTCGGCC TGCTGATCGA TCGGGTGTAT CAGCGCGGCG AGGTGCTGGC CTATCTGCGG TTCCTGATGA CGCAGCTTTA TCCGGTGGCG GCCTTCAATC CGCAGATCCG CACCGTGCGG TTCAGCTTCG GTGCCGAGGG CGGCGTCGTC GCCAGGCACA ACATGGCGGA TCTCGCCGCC GGCCTGCTCG AAACCTCGGG AACGCCGGAG GCATGGGCCG CGCGCGCCCT GCCGCTGCCG AATGCGCCCG GCAGCGACCC CGGCGAGACC TGGATCTATC GCCTCGTCGA CTGGATCAAC CGCAAGTAA
|
Protein sequence | MRALVILLLI AAAALAGWLY FHGSFDSYRQ KTGEAFIRVG DFRAAHLPSA NYAGHYLPYA LFALRAYDPA GPGGSPKLNA LLERPAPDDP ARSRITAELR GGWLGDWALI EQGEGPLPCP DGDGQCGGRA LGGLGYKVFS SQARNEIVVA FRGTDFKEAH DWIANLRWVT RFLPYYDQYA QVQRHIGPIL DRALAAHKLA DPTIVATGHS LGGGLAQQAA YKDGRIRAVY AFDPSTVTGY YDPGVPGKEN SVGLLIDRVY QRGEVLAYLR FLMTQLYPVA AFNPQIRTVR FSFGAEGGVV ARHNMADLAA GLLETSGTPE AWAARALPLP NAPGSDPGET WIYRLVDWIN RK
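
The protein sequence follows from the gene sequence by codon-wise translation. A minimal sketch, assuming the standard codon-to-amino-acid assignments (translation table 11 is understood to share these and to differ mainly in which start codons are permitted). `translate` is a hypothetical helper, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; amino acids listed in the same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGCGTGCCC TGGTGATACT TCTTTTGATC"))  # MRALVILLLI
# Last nine bases of the gene sequence, ending in the TAA stop codon.
print(translate("CGCAAGTAA"))  # RK
```

The two outputs match the first block and the final two residues of the protein sequence listed above.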