Gene Information |
Locus tag | RPB_1816 |
Symbol | |
ID | 3908975 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2077411 |
End bp | 2078376 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883710 |
Product | hypothetical protein |
Protein accession | YP_485435 |
Protein GI | 86748939 |
COG category | [S] Function unknown |
COG ID | [COG1729] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR02795] tol-pal system protein YbgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.18225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.591091 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCCA TTCATGCCCT TTCTGCGCGC AGCCTCGCGG TGTTCGCGAT GCTCGCTTTC GCCGCTCCCG CGCTGGCCCA GCAATATGGC GGCGAGGTCG ATCCCGAAAT CCGCATCCAG CAGCTCGAAG AGCGGTTGCG CACGCTGACC GGGCAGAACG AGGAGCTGCA GTATCGCAAC CGCCGGCTGG AAGATCAGGT CCGGCAATTG CAAAGCGGTG CAGGGGTTCA GCCGGCCGCA CCCGGCAACG CCGCGCCGCC GCCGGCTGCC GCTGCGCAGC AACCTTCTGT CTACGGCCAG CCGCCGCAGG CACCGATCGT GCAGGATCAG CCGGTGGCGC CGCCGGCGAC CGGCCGCCGC CGGGGCGATG CGTTCGATCC GAGCCAGAAC CCGAACGCGC CCGGCGTGCC GCGGGCGCTC GGCGGCGGAC AATTGCCGGT CCCGGCCGAG CAAAGCGGGG TGGCGGGCGC GCCGCTCGAC CTGTCGAACA ATTCGGGCGG TCGCTATCCC GACGCCGGCG CGCCGCCGCA ACCGGCGCCG AGCGCTGCGG CCGGGGGCGG GCTGACGACG CTGCCGCCAT CGGCCAGCCC GCGCGACGAG TTCGATCTCG GCATCGGCTA CATGCAGCGC CGCGACTACG CGCTCGCCGA GGAGACGATG CGCAACTTCG CCAGCAAATA TCCCAACGAC GCCCTGACGC CGGACTCGCA ATACTGGCTC GGCGAGAGCT TCTTCCAGCG CCAGATGTAT CGCGACGCGG CGGAAGCCTT CCTCGCGGTC ACCAGCAAAT ACGACAAGTC GGCGAAGGCG CCCGATGCGC TGCTGCGGCT CGGCCAGTCG CTGTCGGCGC TGAAGGAAAA GGAAGCCGCC TGCGCCGCGC TGGGCGAGAT CGGCCGCAAA TATCCGAAGG CATCGGCCGG CGTGAAGAAG GCGGTCGACA CCGAGCAGAA GAAGCTTAAG TGCTAG
|
Protein sequence | MSPIHALSAR SLAVFAMLAF AAPALAQQYG GEVDPEIRIQ QLEERLRTLT GQNEELQYRN RRLEDQVRQL QSGAGVQPAA PGNAAPPPAA AAQQPSVYGQ PPQAPIVQDQ PVAPPATGRR RGDAFDPSQN PNAPGVPRAL GGGQLPVPAE QSGVAGAPLD LSNNSGGRYP DAGAPPQPAP SAAAGGGLTT LPPSASPRDE FDLGIGYMQR RDYALAEETM RNFASKYPND ALTPDSQYWL GESFFQRQMY RDAAEAFLAV TSKYDKSAKA PDALLRLGQS LSALKEKEAA CAALGEIGRK YPKASAGVKK AVDTEQKKLK C
|
| |
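The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch of that check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2077411
END_BP = 2078376
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def strip_blocks(seq: str) -> str:
    """Join the space-separated 10-character display blocks into one string."""
    return "".join(seq.split())


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

For this record both checks hold: 2078376 - 2077411 + 1 = 966 bp, and 966 / 3 - 1 = 321 aa. `strip_blocks` can be applied to the Gene sequence or Protein sequence field before passing it to other tools.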