Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3026 |
Symbol | |
ID | 3910825 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3449079 |
End bp | 3450059 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637884932 |
Product | hypothetical protein |
Protein accession | YP_486639 |
Protein GI | 86750143 |
COG category | [S] Function unknown |
COG ID | [COG4765] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.906877 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCGG CCGAAGCTGT AGAATGTCGC AAGCCTGATT CGTCATTGAA GCCGCGCGGA ATGTCCCGAA GCCTCTGTTT TGTCGGTCTT GCGGCGCTGA TCGCCGCCCC GTTGATCACG CTCGCGCCGC CGGCGCAGGC GCAGATCGGC AATATTTTCT CCGACCCGGC GCCGCGTCCG CCCGGTGCGA TCCCGCGTGG CGCGCCGCAG ATGGACGACG AGGAAGAGGT TCCCGACCTG CCGCCGCAAG GCCGCGTGCT GCCGGCGCCG ACCCGGCCGC CGCAGGGCTC GGCACTTCCA GGCCCGGTGC AGTCTCAGCC GCTGCCGCCG CCGCCCGGCA CCACGATCAT TCCGCAGACC CCGCCGGCCA CGGCCAACGT GCCGCCCGGC CAGCCCGACA GCGGCGTCGC CAACGCGCCG CCGGGTGCCA ATCCGCTGCC GGGCCTGCCG CCCGGTCAGC GCCAGCCGCG TGGCGCGCCG CCGACGCCGG CGACGCTGCA GCCCGGCGAC GAGATCGTCA CCGAGCCGCC GGCGCAGAAG ATCGTCAACA AGAAGGCGAG CTTCACCGGC CTCGACAAGA TCACCGGTCG CACCATCAAT TTCGACGCCG ACATCGGCGA GACGGTGCAG TTCGGCGCGT TGCGCGTGAA GACCGACGCC TGCTACACGC GGCCCTCGAC CGAGGCTGCC AACACCGACG CCTTCGTCGA GGTCGACGAG ATCACGCTGC AGGGCGAGGT GAAGCGGATC TTCTCCGGCT GGATGTTCGC CGCCAGTCCC GGCCTGCACG CGGTCGAGCA TCCGATCTAC GATATCTGGC TGACCGATTG TAAGAACCCC GAGACGCCGG TCGTCAGCGC CCAACCCGAT GCGCCGAAGC CCGCCGCTGC ACAACCGCAG CAGCAGCAGC GCCGCCGCCA GCCGCCACCG CGCCAGACCC AGCAGGCGCC GCCGCCGCCT TTGCCGGCAT TCCGGCAGTA A
|
Protein sequence | MQAAEAVECR KPDSSLKPRG MSRSLCFVGL AALIAAPLIT LAPPAQAQIG NIFSDPAPRP PGAIPRGAPQ MDDEEEVPDL PPQGRVLPAP TRPPQGSALP GPVQSQPLPP PPGTTIIPQT PPATANVPPG QPDSGVANAP PGANPLPGLP PGQRQPRGAP PTPATLQPGD EIVTEPPAQK IVNKKASFTG LDKITGRTIN FDADIGETVQ FGALRVKTDA CYTRPSTEAA NTDAFVEVDE ITLQGEVKRI FSGWMFAASP GLHAVEHPIY DIWLTDCKNP ETPVVSAQPD APKPAAAQPQ QQQRRRQPPP RQTQQAPPPP LPAFRQ
|
| |