Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2662 |
Symbol | |
ID | 3910455 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3045565 |
End bp | 3046767 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884562 |
Product | hypothetical protein |
Protein accession | YP_486275 |
Protein GI | 86749779 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.752873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.243579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA TGGCCGTGCT CAGCGAAGGC CGTTCCGCGG AACACGCGTC GTCGTCGGTG CGTCCCGGTC GCATCGCACA TGTCGAGATT TTCCGCGACA TGGCCTCGAC GGAAGCGATC TGGCGCGCGC TGGAACAGCC CGAACAATTC TCCACGCCGT ATCAGAGGTT CGACTTGCTC GACGCGTGGC AGCGCCATGT CGGCCGCGCC GATCATGTCG AACCCTTCAT CGTGGTGGCC AGCGACGCCG AGCAGCGGCC CTTGCTGCTG CTGCCGCTCG GCCTGGAGCG GCGCTTCGGC GTTCGGATCG CGCGCTTCCT CGGCGGCAAG CACACGACGT TCAACATGCC GCTGTGGCGC AGCGATGTCG CGCGGACCGC GGATGCGAAC GACCTCGCCG CCCTTGTCGC AGGCCTGCGG GCGCGCCCGG ACGGCGCCGA CGTGCTGGCG CTGTCTCAGC AGCCGCTTCG CTGGCGCGAC CTCGCCAACC CGATGGCGCA GCTGCCGCAT CAGCCCTCGA TCAACGATTG TCCGGTGCTG CTGGTCGATC CTGCCGCGCC GCCGACCGAC CGGATCAGCA ACTCGTTCCG CCGCCGGCTC AAGACCAAGG AGAAGAAGCT CCAGACATTG CCCGGCTATC GCTACGTCCA GGCCAGGAGC GACGCCGACG TCGAACGCGT GCTCGATGCC TTCTTTCGGA TCAAGCCGAT CCGCATGGCG GCGCAGAAGC TGCCGAACGT GTTCGCCGAC CCGGGCGTCG CGGATTTCAT CCGCCAGGCC TGCATGACCG AGCTCCGGGG AGGCGGCCGG GCGATCGAGA TCCACGCGCT CGAATCCGAC GACGAGACGA TCGCGATGTT CGCCGGCGTG GCCGACGGCC ATCGCTACTC GATGATGTTC AACACCTATA CGCTGTCGGA GGCGTCGCGC TACAGTCCCG GCCTGATCCT GATGCGCTCG ATCATCGATC ACTACGCCGC GCAGGGCTAT CGCCGGCTCG ATCTCGGCAT CGGCTCCGAC GACTACAAGA AACTGTTCTG CAAGGACCTC GACCCGATCT TCGACAGTTT CATCGCGCTG TCGCCGCGCG GCCGTCCGGC CGCCGCAGCG ATGGCATCGA TCGCTCGCGC CAAACGCGTC GTCAAGCAGA CCCCTGCCCT GATGCAGATC GCGCAACGGC TGCGCAGCGC GCTGCATCGC TGA
|
Protein sequence | MTMMAVLSEG RSAEHASSSV RPGRIAHVEI FRDMASTEAI WRALEQPEQF STPYQRFDLL DAWQRHVGRA DHVEPFIVVA SDAEQRPLLL LPLGLERRFG VRIARFLGGK HTTFNMPLWR SDVARTADAN DLAALVAGLR ARPDGADVLA LSQQPLRWRD LANPMAQLPH QPSINDCPVL LVDPAAPPTD RISNSFRRRL KTKEKKLQTL PGYRYVQARS DADVERVLDA FFRIKPIRMA AQKLPNVFAD PGVADFIRQA CMTELRGGGR AIEIHALESD DETIAMFAGV ADGHRYSMMF NTYTLSEASR YSPGLILMRS IIDHYAAQGY RRLDLGIGSD DYKKLFCKDL DPIFDSFIAL SPRGRPAAAA MASIARAKRV VKQTPALMQI AQRLRSALHR
|
| |