Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_3644 |
Symbol | |
ID | 4024158 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 4066954 |
End bp | 4068093 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637963848 |
Product | hypothetical protein |
Protein accession | YP_570768 |
Protein GI | 91978109 |
COG category | [S] Function unknown |
COG ID | [COG4645] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.82541 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.662915 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATCA TGAACGACGC AACGAAATCA GGCGGCACGG GACGGGATCT GCGTCTTGAT CTGTTCCGGG GCATGGCGAA CTGGGCGATC TTCCTGGATC ACGTGCCCAA CAACGTGGTC GCGTGGCTCA CCATGCGGAA CTACGGGTTT AGCGACGCGG CGGAGCTGTT CGTTTTCGTA TCGGGGTTCA CGGTCGCGTT CGTGTACTCG AAGACGCTGC ATGCGAAGGG TATTCTTGCA GCGACCGCCG GGATCCTCGG CCGGGTCTGG CAGATCTACG TCGCCTACGT GCTCCTTTTC GTCTTCTACG TGGTGGCGGT CGGCTACGTC GCTCAGCGCT ACGGTCACGC CCATCTGCTC GACGAATACA ACATCCGCAG CCTCATCGCC GATCCGGTCG AGTTTCTGAA ACATGGGTTG CTGCTAGAGT ATCGTCCCCT CAACCTCGAC GTGCTGCCGC TGTACATCGC CCTGATGGCC CCGTTTCCCT TGGTGCTCTG GTCGTTGACC AAGGCCCCAG GCGTCACGTT GGCGGGCTCG ATCGCCCTCT ATGCGGCGGC CCGGTCGTTC GGCTGGAACC TTCCGGGTTA CCCGGCAGGA TATTGGTATT TCAATCCGTT CGCCTGGCAG CTCCTTTTTG TGATCGGTGC GTGGACGGCC ACCGTCGATC GCGGCACGTT GGATCGGACG TTGCGCTCGG GCATCGTGCT GCCGCTCGCC ATCGCTGTCG TCGCTATTTC GGCGATCGTT ATGCTTGCGC CACTGGCGGG AAATGCCTGG TTGCTGCCGG AGATGCTGCG CCTTCCCTTT CCGATGGCCG ACAAGACGAA CCTTGCCCCT TACCGCATCG CCCACTTCCT AGCCTTGGCG ATCATCGTGG CGCGCCTCGT TCCGAGAAAC GCGCCGGCGC TGGCCTGGCC GGTCTGGCGG CCGTTGATCG TCAGCGGGCA GCATTCGCTC GAAGTTTTCT GCGCGGGAAC GTTTTTCGCG GCCATCGCCT ATTTCACCCT CGATCTCGTC GATGGATCCG TCAGATCCCA GCTCGTCGTG AGCGCGGCGG GCATCTGCGC GATGGTCGCC GTGGCCTACT TCCGAAAGTG GTCGAAAGAG AACAAATTGC GCCCTGCGAT CGCGAGATGA
|
Protein sequence | MDIMNDATKS GGTGRDLRLD LFRGMANWAI FLDHVPNNVV AWLTMRNYGF SDAAELFVFV SGFTVAFVYS KTLHAKGILA ATAGILGRVW QIYVAYVLLF VFYVVAVGYV AQRYGHAHLL DEYNIRSLIA DPVEFLKHGL LLEYRPLNLD VLPLYIALMA PFPLVLWSLT KAPGVTLAGS IALYAAARSF GWNLPGYPAG YWYFNPFAWQ LLFVIGAWTA TVDRGTLDRT LRSGIVLPLA IAVVAISAIV MLAPLAGNAW LLPEMLRLPF PMADKTNLAP YRIAHFLALA IIVARLVPRN APALAWPVWR PLIVSGQHSL EVFCAGTFFA AIAYFTLDLV DGSVRSQLVV SAAGICAMVA VAYFRKWSKE NKLRPAIAR
|
| |