Gene Information |
Locus tag | RPD_0468 |
Symbol | |
ID | 4020936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 539674 |
End bp | 541113 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637960655 |
Product | hypothetical protein |
Protein accession | YP_567607 |
Protein GI | 91974948 |
COG category | [S] Function unknown |
COG ID | [COG4223] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.252316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCAAGA ACAGGCCCGG ACACGAAAAT ACGCAGCACG AAAAGGACCC GCGCGAGAAC GCGGCGTCCG TTGGCGCAAG TGATGCGCAT CCGGAACAAG CCGCCGAGGT GGATGTGGAC GCGCCGATCG AGGCGCTGGC CCTGAACCCC GTCGAGCCGA CCGCGGAGGA GCTTCGCGAC GATCCGGCGC TGGAGCTATC CGCAGAGCAT CGGCAGCAGG ACGTGATCGA CGCCGAGCCG CTGCCGGAGA CGGCGACTGA CGATCCTCCC GAGACCCGGT TCGCCTCCAC ATCGGACAAA GCCGAGGAAG CCGCGGCGCG GTCTGCGGCC GCCGCCGAGC CGCGCCGGCC AGGCATCGTC GCGGCGATGC TGCCGCCGGT GCTGGCGGTC GCGATCGCCG CCGCGGTGGT CGTCGGCGCC GCCAAGACCG GGCTGCTGCC GCAGTTCCTG TCCTCGACCA GCGTGAGCGC GCCGGAGGGC GACGTTGCGG CGATCGATGC GCTGAAGGCG CGGATCGCCG ACCTCGAAGC GCGGCCGACG CCGACCGGTT CGAACACCGC CGCCGCAACG CCTGCCGCTG CAGCCGATCC GGCGCTGGCC GGCAAGGTCG ACGCGCTGGA GAAGACCGTC GCCGCGCTGC GCGACGACCT CGCCACGCTC CGCGACCGAT CCGAACAGCT CGCCAGCGCG TTGAAGGAGG TCAAGGCCGC GCCGTCCGAG CCTACCGCGA CCGCAAGCGA ACCGCCGGCG ATGGCCTCGA CCGACAAGAC CCCGCCCGAC AAGGCCGCTG CCGACAAGGC CGCGACCGAT TCGGCGGCGG CGCTGGCCGC GATCAACGCC CGCTTGACCG AGCTCGAACA CGCCGCCAAG ACCGCGACCG AGGCCGCCCC GCAAGCGCCG CAGCCCGCGG TCGTGTCCGA CGACGCGCCG CTGCGCCGTC TCGTCACCGC GACCATGCTC GATCTGACGG TGAAGCAGGG CGCGCCCTAT GCGGCGATCC TGAAGGCCGC CGAGCCGCTC GCCACCGAAA CGGGAGCGCT GAAGCCGCTG GAGCCGTTCG CGGCGACCGG GGTTCCCGCC GCCGCCGCCC TCGGCCGCGA GCTGATCGCC CTGCTGCCGA AGCTGTTGTC GGGCGCCGAG GGCGCCAGCA ACGCGAATTT CATCGACCGC TTCCAGTCCA ATGCGGAGCG GCTGATCCGG ATCCAGCGTT CCGACGCCAC CGCCGGGATC GATCGCACCG CGATCGTCGG CCGCATTACC GCGGCGGCGC AGCGCGGCGA CCTTGCCGAG GCGCGGCGCG AGCTGAAAGC GCTTGCGCCG GCCGACCGCG CTCCTGTTCA ATCCTGGATC GACAAATCCG AGGCGCGCGA CCAAGCTCTC GCCGCCTCGC ATTCCTTCGC CACCGCTGCG CTAGCCGCGC TGCAGAAACC GTCGCCATAG
|
Protein sequence | MVKNRPGHEN TQHEKDPREN AASVGASDAH PEQAAEVDVD APIEALALNP VEPTAEELRD DPALELSAEH RQQDVIDAEP LPETATDDPP ETRFASTSDK AEEAAARSAA AAEPRRPGIV AAMLPPVLAV AIAAAVVVGA AKTGLLPQFL SSTSVSAPEG DVAAIDALKA RIADLEARPT PTGSNTAAAT PAAAADPALA GKVDALEKTV AALRDDLATL RDRSEQLASA LKEVKAAPSE PTATASEPPA MASTDKTPPD KAAADKAATD SAAALAAINA RLTELEHAAK TATEAAPQAP QPAVVSDDAP LRRLVTATML DLTVKQGAPY AAILKAAEPL ATETGALKPL EPFAATGVPA AAALGRELIA LLPKLLSGAE GASNANFIDR FQSNAERLIR IQRSDATAGI DRTAIVGRIT AAAQRGDLAE ARRELKALAP ADRAPVQSWI DKSEARDQAL AASHSFATAA LAALQKPSP
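
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TAG). The function name `cds_lengths` is hypothetical and is not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the final codon is a stop codon that
    is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the record: Start bp 539674, End bp 541113.
gene_len, protein_len = cds_lengths(539674, 541113)
print(gene_len, protein_len)  # 1440 479
```

Under these assumptions the result agrees with the Gene Length (1440 bp) and Protein Length (479 aa) fields listed above.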