Gene Information |
Locus tag | RPC_1833 |
Symbol | |
ID | 3971713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1987592 |
End bp | 1988995 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637924946 |
Product | hypothetical protein |
Protein accession | YP_531711 |
Protein GI | 90423341 |
COG category | |
COG ID | |
TIGRFAM ID | |
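
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1987592, 1988995)
print(length)                     # 1404
print(protein_length_aa(length))  # 467
```

Under these assumptions the results agree with the Gene Length (1404 bp) and Protein Length (467 aa) rows.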
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000567311 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.596658 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA CATCGACCGA GCGTCTCAGA GACTATCTCG CGCAGCTGCC GCCGCAGGCC CAGGCGCTGC TGATCCGGGA ATTTGAGCGC TCCATCGAAC GCGGCGAAGA CGCCACCGTC GCCAATTTCG TGCTCGGCCA GTTGCGCCAG ATCGTCCGCG GCACCGACGA AGAGGCGCCG GCACGCACCG ACGACCCGGC ACGGCTGCTG TTCCGACCGC TCGAGCCGTT CCTGGTCGAA CATTCCGGCG CGGTTCGGCC CGGTCAGATC CGCCGCGCCT CGCTGCTGCC GGTTTGGCAA TGGCTCGGGC GCGACGGCGC ACCGCAGGCG GTGGCCGAAT TCGAGGCGAC GCTGGCGCGG CTGCGTGGCG GCGGGACATC CTCGGAGATC GCTCACGCCG CGCGGAAGCT ACAGATGGCC GCCGCCGACG CGATCGCCCA GGCGACCTCG GTGGCCCCCG GCGGCGACAA TCAGCGCATG CTGGCTCGGA TCGGTTCGGC TGCGGTGGTG GAAGACCTGC GCGCGGTCGG CGCGGTGTTG AAGAATCGAG ACGCGCTCGA CGCATTGTCC GACAAGTTGC CCGGCGGCCT CGGGGCATTC GGCGATGCCC AGGTCAATTC GGTGATCGCC ACGCTCAACG TTCCCTCGCT GCAGACGCCG CTGGTGTTGC CGTTCGCGCT GTCGTTGGTG ATGCAGCGGC TGTCGGCGCC CTGGCAGATC ATCCGGATCG CGGTGCGGAT CGCCAGTTCC GACGACGAGG TCAGGGTCGC GGCAATTCCT CTTGGCGTCG CCGTCACCAT GGCGCTGCAC GATCTGGCGC ATCTGGTCGC CGACCTGCGC GCGCAGATCA AGCGCGGCCA CTTCGAGAAC TTCGCCGAAC GCCTGAAGCT GGTGCACGAT GGCTTGCGCG GCGTGCGCAC CGAACTCGAT CTGCGCAACG ACTCGGTGTG GGGTCGGCAA ATGGCCGCGA TCAGGGTCGA CATCTCCAAT TCGCTGCAAT CGGAGATCGA GAGCGTGCCG GGCCGGGTTC GCCGGCTGCT GCGGCAGCGC CCCGACAAGG ACATCGCGGC GACCACCAAG ATCGATCCCA GCGAAATCGA CGAGGTGCTG GCGCTGATCG CCTTCGTGGC GGTCTGCCGC ACCTATGCCA GCGAACTGGC GATCAACGAG GTGACGCTGC GCACCTATTC CGACCTGCAG CAATATGTCG AAAAATCCAC CGAGGCGCTG GTGCAGGCGT TGCGCGGCGC CGATGCGAAA GTGCGCGGGT TCCGGCAGAT GCAGGTCAAG GCGGCGATCC GGTTCTGCGA GGCGCTGTTC GGCCAGGACT ACGCCTCGCT GATGAGCCGG GCGGCGGACA ACGCCATGGT GGTGGTGGAG CGCAAGCCGA GCCGGGCGGG CTGA
Protein sequence | MSQTSTERLR DYLAQLPPQA QALLIREFER SIERGEDATV ANFVLGQLRQ IVRGTDEEAP ARTDDPARLL FRPLEPFLVE HSGAVRPGQI RRASLLPVWQ WLGRDGAPQA VAEFEATLAR LRGGGTSSEI AHAARKLQMA AADAIAQATS VAPGGDNQRM LARIGSAAVV EDLRAVGAVL KNRDALDALS DKLPGGLGAF GDAQVNSVIA TLNVPSLQTP LVLPFALSLV MQRLSAPWQI IRIAVRIASS DDEVRVAAIP LGVAVTMALH DLAHLVADLR AQIKRGHFEN FAERLKLVHD GLRGVRTELD LRNDSVWGRQ MAAIRVDISN SLQSEIESVP GRVRRLLRQR PDKDIAATTK IDPSEIDEVL ALIAFVAVCR TYASELAINE VTLRTYSDLQ QYVEKSTEAL VQALRGADAK VRGFRQMQVK AAIRFCEALF GQDYASLMSR AADNAMVVVE RKPSRAG
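
The relationship between the gene sequence, the protein sequence, and the Translation table and GC content fields can be sketched as follows. This is a minimal illustration, not the method the database used. It builds the codon table from the standard NCBI base ordering (TCAG); translation table 11 shares its amino acid assignments with the standard code and differs only in the permitted start codons, which this sketch does not model. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: first, second, third base each cycle through TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGAGCCAGA CATCGACCGA GCGTCTCAGA"))  # MSQTSTERLR
# Toy input to show the GC calculation.
print(gc_fraction("ATGC"))  # 0.5
```

Passing the full gene sequence to `translate` and `gc_fraction` would let a reader compare the results with the Protein sequence and GC content rows. For production use, an established library such as Biopython is a more robust choice than this hand-rolled table.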