Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_2012 |
Symbol | |
ID | 3973875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 2190690 |
End bp | 2192408 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637925121 |
Product | hypothetical protein |
Protein accession | YP_531886 |
Protein GI | 90423516 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.999486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTCCG ACATCACCTT GTCCTCCGCC ACACGCCAGA ATCTGCTCTC GCTGCAGGAT ACGGCGGAAT TGCTGGCAAC CACGCAGAGC CGGCTCTCGA CCCAAAAGAA GGTCAATTCG GCGCTCGACA ACCCGGTCAA CTACTTTGCC GCGCAAGGCC TCAGCGCGCG CTCGGCCGAC CTTCTCAACC TGATGGACGG CATCGCCAAT GGCGTTCAGA CCGTCCAGGC CGCCAACCAA GGCATCACCC GAATCCAGGG CCTGATCGAC GCCGCCAAGA GCGCCGCGCA GCAGGCGCTC GCTGTGCAGA ACACATCGGC CGGATCGACG GCGACAGGAG CCGTGGTGGC TGCCGCCAAT GGAAAGTCGC TGCTTGCGAC GGGCTCGAGC GGCACCGGGG CCGCGGCCGA CGGTACCCAC GACTACAGCG GCGCCAACGG CACCGCGGCC GCTTCGATCT TCAGTGTCAC GGACGGCTTT GGAAATGCCG CGACCGTAAC GCTGGATCGC AACCGCCTGG CTCAGAATGT CAAGGACCTC TCGAAGGTCA CCTCGACCGA GATCGTGTCC CAGATCAATC AGCAGCTTAG CCAGGCCGGT GGTTTTGCGG CCGCCTCGCT GACGACCGAC GGCCGGCTGA CCTTCACATC GACGATGACT GGGACCGATG CGGCGCTGAA GGTCGCCACC CTGACCGGGA ACACGGTCGA TGTCGGCTTT GGCGCCAACG TCGGTACAAC CGTCACCGCG ACCGGCATCG ACTCGACGGA CGGCTCGGCC AAGGCGAGTG TCTCGGGGGC GACCATCACC GCGCTTGGTG CGGCAAGTAA CTTCGATCTC ACGAGCGGCG ACGCCTCCAT CTCCGTGCAG CTCGGAAACG GGCTTACCAA GACCATCAAC CTCAACAAGA CCGCTGATGC GAGTCTCGGC GTGGCCACGC TGAAGGCGCA GGATATCGCG ACCGCCATCA ACAACCAGCT CAAATCCGAC ACCGGAATTT CGGGCAAGGT CGTGGCCACC TACGACAACG TCGCCGGCAC GGTCTCGCTA CGCACCACGG CGGCGGGCGG TGATCAGAAG ATCACGGTCA CCTCGGCCTC GACCAGCACC AAGGATATCG GCTTCGGTAT TGGCGGTGAC ACCACCAAGG CCAGAACAGC GGCCGGCGCA GGCGCGACGG CTGCCAACGG CAACAATCAA CGCGCCGTGT TGGCGCAGCA GTTCAACGAC CTGCTGACGC AGATCTCGCA GACCGCACAG GATGCCCGCT ATCAGGGCAT GAACCTTCTG TATCGCACGG GCAGCGATCC GAAGGAGAAC ACTCTGCACC TGCAATTCAA CGAGAAGGAT TCGAGCTTCC TCGAAATCAA GGGTGTGAAG TTCGACGCCA TGGGTCTCGG GATCACGCAG GTGACCGGCA ATTTTGCGTC GAACGAGGAA ATCAAGACAG CAACCAGTCA GTTGACGAAT GCTGCCTCGA CGCTGCGAAG CCAGGCCTCG ACGTTCGGCT CGAATCTGAC GGTGGTGCAG AACCGGCAGA ACTTCACCAA GCACATGATG AACATCCTCG ACAAGGGTGC ATCCGATCTC ACCAGGGCCG ATCTGAACGA GGAGACAGCG AACCAGCAGG CGCTGTCGCT ACGCAACTCG CTCTCGATCT CCGCGCTGTC GCTCGCCAAC CAATCGCAGC AGAGCATCCT GCAGCTGCTG CGAGGCTAG
|
Protein sequence | MPSDITLSSA TRQNLLSLQD TAELLATTQS RLSTQKKVNS ALDNPVNYFA AQGLSARSAD LLNLMDGIAN GVQTVQAANQ GITRIQGLID AAKSAAQQAL AVQNTSAGST ATGAVVAAAN GKSLLATGSS GTGAAADGTH DYSGANGTAA ASIFSVTDGF GNAATVTLDR NRLAQNVKDL SKVTSTEIVS QINQQLSQAG GFAAASLTTD GRLTFTSTMT GTDAALKVAT LTGNTVDVGF GANVGTTVTA TGIDSTDGSA KASVSGATIT ALGAASNFDL TSGDASISVQ LGNGLTKTIN LNKTADASLG VATLKAQDIA TAINNQLKSD TGISGKVVAT YDNVAGTVSL RTTAAGGDQK ITVTSASTST KDIGFGIGGD TTKARTAAGA GATAANGNNQ RAVLAQQFND LLTQISQTAQ DARYQGMNLL YRTGSDPKEN TLHLQFNEKD SSFLEIKGVK FDAMGLGITQ VTGNFASNEE IKTATSQLTN AASTLRSQAS TFGSNLTVVQ NRQNFTKHMM NILDKGASDL TRADLNEETA NQQALSLRNS LSISALSLAN QSQQSILQLL RG
|
| |