Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4163 |
Symbol | |
ID | 3972306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 4625387 |
End bp | 4627318 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637927266 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_534007 |
Protein GI | 90425637 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.737802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.887363 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGGA TCTCCGCTTT TACTTACCGG AACTGGCTGA TCGCGATTCA CGATGCCGGG GTGACGGCGT TCGCGGTGAT CGCCAGTTTT TTCTTGCGGT TCCAAGGCGA GAACTTCACC GAGCGACTGC CGCTGCTGCT GACGATTCTG CCTTATTTCG TGGTTTTCAG CTTTTTCGTC TGTTACTTTT TCCATCTGAC CACCACCAAG TGGCGGTTCA TTTCCGTCCC CGATCTTTTG AACATCCTGC GCGCGGCCAG CGTACTGACG GTCGCGCTGC TGGTGCTCGA TTATATCTTC CTGGCGCCGA ACGTGTTCGG AACGTTCTTT CTCGGCAAGA CCACCATCAT CATCTATTGG TTTCTCGAGG TGTTCTTCCT CAGCGGTTCG CGGTTGGCCT ACCGGCATTT CCGCTACACC CGCACCCGCA ACCAGGCCAA GCTCAAGGAC GCCGCGGCGA CCTTGCTGAT CGGGCGCGCC GCCGACACCG AAGTGTTTCT GCGCGCGGTC GAGAGCGGCG CGGTGAAGCG GCAATGGCCG GTCGGCATCC TGTCGCCGTC CAGTTCGGAT CGCGGGCAGC TGATCCGCGG CATCCCGGTG CTGGGCACGA TCGACGATCT GGCCAACGTG GTCGACGATT TCGCCGGCCG TAACAAGCCG ATCAAGCTCG TGGTGATGAC GCCGTCGGCG TTCGAGAGCG AGGCGCAACC GGAATCCGTG CTGATGCGGG CGCGCAAGTT GGGCCTCGCG GTCAGCCGGT TGCCGTCGCT GGAGGAGAGC CGCGATACGC CGCGGCTGGC GCCGGTGGCG GTGGAGGATC TGTTGCTGCG GCCGAGCGTC AAGATCGACT ACGCCCGGCT GGAGGCGTTC GTGCGCGGCA AATCCGTGGT GGTCACCGGC GGCGGCGGAT CGATCGGTTT CGAGATCTGC GACCGCGTCA CCACCTTCGG CGCCGCGCGG CTGTTGATCA TCGAGAATTC CGAACCGGCG CTGCATGCGG CGATGGAAAC CATTCTCGCC AAGGAGCCCG CGGTGGCGGT CGAAGGCCGC ATGGCCGACG TGCGCGACCG CGATCGCATC CACCAATTGC TGACCGCGTT CAAGCCGGAC ATCGTGTTCC ACGCCGCGGC GCTGAAGCAT GTGCCGATCC TGGAGCGCGA CTGGGCGGAG GGCGTCAAGA CCAACATCTT CGGTTCGGTC AACGTCGCCG ACGCCGCACT GGCGTCGGGC GCCGCCGCGA TGGTGATGAT TTCCACCGAC AAGGCGATCC AGCCGGTGTC GATGCTGGGG CTGACCAAGC GGTTCGCCGA AATGTACTGT CAGGCGCTGG ATCGCCAATT GATGACCGGC GCCGACGGCG GCAAGCCGCC GATGCGGCTG ATCTCGGTGC GGTTCGGCAA CGTGTTGGCC TCGAACGGCT CGGTGGTGCC GAAGTTCAAG GCGCAGATCG AGGCCGGCGG GCCGATCACG GTGACCCATC CGGACATGGT GCGCTACTTC ATGACCATCC GCGAAGCCTG CGATCTGGTG ATCACGGCCG CGACTCATGC GCTCAGTCCG GCGCGGTTCG ATGCCTCGGT CTATGTGCTG AACATGGGGC AGCCGGTGAA GATCGTCGAT CTCGCCGAGC GGATGATCCG GCTGTCCGGG CTGCAGCCCG GCTACGACAT CGAGATCGTG TTCACCGGGG TGCGCCCCGG CGAGCGGCTG AATGAAATCC TGTTCGCCGA GCAGGAGCCG ATCAGCGAGA CCGGCATCGC CGGCATCGTC GCGGCCAAAC CGAACCAGCC GCCGATGGCG TTGCTGCGGC AATGGCTGAC CCAACTCGAG CAGGGCGTCA GCAACGAAAC CTGCTCGGAG ATCTCCGGGG TGTTGAAAGC CGCGGTGCCG GAGTTCGGCG CCGAGGCCGA GCTTCGCACC GCGGCGCAGT GA
|
Protein sequence | MTRISAFTYR NWLIAIHDAG VTAFAVIASF FLRFQGENFT ERLPLLLTIL PYFVVFSFFV CYFFHLTTTK WRFISVPDLL NILRAASVLT VALLVLDYIF LAPNVFGTFF LGKTTIIIYW FLEVFFLSGS RLAYRHFRYT RTRNQAKLKD AAATLLIGRA ADTEVFLRAV ESGAVKRQWP VGILSPSSSD RGQLIRGIPV LGTIDDLANV VDDFAGRNKP IKLVVMTPSA FESEAQPESV LMRARKLGLA VSRLPSLEES RDTPRLAPVA VEDLLLRPSV KIDYARLEAF VRGKSVVVTG GGGSIGFEIC DRVTTFGAAR LLIIENSEPA LHAAMETILA KEPAVAVEGR MADVRDRDRI HQLLTAFKPD IVFHAAALKH VPILERDWAE GVKTNIFGSV NVADAALASG AAAMVMISTD KAIQPVSMLG LTKRFAEMYC QALDRQLMTG ADGGKPPMRL ISVRFGNVLA SNGSVVPKFK AQIEAGGPIT VTHPDMVRYF MTIREACDLV ITAATHALSP ARFDASVYVL NMGQPVKIVD LAERMIRLSG LQPGYDIEIV FTGVRPGERL NEILFAEQEP ISETGIAGIV AAKPNQPPMA LLRQWLTQLE QGVSNETCSE ISGVLKAAVP EFGAEAELRT AAQ
|
| |