Gene Information |
Locus tag | RPD_1616 |
Symbol | |
ID | 4022096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 1810567 |
End bp | 1812483 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637961811 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_568754 |
Protein GI | 91976095 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
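The length fields above can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, but its numbers are consistent with both. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for RPD_1616.
length = gene_length_bp(1810567, 1812483)
print(length)                     # 1917
print(protein_length_aa(length))  # 638
```

Both results agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields.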
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.688327 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGTAATGC GTTTTTTTTC AAAGATGCGA TCGCGAAACT ACCTCATCAT TTTGCACGAT GTGATTGCGA CGGCGTGCGC GATTCTGACG AGTTTCTATC TTCGCTTCGA CGGTGAAGAT CTGTCTGCTC GCCTCCCGCT ACTTTGGACG TTGATGCCGG GCTTTTTGCT GCTGAGTGTC GTCGTTTTCT ATGTCTTCAG GCTAACGACG TCGAAATGGC GGTTTGTTTC GCTTCCCGAC GGCATCAATA TTCTTCGCGC AGCAACCGTC CTCACCGTCG TTCTCATCAT TACCGATTAC ATTTTCCTGG CGCCCGACGC TTACGGAACC TTCTTCCTGG GTCGGATTAC CATCGTCCTG TATTGGTTCC TCGAAGTCGC CTTTCTGAGT GCGCCGCGAT TTGCTTACCG TTACTTCCGC TACACACGAA CACGCAATCA GGCTAAGGCG ATCAATGCCG CTCCCACGGT TCTGATCGGC CGCGCAGCCG ATGCCGAGAT TCTGATTCGG GGAATCGAGA GTGGCGCCGT CAAGCAGCTT TGGCCGGTGG GGATTTTATC GCCTTCGGCC GCTGACCGCG GCCAACTCGT TCGAAATATT CCTGTCCTCG GAGGGATTGG CGACCTTCAG GATGTCGTCG CGGACTACGC GAGACGTGGA AAGCCGATCG AGCGCGTGGT AATGACGCCG TCCGCCTTCG AGCCTGACGC CCATGCGGAA TCGATTTTGG TCCGCGCACG AAAACTCGGA CTGCTGGTCA GCCGGTTGCC GTCACTTGAA GAAAGCCGCG ACGTGCCCCG GCTGGCGGCC GTTGCGGTCG AAGATTTGCT GCTCAGGCCG ACTGAGAAGA TCGACTACGG CCGTCTGGAG GCCCTGGTGA AGGGCAAGGC GGTGATCGTG ACCGGAGGCG GCGGCTCGAT TGGCGCTGAA ATCTGCGCCC GAGTCGCAAC CTTCGGCGCC GCAAGATTGC TGGTGATCGA GAACTCCGAA CCGGCGCTCT ACGCGGTCGT CGAAACGCTC TCGGCGACTC TGGTTCAGAT CCGCGTCGAA GGTCGCATCG CGGATATTCG GGATCGCGAG CGTATTCTTG GCTTGATGAA CGAGTTCAAG CCGGATCTGG TGTTCCATGC CGCCGCGCTG AAGCACGTTC CAATTCTGGA GCGCGACTGG AGCGAGGCCG TGAAGACTAA CATTTTCGGA TCCATCAACG TTGCAGACGC GGCCTTGGCC GCGGGTGCGG AAGCGATGGT GATGATCTCG ACCGACAAGG CTATCGAGCC GGTGTCGATG CTGGGGCTCA CCAAGCGTTT CGCGGAGATG TATTGTCAGG CGCTCGATCG CGAACTCTCG GACAAGGGCG CAGGAAAGAC GCCGATGCGG CTGATTTCCG TGCGATTTGG TAATGTGCTG GCTTCGAATG GCTCCGTCGT TCCGAAGTTC AAGGCTCAGA TTGAAGCGGG CGGCCCTGTG ACTGTCACCC ATCCGGATAT GGTGCGGTAC TTCATGACGA TCCGTGAGGC ATGTGATCTG GTCATCACGG CGGCCACGCA TGCTCTGACG TCTGCATGTT CCGACGTGTC GGTTTACGTG CTCAACATGG GGCAGCCGGT ACGGATCGTC GAACTGGCCG AGCGCATGAT CAAATTGTCG GGCCTTGAAC CCGGGCACGA TATCGAGATC GTCTATACGG GCGTGCGCCC AGGTGAGCGC CTGAATGAAA TCCTGTTCGC GCATCAGGAG CCGACGATTG AAATCGGGAT CGCGGGGGTG ATGGCGGCGA AACCCAACGA GCCAAAAATG 
CAGACATTGA GAGACTGGAT CATCTCGCTC GATGAGGCCA TCGCAACCAA TGACCATGCA AGAGTTCAGC TGGTGCTGAA AGATGCGGTT CCGGAGTTCG GCGTCAGCAT GGCGTGA
Protein sequence | MVMRFFSKMR SRNYLIILHD VIATACAILT SFYLRFDGED LSARLPLLWT LMPGFLLLSV VVFYVFRLTT SKWRFVSLPD GINILRAATV LTVVLIITDY IFLAPDAYGT FFLGRITIVL YWFLEVAFLS APRFAYRYFR YTRTRNQAKA INAAPTVLIG RAADAEILIR GIESGAVKQL WPVGILSPSA ADRGQLVRNI PVLGGIGDLQ DVVADYARRG KPIERVVMTP SAFEPDAHAE SILVRARKLG LLVSRLPSLE ESRDVPRLAA VAVEDLLLRP TEKIDYGRLE ALVKGKAVIV TGGGGSIGAE ICARVATFGA ARLLVIENSE PALYAVVETL SATLVQIRVE GRIADIRDRE RILGLMNEFK PDLVFHAAAL KHVPILERDW SEAVKTNIFG SINVADAALA AGAEAMVMIS TDKAIEPVSM LGLTKRFAEM YCQALDRELS DKGAGKTPMR LISVRFGNVL ASNGSVVPKF KAQIEAGGPV TVTHPDMVRY FMTIREACDL VITAATHALT SACSDVSVYV LNMGQPVRIV ELAERMIKLS GLEPGHDIEI VYTGVRPGER LNEILFAHQE PTIEIGIAGV MAAKPNEPKM QTLRDWIISL DEAIATNDHA RVQLVLKDAV PEFGVSMA
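The relationship between the two sequence fields can be illustrated with a small translation sketch. It is written in plain Python rather than against any particular bioinformatics library, and the names `CODON_TABLE`, `translate`, and `gc_percent` are hypothetical. Translation table 11 (bacterial, archaeal and plant plastid) assigns the same amino acids to codons as the standard table and differs in which codons may act as start codons, so the standard codon layout is used here. The sketch assumes the input contains only A, C, G, and T, and it ignores the whitespace that groups the sequence into blocks of ten.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon ordering; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(dna: str) -> str:
    """Remove the whitespace that splits the sequence into blocks and upper-case it."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT input
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from the record.
print(translate("ATGGTAATGC GTTTTTTTTC AAAGATGCGA"))  # MVMRFFSKMR
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` is one way to check the Protein sequence and GC content fields against the nucleotide data.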