Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1609 |
Symbol | |
ID | 4022089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1803355 |
End bp | 1804368 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961804 |
Product | UDP-glucose 4-epimerase |
Protein accession | YP_568747 |
Protein GI | 91976088 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.647452 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTGC TCGTGACCGG CGGCGCCGGA TACATCGGAA GTCACACCGT GCTGGCGCTG GTCGAGGCCG GCGAAAGCGT CGTGGTGATC GACAATCTGA GCACCGGCTT TTCCAGCTTC ATCCCGGAAG GCGTGCCGCT GTTCATCGGC GATGCCGGCG ACGAGAATCT GGTCGAGGGC GTGATCAGGA ATCACGACGT CGACGCCATC ATTCATTTCG CCGGATCGGT GATCGTGGCG GACTCGATGC GCGATCCGCT CGCTTACTAT CGCAACAACA CCATGACCTC GCGCAACCTG CTGAGCGCAG CAGTGACGTG CGGCGTGAAG AACTTCATCT TCTCCTCGAC CGCAGCGGTC TACGGCAATC CCGACCGCAC GCCGGTCCCG GAAGAAGCGC CGACCCGACC GCTGTCGCCC TATGGCTGTT CCAAGCTGAT GACCGAGATC ATGCTGCACG ACACCGCGTC GGCCTGCGGC ATGAACTACG TGGCACTGCG CTACTTCAAC GTCGCAGGAG CCGATCCGCA GGCGCGGATC GGGCTCGCAA CCGCCGGAGC GACGCATCTG ATGAAGATCG CGGTCGAAGC CGCGACCGGC CAGCGCCCGC AGGTCGAGAT CTATGGCGCC GACTATCCGA CGCCCGACGG AAGCTGCATC CGCGACTTCA TTCATGTCAG CGACCTCGCA CAGGCCCACG GCGCGGCGCT CGGCTATCTG CGCCAGGGCG GCGCGCCGGT GACGCTGAAC TGCGGCTACG GCCGCGGCTA CTCGGTGCTG CAAACCATTG AGGCGGTGCG GCGCGTCGCG GGACGCAATT TCGCGGTGTC CACCGCCGCC CGTCGGCCCG GCGACATCGT GGCGATGGTC GCCGACACGC GCCGGATACG CGCGACTCTG GACTGGACGC CGCGCTACGA CGACCTCGAC ACCATCGCGG CGGATGCGCT GGGATGGGAG CGCAAGCTGG TCGCGCAGCG CCAAGGCTTT GAACGGCAAG CGATTCCAGC TTAA
|
Protein sequence | MTVLVTGGAG YIGSHTVLAL VEAGESVVVI DNLSTGFSSF IPEGVPLFIG DAGDENLVEG VIRNHDVDAI IHFAGSVIVA DSMRDPLAYY RNNTMTSRNL LSAAVTCGVK NFIFSSTAAV YGNPDRTPVP EEAPTRPLSP YGCSKLMTEI MLHDTASACG MNYVALRYFN VAGADPQARI GLATAGATHL MKIAVEAATG QRPQVEIYGA DYPTPDGSCI RDFIHVSDLA QAHGAALGYL RQGGAPVTLN CGYGRGYSVL QTIEAVRRVA GRNFAVSTAA RRPGDIVAMV ADTRRIRATL DWTPRYDDLD TIAADALGWE RKLVAQRQGF ERQAIPA
|
| |