Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_0998 |
Symbol | |
ID | 4021473 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1126892 |
End bp | 1128952 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637961189 |
Product | glycosyl transferase family protein |
Protein accession | YP_568137 |
Protein GI | 91975478 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.759416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.641778 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGTCG GCGGCCGCGG CGGACACAGC GATGCGTTGG GGCGGCGACG CAGCGAAGAT CGGGGATCTT CGTCATGGTC GGCACGGCCG GGCTTTCTCG CATTCTGGCG GACGCCGCAC GCCTGCGCGG ACGGGGCAAG GCATCGCGTC CACGACGCTG CGACCGAGCT GGACTGTCTG CGCGGGGTGC TTGCGCCGGC ACTGTTGCGG GCCGCCGAAT GCCGCGCGCG CGAACTGGAC GTCGGGGCGG AGCGTGTCCT GATCCAGTGG GGGATGATCG ACGAGGAGGC CTACCTTCGC CGCCTCGCTT TTCATCTCGA TCTTCCGCTG GCTGATCTGT CGACCGCCGA CCGCGCCGAC TGCCCATCTT CGGATCGCCA GATTGCGGCC GCGGCGGAAA CCGGACTCAT TCCATTGCGG CAGGACGGCG AACTGGTCTG GGTGCTGGCG CCGACGATCC GGCACACGGC GCGAACGCTG TGCCGGGTGC TCGACCGGCT TCCGGACCTG CGCGGGCGGC TGCGGCTGAC CTCGGCCGCT TCGCTGCAGC GATTTCTGAT GCAGCAGGGC CGCGACGCGA TCGCCGACGC AGCGACCGGC GATCTGCAGC AGCGATTTGC GGCGATGTCG GCGGCGCCGG GGCACGCCGC GGGTCCGGTA TGGCGGCAGC GGCTGCGCCG CTTCGCAGGC CTGCTCGGAT TGGCGATGCC GGCGATGATC GCGCCCGGTC TCGTCGCGAA CCTGCTGGCG GTGTGGTTCA TGGGGTTCGC GACGCTGCGA CTGGCGGCGT GCTTCTGGCC GCGCGCGGCG CAGCGGCCGC TGCGGCGGCG GCCCGACGCG ACGCTGCCGA TCTATACCGT GGTGGCGGCG CTGCATCGGG AGGAACGCTC GGTCGCAGGG CTGGTCGCGG CGATCGAGGC GCTGGACTAT CCGCGCGAGA AGCTCGATGT CATCCTCGTC ATCGAACCCA ACGATCTCGC CACCCGCGCG GCGATCGCCC GGCTCGGACC GCGGCCCCAT CTGCGTGTCC TGATCGCGCC ACCGGTCGCG CCCCAGACCA AACCGAAGGC GCTGAACTGC GCGCTGGCGT TCGCGCGCGG CAGCTTCATC GCGGTGTACG ACGCCGAGGA TCAGCCGGAG CCCGGCCAGT TACGCGCCGC GCTCGACGCC TTCGACCGCC ACGGCGCGAC CACCGCCTGC GCGCAGGCCA GCCTGTGCAT CGACAACATC ACTCATAGCT GGCTGTCGCG CACCTTCGCC GCCGAATATG CCGGGCAGTT CGACCGGTTG CTGCCCGGCC TGTCCGAAAT GAACCTGCCG CTGCCGCTCG GCGGCACCTC GAACCACTTC CGCACCGACG TGCTGCGCGC GATCGGCGGC TGGGACCCCT ACAACGTCAC CGAGGACGCC GATCTCGGCT TCCGGCTGGC GCGGTTCGGC TACCGCTCGG TCAGCTTCGC GTCGACCACC TATGAGGAAG CACCGATTAC TTTCGACAAT TGGCGGCGGC AGCGCGCGCG CTGGATGAAG GGCTTCATCC AGACCTGGCT GGTGCATATG CGCCATCCGC TGCGGTTGTG GCGCGACATC GGCCCGCGCG GCGTGCTCGC GCTGAATCTG ATCGTCGGCG GCAATCTGCT GACCGCGCTC GTCCACCCGC TGTTCCTGGG CATCGCCCTC GCCTCGCTCG CAGGCGCATG GCTCGAGTTG CCGGCCGTGC TGCAGCCGTC GCCGCCATCG CCGCTGCATT GGCTGGCGAT CGCGGCCGGC TACGCCTCGA CCGTCGTGGT CGGCCTGCGC GGCCTGGCCG GACGCCGGCA ATTGCGGCTG GGCTTCGTCC TGCTGCTGAC GCCGGCCTAT TGGATCTGCC TGTCGATCGC GGCCTGGTGC GCGGTGGCGC AGTTTGTCTG GCGGCCTTAT TACTGGGAGA AGACCGTCCA CGGCGTCGCA AAGCGAGCCA AGGCGCCGTT GCCGGGGGTC GCGGCCGGGC CGGCGATACG CCGAGCTACA AATAGCGTTT CAGATCCGCG GCGGCTTCTT CGGGCTTCCG CTTCATGTTG A
|
Protein sequence | MAVGGRGGHS DALGRRRSED RGSSSWSARP GFLAFWRTPH ACADGARHRV HDAATELDCL RGVLAPALLR AAECRARELD VGAERVLIQW GMIDEEAYLR RLAFHLDLPL ADLSTADRAD CPSSDRQIAA AAETGLIPLR QDGELVWVLA PTIRHTARTL CRVLDRLPDL RGRLRLTSAA SLQRFLMQQG RDAIADAATG DLQQRFAAMS AAPGHAAGPV WRQRLRRFAG LLGLAMPAMI APGLVANLLA VWFMGFATLR LAACFWPRAA QRPLRRRPDA TLPIYTVVAA LHREERSVAG LVAAIEALDY PREKLDVILV IEPNDLATRA AIARLGPRPH LRVLIAPPVA PQTKPKALNC ALAFARGSFI AVYDAEDQPE PGQLRAALDA FDRHGATTAC AQASLCIDNI THSWLSRTFA AEYAGQFDRL LPGLSEMNLP LPLGGTSNHF RTDVLRAIGG WDPYNVTEDA DLGFRLARFG YRSVSFASTT YEEAPITFDN WRRQRARWMK GFIQTWLVHM RHPLRLWRDI GPRGVLALNL IVGGNLLTAL VHPLFLGIAL ASLAGAWLEL PAVLQPSPPS PLHWLAIAAG YASTVVVGLR GLAGRRQLRL GFVLLLTPAY WICLSIAAWC AVAQFVWRPY YWEKTVHGVA KRAKAPLPGV AAGPAIRRAT NSVSDPRRLL RASASC
|
| |