Gene RPD_0998 details

Gene Information

Locus tag: RPD_0998
Symbol:
ID: 4021473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 1126892
End bp: 1128952
Gene Length: 2061 bp
Protein Length: 686 aa
Translation table: 11
GC content: 70%
IMG OID: 637961189
Product: glycosyl transferase family protein
Protein accession: YP_568137
Protein GI: 91975478
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
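
The start, end, gene length and protein length fields of this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
start_bp = 1126892
end_bp = 1128952


def gene_length(start, end):
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2061 686
```

Both results agree with the Gene Length and Protein Length fields listed in the record.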


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.759416
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.641778
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTCG GCGGCCGCGG CGGACACAGC GATGCGTTGG GGCGGCGACG CAGCGAAGAT 
CGGGGATCTT CGTCATGGTC GGCACGGCCG GGCTTTCTCG CATTCTGGCG GACGCCGCAC
GCCTGCGCGG ACGGGGCAAG GCATCGCGTC CACGACGCTG CGACCGAGCT GGACTGTCTG
CGCGGGGTGC TTGCGCCGGC ACTGTTGCGG GCCGCCGAAT GCCGCGCGCG CGAACTGGAC
GTCGGGGCGG AGCGTGTCCT GATCCAGTGG GGGATGATCG ACGAGGAGGC CTACCTTCGC
CGCCTCGCTT TTCATCTCGA TCTTCCGCTG GCTGATCTGT CGACCGCCGA CCGCGCCGAC
TGCCCATCTT CGGATCGCCA GATTGCGGCC GCGGCGGAAA CCGGACTCAT TCCATTGCGG
CAGGACGGCG AACTGGTCTG GGTGCTGGCG CCGACGATCC GGCACACGGC GCGAACGCTG
TGCCGGGTGC TCGACCGGCT TCCGGACCTG CGCGGGCGGC TGCGGCTGAC CTCGGCCGCT
TCGCTGCAGC GATTTCTGAT GCAGCAGGGC CGCGACGCGA TCGCCGACGC AGCGACCGGC
GATCTGCAGC AGCGATTTGC GGCGATGTCG GCGGCGCCGG GGCACGCCGC GGGTCCGGTA
TGGCGGCAGC GGCTGCGCCG CTTCGCAGGC CTGCTCGGAT TGGCGATGCC GGCGATGATC
GCGCCCGGTC TCGTCGCGAA CCTGCTGGCG GTGTGGTTCA TGGGGTTCGC GACGCTGCGA
CTGGCGGCGT GCTTCTGGCC GCGCGCGGCG CAGCGGCCGC TGCGGCGGCG GCCCGACGCG
ACGCTGCCGA TCTATACCGT GGTGGCGGCG CTGCATCGGG AGGAACGCTC GGTCGCAGGG
CTGGTCGCGG CGATCGAGGC GCTGGACTAT CCGCGCGAGA AGCTCGATGT CATCCTCGTC
ATCGAACCCA ACGATCTCGC CACCCGCGCG GCGATCGCCC GGCTCGGACC GCGGCCCCAT
CTGCGTGTCC TGATCGCGCC ACCGGTCGCG CCCCAGACCA AACCGAAGGC GCTGAACTGC
GCGCTGGCGT TCGCGCGCGG CAGCTTCATC GCGGTGTACG ACGCCGAGGA TCAGCCGGAG
CCCGGCCAGT TACGCGCCGC GCTCGACGCC TTCGACCGCC ACGGCGCGAC CACCGCCTGC
GCGCAGGCCA GCCTGTGCAT CGACAACATC ACTCATAGCT GGCTGTCGCG CACCTTCGCC
GCCGAATATG CCGGGCAGTT CGACCGGTTG CTGCCCGGCC TGTCCGAAAT GAACCTGCCG
CTGCCGCTCG GCGGCACCTC GAACCACTTC CGCACCGACG TGCTGCGCGC GATCGGCGGC
TGGGACCCCT ACAACGTCAC CGAGGACGCC GATCTCGGCT TCCGGCTGGC GCGGTTCGGC
TACCGCTCGG TCAGCTTCGC GTCGACCACC TATGAGGAAG CACCGATTAC TTTCGACAAT
TGGCGGCGGC AGCGCGCGCG CTGGATGAAG GGCTTCATCC AGACCTGGCT GGTGCATATG
CGCCATCCGC TGCGGTTGTG GCGCGACATC GGCCCGCGCG GCGTGCTCGC GCTGAATCTG
ATCGTCGGCG GCAATCTGCT GACCGCGCTC GTCCACCCGC TGTTCCTGGG CATCGCCCTC
GCCTCGCTCG CAGGCGCATG GCTCGAGTTG CCGGCCGTGC TGCAGCCGTC GCCGCCATCG
CCGCTGCATT GGCTGGCGAT CGCGGCCGGC TACGCCTCGA CCGTCGTGGT CGGCCTGCGC
GGCCTGGCCG GACGCCGGCA ATTGCGGCTG GGCTTCGTCC TGCTGCTGAC GCCGGCCTAT
TGGATCTGCC TGTCGATCGC GGCCTGGTGC GCGGTGGCGC AGTTTGTCTG GCGGCCTTAT
TACTGGGAGA AGACCGTCCA CGGCGTCGCA AAGCGAGCCA AGGCGCCGTT GCCGGGGGTC
GCGGCCGGGC CGGCGATACG CCGAGCTACA AATAGCGTTT CAGATCCGCG GCGGCTTCTT
CGGGCTTCCG CTTCATGTTG A
 
Protein sequence
MAVGGRGGHS DALGRRRSED RGSSSWSARP GFLAFWRTPH ACADGARHRV HDAATELDCL 
RGVLAPALLR AAECRARELD VGAERVLIQW GMIDEEAYLR RLAFHLDLPL ADLSTADRAD
CPSSDRQIAA AAETGLIPLR QDGELVWVLA PTIRHTARTL CRVLDRLPDL RGRLRLTSAA
SLQRFLMQQG RDAIADAATG DLQQRFAAMS AAPGHAAGPV WRQRLRRFAG LLGLAMPAMI
APGLVANLLA VWFMGFATLR LAACFWPRAA QRPLRRRPDA TLPIYTVVAA LHREERSVAG
LVAAIEALDY PREKLDVILV IEPNDLATRA AIARLGPRPH LRVLIAPPVA PQTKPKALNC
ALAFARGSFI AVYDAEDQPE PGQLRAALDA FDRHGATTAC AQASLCIDNI THSWLSRTFA
AEYAGQFDRL LPGLSEMNLP LPLGGTSNHF RTDVLRAIGG WDPYNVTEDA DLGFRLARFG
YRSVSFASTT YEEAPITFDN WRRQRARWMK GFIQTWLVHM RHPLRLWRDI GPRGVLALNL
IVGGNLLTAL VHPLFLGIAL ASLAGAWLEL PAVLQPSPPS PLHWLAIAAG YASTVVVGLR
GLAGRRQLRL GFVLLLTPAY WICLSIAAWC AVAQFVWRPY YWEKTVHGVA KRAKAPLPGV
AAGPAIRRAT NSVSDPRRLL RASASC
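
The sequences above are printed in blocks of 10 characters, 60 per line. The following is a minimal sketch of how such a listing could be cleaned up, its GC fraction computed, and the DNA translated for comparison with the protein sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments are the same as in translation table 11; the sketch does not handle alternative start codons, which is not an issue here because the gene starts with ATG. Only the first line of the gene sequence is used as input; the full listing can be pasted in the same way.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGCGGTCG GCGGCCGCGG CGGACACAGC GATGCGTTGG GGCGGCGACG CAGCGAAGAT"
dna = clean_sequence(first_line)
print(len(dna))        # 60
print(translate(dna))  # MAVGGRGGHSDALGRRRSED
```

The translated residues match the first 20 residues of the protein sequence listed above.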