Gene RPB_0886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0886 
Symbol 
ID3909066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1013469 
End bp1015490 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content71% 
IMG OID637882779 
Productglycosyl transferases 
Protein accessionYP_484508 
Protein GI86748012 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGCCG ACCGCGCCGG ACACAACGTT GCGGTGGAGC AGCGCGGCAG GCACGCGGGG 
ACCTCGCCAT GGCCGGGGCG GCCGGGCTTT CTCGCCTGCT GGAACGGGTC CGCCGTCGAT
GCTGCGACCG TGCCGCCGCA GAGTCCCGAA CTGAATTGCC TGCGCGGTGT GCTGCCACCG
AACCTGCTGG AGGCCGCCCG CCGGCGCGCC GGCGAACTCG GAACGGGTGC CGATCAGGTG
CTGATCCGAT GGGGCGTGAT CGACGAAACG ACTTATCTGC ATCGACTGGC CCGGCATCTC
CGGATCGCCC CCGAGGATTT TGCGCAGGTC GGCCGCGACG ACACGCCGTT GTGCGACGAC
CAGATGCGGT TCGCCGCGGC GGCGGGGGTG ATGCCGCTGC GGCAGAACGG CGACCTGGTA
TGGGCGATGG CGCCACGGCG GATGGCGGCG CGCACCTTGT GCGGCGTGCT GCACGATCAT
CCATCGCTGC GCAGCCGGTT TCGCGTCGCC CCCCAGTGCG CGATGCAGCA GTTCCTGCAG
CAGACCGGTC AGGGCCTGGC GCAGCACGCG AGCTTCGGCC TGCAGCGCCG CTATCCCGCG
CTGGCGGTTT CACCCTGCGA TGTCGAGATC GGCTGGCGGA CCGGGCTCAG GCGCGCGGCG
GCCGGTGCGT TGCTGGCCGC CGCCTTGCCG CTGCTGTGCG CCGTCCATTC CGGCGTGGTC
ACGGCGCTGC TGTTTCTCGG CTTCATCGGA TTGCGGCTGG CGGCGAGCCT GCAGCCGCGA
CCGCCCGCAC CACGATCGGC GCGCCGGCCG GACGACGCAT TGCCGATCTA CACCGCGATC
GCGGCGCTGT ATCGCGAGGC GGCGTCGGTG GCGTCGCTGG TCGAAGCGAT CGAGGCGCTG
GACTATCCGC GCGAGAAACT TGACATCATT CTGGTCGTCG AGCTCGACGA TCTCGCCACC
CGCGCCGCGA TCGCGCGGCT CGGGCCGCGA CCACATCTGC GCGTGCTGAT TGCGCCGGCG
GTCGGGCCCA GGACCAAGCC GAAGGCGCTC AACTACGCGC TGCCGTTCGT GCGCGGCGGC
ATGGTGACGG TGTTCGACGC CGAGGACCGC CCGGAGCCCG ATCAGCTTCG CGCCGCGCTC
GACGCCTTCG CGCGCGGCGG GCCGACGACC GGCTGCGTCC AGGCCGGCCT GTGCATCGAC
AACATCACCC ATAGCTGGCT GTCGCGGCTG TTCCTCGCCG AATATGCCGG CCAGTTCGAG
GCGGTGCTGC CCGGCCTGAC GCGACTGGGT CTGCCGCTGC CGCTCGGCGG CTCGTCGAAT
CACTTCCGCA CCGCCGTGCT GCGCGAGGTG GGCGGTTGGG ATGCTTACAA CGTGACGGAG
GATGCCGATC TCGGCTTCCG GCTGGCGCGG TTCGGCTACA GCGCCATCAG CTTCGACTCC
CGCACCTTCG AAGAGGCGCC GATCGGCCTC GCCGCGTGGC TCGGCCAGCG CACCCGCTGG
ATGAAAGGCT GGATGCAGAC CTGGTGCGTG CACATGCGCC GGCCCCGGCT GTTCTGGCGC
GACGCCGGCT GGCGCGGCGT GCTGGCGCTG AACCTGTTCG TCGGCGGCAG CGTGCTGTCC
GCCCTGATCC ATCCGCTGCT GCTCCTGGAC CTCGCCACGA CAGGGCTCGC GCTCGCGCAG
GGCGAGCCGC TGTCCCCGCC TTCGCCGTGG GCCTCGCTCC ACGGTCTGGC CGTCGCCGCC
GGATACGTCG GCAGCGCGGT CGTCGCTGCG ATCGGCCTGA AGCGGATCGG TCGGCTGCAC
GATGCGGCCT GGCTGCTGCT GATGCCGCTG TACTGGATCT GCCTGTCGAT CGCGGCCTGG
CGCGCGCTCG GCGAACTGGT GTGGAAGCCG CATCATTGGC AGAAGACCGA GCACGGCGTC
GCCGCGCGTG CCGCCCCTTC GCCGAAGGCC GTCGGGAAAA CGCTCGTCAG AGATAGCGCT
TCAGATCCGC GGCGGCCTCT TCGGGCTTCC GCTTCATGTT GA
 
Protein sequence
MVADRAGHNV AVEQRGRHAG TSPWPGRPGF LACWNGSAVD AATVPPQSPE LNCLRGVLPP 
NLLEAARRRA GELGTGADQV LIRWGVIDET TYLHRLARHL RIAPEDFAQV GRDDTPLCDD
QMRFAAAAGV MPLRQNGDLV WAMAPRRMAA RTLCGVLHDH PSLRSRFRVA PQCAMQQFLQ
QTGQGLAQHA SFGLQRRYPA LAVSPCDVEI GWRTGLRRAA AGALLAAALP LLCAVHSGVV
TALLFLGFIG LRLAASLQPR PPAPRSARRP DDALPIYTAI AALYREAASV ASLVEAIEAL
DYPREKLDII LVVELDDLAT RAAIARLGPR PHLRVLIAPA VGPRTKPKAL NYALPFVRGG
MVTVFDAEDR PEPDQLRAAL DAFARGGPTT GCVQAGLCID NITHSWLSRL FLAEYAGQFE
AVLPGLTRLG LPLPLGGSSN HFRTAVLREV GGWDAYNVTE DADLGFRLAR FGYSAISFDS
RTFEEAPIGL AAWLGQRTRW MKGWMQTWCV HMRRPRLFWR DAGWRGVLAL NLFVGGSVLS
ALIHPLLLLD LATTGLALAQ GEPLSPPSPW ASLHGLAVAA GYVGSAVVAA IGLKRIGRLH
DAAWLLLMPL YWICLSIAAW RALGELVWKP HHWQKTEHGV AARAAPSPKA VGKTLVRDSA
SDPRRPLRAS ASC