Gene RPB_3839 details


Gene Information

Locus tag: RPB_3839
Symbol:
ID: 3911642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4384806
End bp: 4386140
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 69%
IMG OID: 637885739
Product: glycosyl transferase, group 1
Protein accession: YP_487443
Protein GI: 86750947
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
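
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 4384806
end_bp = 4386140

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1335
print(protein_length)  # 444
```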


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.24097
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCGCAGC CCCCCGACAG CCAGACCCCG CCGCCGCGAC AACCCTGGCT GTGGATGGAC 
GTATCGACCA GCGCTCGCGC GCGCTCCGGC CAGATGAACG GCACGCTGCG GGTCGAGCAC
AGAATCGCCA CCGCGTTGCG CGAACGGATC GGACCGCAGC TCGGATTTTG CCGCTACGAA
CCGCTGCGAC AGGACTACGT GCCGGTTGCG GCGGTGCCCG ATCTCGGCGC CAAGCCGGTC
GCCGCGGCGA AGCCGAAGGC GGCGCGTGCC ACGATGCTGT CGTCGATCAA GCCGCTCGGC
AAGAAACTCG AACGCGCGGT TCGTACTTCG GTGCGTGGCG CTGTCGCGCC GCTGCTGCAG
AAAATGGCCG GCGGCGAGGG GCTGCCGCCG CCCGGCCCGG CTGCGGGCCA CGAGGTGCTG
CTGCTCGCCG GCGAGAACTG GTCGCGGGTC GACTACGCAG CGGTGGCGCG GATGCGCCGC
GCGCGCGGAA CCCGGATCGC GGCGGTGTGC CAGGACTTCA TCCCGGTGGT CGCGCCGCAA
TTCTTCGCCG ACGGCGAGTT CGTCACGCAG TTCGAGGCCT ATGCGCAGTT CCTGATCCGC
GAATGCGACA TGGTGATTTC GATTTCCGAC TCCACCAGCG CCGACGTGCG CGCCTATGCG
CAGCGCCATG GCGGGTTGCG CGGCGCGATC GAGGTGGTGC ATCTGGGCGC CGATCTGGCG
ACGCCGGCGA CCGCGCGCCG GCCCGCTGCG CTCGGCGACG GGCAGGCGCA GCGCTTCGTG
CTCAGCGTCT CGACCATCCA GTCGCGCAAG AATTTCGATC TGCTGTATCA CTTGTGGCGG
CGCTTGACCG AGCAGGGCAC GCCTCGTCTG CCGACGCTGG TGATCGTCGG GCAGCCCGGC
TTCGGCAGTG CCGACCTGCT GTGGCAGATC GCGCATGATC CGGTCACCGC GTCGTCGATC
CTGCATCTGC CTCGCGCTGG CGACGAGGAA CTGGCGTGGC TGTACCGGAA CTGCGCGTTT
ACGCTGTATC CCTCGTTCTA CGAGGGCTGG GGACTGCCGG TGTCGGAGAG CCTCGCCTTC
GGCAAGTACT GTCTCGCATC GAACACGTCG TCACTGCCGG AAGCCGGCGC CGGCCTCGCG
GGGCACCTCG ATCCGCTGGA TTTCGCAGCC TGGCGCGATG CGGTGCTTGA CCTGATCCAT
TCCCCTGAGC AACTTGCCGG ATACGAAGCC GCGATCCGGT CGAACTATCG CCCGGTAACC
TGGGCGCAAT CCGCCGGCCG GATGGTGGAG GTGCTGCGCA GCGTAGCGGC TGCCCCTGCC
GGTTTCACGA GCTAG
 
Protein sequence
MAQPPDSQTP PPRQPWLWMD VSTSARARSG QMNGTLRVEH RIATALRERI GPQLGFCRYE 
PLRQDYVPVA AVPDLGAKPV AAAKPKAARA TMLSSIKPLG KKLERAVRTS VRGAVAPLLQ
KMAGGEGLPP PGPAAGHEVL LLAGENWSRV DYAAVARMRR ARGTRIAAVC QDFIPVVAPQ
FFADGEFVTQ FEAYAQFLIR ECDMVISISD STSADVRAYA QRHGGLRGAI EVVHLGADLA
TPATARRPAA LGDGQAQRFV LSVSTIQSRK NFDLLYHLWR RLTEQGTPRL PTLVIVGQPG
FGSADLLWQI AHDPVTASSI LHLPRAGDEE LAWLYRNCAF TLYPSFYEGW GLPVSESLAF
GKYCLASNTS SLPEAGAGLA GHLDPLDFAA WRDAVLDLIH SPEQLAGYEA AIRSNYRPVT
WAQSAGRMVE VLRSVAAAPA GFTS
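
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It uses the standard codon assignments, which translation table 11 shares for internal codons. As a simplifying assumption, the first codon is always reported as M, which reflects that this gene starts with TTG, one of the alternative start codons of table 11. The function name `translate` and the constants are hypothetical helpers, not part of any database tool.

```python
# Codon table built from the NCBI-style layout: bases ordered T, C, A, G
# at each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        # Assumption: the first codon is the initiator and is read as M.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("TTGGCGCAGC CCCCCGACAG CCAGACCCCG"))  # MAQPPDSQTP
print(translate("GCTGGTTTCACGAGCTAG")[1:])            # GFTS
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final four residues before the TAG stop codon. The slice `[1:]` drops the leading residue of that fragment, which the sketch would otherwise report as M because it treats every input as starting at the initiator codon.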