Gene RPB_4237 details


Gene Information

Locus tag: RPB_4237
Symbol:
ID: 3912045
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4812559
End bp: 4813644
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 71%
IMG OID: 637886140
Product: glycosyl transferase, group 1
Protein accession: YP_487839
Protein GI: 86751343
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
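
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon is part of the gene but does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4812559, 4813644)
print(length, protein_length_aa(length))  # prints: 1086 361
```

Both results match the Gene Length and Protein Length fields of the record.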


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.905319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTCA TGCTGGCGAT CTTCAAACTC GACAGACTGG GCGGCAAGGA ACGCGATTGC 
ATGGCGATCG CGCGGCACCT TGCGGCGCGT GGTCACGACG TCACCGTGCT GACGACGTCG
GCCGACGTCG CGGCCATCGA CGATCTGCGG ATCGAGAGCC TGCGCGCCCG CGGGCTCGCC
AATCACGTGC TGCTGCGCAA TTTTGCGCGC GACGTGATCG ACCGCAGGCA GCGCGAGCGG
CCCGATGCGC TGCTGTCGTT CGAGCGAATC CCCGACGCCG ATTATCACTA CGTCGCCGAC
GGCGCCGCGA TCCTGCGCGC CTGGCAGCTG CTGGCGTGGC CGCCGCGCCG CCGCGCCAAG
CTGGCGCTGG AGCGCGCGGT GTTCGCGGCG CCGGCCGCGA CCCGGTTGTT CTTCCTCACC
GAGCGCCAGC GCGACGAATA CATCATCGCC TATGATTTCG AGCCGGCCCG CGCCAGCGTG
CTGCCGATGG TGCTGCACGA CGACCGCTAC GCCGCCGCGC GCAAGCTCGG CGCCTCCCGC
TGGCGCAGCG AGCTCGGCAT TCCCGGCGAC GCGCTGATGG CGGTGTCGGT CGCGGTCGAT
CCGAAGCTCA AGGGCGTCGA TCGTAGCCTC GCCGCGCTGG CGTCCTATCC GAAGCTTCAC
CTCGTCGTCG CCGGTTCGGA TTCGCCGTGG CTGCATCGCG GCGTGGTGCG GCGCGATCTC
GAGCGGCGCG TGCACATCGT GCCTTACGTC GCCGAGGTGA TGGAGCTGAT CGCGGCGGCC
GATTTCATGC TGCACCCCGC GCGCTCCGAA GCGGCCGGGC AAGTGATCGG CGAGGCGCTG
CTCGCCGGCG TGCCGGTGCT CGCCTCGGCC GCCTGCGGCT ATGCCGGCGA GATCGAGCGC
AGCGGCGCCG GCCTGGTGCT GCCGGAGCCG TTCCAGCAGG AGGCGCTGGT CGCCGGCATC
GCCGCGATGA TCGACGCACT GCCGGCGATG CGCAAGCAAG CGGCGGCGCG CGCGAAGAGC
CTGCAGCAGC AGCGCGGCGC GTGGCTGTTG GCGATCGCCG AACGGATCGA ACAGCGCGAC
GTCTGA
 
Protein sequence
MKVMLAIFKL DRLGGKERDC MAIARHLAAR GHDVTVLTTS ADVAAIDDLR IESLRARGLA 
NHVLLRNFAR DVIDRRQRER PDALLSFERI PDADYHYVAD GAAILRAWQL LAWPPRRRAK
LALERAVFAA PAATRLFFLT ERQRDEYIIA YDFEPARASV LPMVLHDDRY AAARKLGASR
WRSELGIPGD ALMAVSVAVD PKLKGVDRSL AALASYPKLH LVVAGSDSPW LHRGVVRRDL
ERRVHIVPYV AEVMELIAAA DFMLHPARSE AAGQVIGEAL LAGVPVLASA ACGYAGEIER
SGAGLVLPEP FQQEALVAGI AAMIDALPAM RKQAAARAKS LQQQRGAWLL AIAERIEQRD
V
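
The sequences above are listed in space-separated blocks of ten characters. A minimal sketch of how such a listing might be processed follows: it strips the whitespace, computes the GC fraction, and translates the nucleotide sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in which codons may act as starts). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(listing: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence listing above.
excerpt = clean("ATGAAGGTCA TGCTGGCGAT C")
print(translate(excerpt))  # prints: MKVMLAI
```

Applying `clean` to the full gene listing and passing the result to `translate` and `gc_fraction` allows a comparison with the protein sequence and GC content given in the record.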