Gene RPB_1550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1550 
Symbol 
ID3908749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1747269 
End bp1748408 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content64% 
IMG OID637883446 
Productglycosyl transferase, group 1 
Protein accessionYP_485171 
Protein GI86748675 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.252909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.720427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCTGTCA AGAAGCGGCT GATGTTCGTG GTCACGGAGG ACTGGTATTT CGTCTCGCAC 
CGGCTGCCAC TGGCCCGCGC GGCGCGCGAC GCCGGATACG ACGTGCTCGT CGCGACACGG
CTGGGAGATC ATGCCGAACT CATCGCAAGG GAAGGTATCA CGCCCATCGG ATTGCGGAGC
ATGCGGCGCG GTGGGCGCAA TCCGATCGGC GAACTGGCGG CGATAGCCGA ACTGGCCGCG
CTGTACCGAC AGTACAAACC GGATATCGTC CATCACATCG CGATCAAGCC GGTGCTGTAC
GGTTCAATTG CCGCGCGTAT GGCCGGCGTC CGTGGCGTGG TCAACAATCT GGCGGGTCTT
GGTTTCGTAT TCTCCTCCAA GACCCGAAAG GCGGCGTTGC TGCGTCCCGC GATCAGGGCA
TTGCTGGCGA TTGCCCTGAA GCGAAGGGGC ACGCTGACAA TCGTGCAGAA CTCCGACGAT
GCGAACGTGC TCGCCACCAA TATCGGGATT CCCGCGGACC GCATTCGCCT CATCAAGGGA
TCAGGCGTGG ATATGCAATT GTTCGCAGAG CAGCGCGCCG AATCCAGTCC GCCTATCGTG
ATCCTGGCGT CCAGGATGAT CTGGGACAAG GGAATCGGCG ATTTTGTGAA GGCGGCGTCG
CTGCTGAAAG CAGACGGCGT CGCCGCCCGC TTCGTCGTTC TCGGAGCGCC GGATCCGGGC
AATCCCGGGT CGATCCCTCA ATCGGTGCTG GAGGGGCTCA ACAACGAAGG CATCGTGGAG
TGGTGGGGGC ATCGCAGCGA CATGCCCGCG ATCATCGCCG GCGCGGCACT GGTCTGCCTG
CCGACGACGT ACGGCGAGGG CGTGCCGAAG ATCCTGATTG AAGCCGCGGC GGGTGGCTGC
GCCATCGTCG CCTATGACGT TGCGGGGTGC CGCGAGATCG TCACCGATGG CGACAACGGC
AAGCTGGTTC CGGCCGGCGA TATCGGCCAA CTGGCAGCGG CGATCAAGGT CCTGCTGGAG
GATCCCGACC GGCGGGCGGC GATGGGATCG AGAGGCAGGA AGCGCGTCGA AACCGAATTC
GCTCTGGAGC ATGTCGTCGC CCAGACCCTG AGCGTCTACC GCGGACTGGA ACCGACGTGA
 
Protein sequence
MPVKKRLMFV VTEDWYFVSH RLPLARAARD AGYDVLVATR LGDHAELIAR EGITPIGLRS 
MRRGGRNPIG ELAAIAELAA LYRQYKPDIV HHIAIKPVLY GSIAARMAGV RGVVNNLAGL
GFVFSSKTRK AALLRPAIRA LLAIALKRRG TLTIVQNSDD ANVLATNIGI PADRIRLIKG
SGVDMQLFAE QRAESSPPIV ILASRMIWDK GIGDFVKAAS LLKADGVAAR FVVLGAPDPG
NPGSIPQSVL EGLNNEGIVE WWGHRSDMPA IIAGAALVCL PTTYGEGVPK ILIEAAAGGC
AIVAYDVAGC REIVTDGDNG KLVPAGDIGQ LAAAIKVLLE DPDRRAAMGS RGRKRVETEF
ALEHVVAQTL SVYRGLEPT