Gene RPC_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4204 
Symbol 
ID3972659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4674677 
End bp4675873 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content67% 
IMG OID637927306 
Productglycosyl transferase, group 1 
Protein accessionYP_534047 
Protein GI90425677 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.186805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.199136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC GGCGCGCCGA CGTCGTGATG TTCTCGACGG CCGATTGGGC GTCGCAGTAC 
TGGACCAACA AGCAGCACAC CGCTGCGCAT CTCGCCGCGC GCGGCCATCG CGTGCTCTAT
GTCGAAACCG TGGGGCTGCG CCGCCCCGGC CTCAACAGCA TGGACGTCGG GCGGATCTGG
GCGCGGCTGC GGCGCGGTCT GAAGCCGATC GCGGAAGTGC GCGACAATCT GTGGGTGCTG
TCGCCGCTCA CCGTGCCGCT CGGTCAGCGC TATGCGCCGA TCGCCGCATT CAACCGCTGG
CAGTTGCGGG CGCGGATCGG CGGCTGGCTG CGCAAACACC GCATCGGCCG GCCGCTGATC
TGGACCTATC ACCCCTACAT GCTGGAGGCG GCCGAGGCGC TCGATCCCTC GGCCATGGTC
TATCATTGCG TCGACGATAT CGGCGCGGTG CCGGGCGTCG ACCGCGCCGC CTACGACGCC
GCCGAACAGC GGCTGTTGCG GCGCGTCGAT CTGGCCTTCA CCACCAGCGG GCATCTGCAG
CAGCGCTGCG CCGCCATCGC CGGCGAGCGC GCCCGTTACT TCGGCAATGT CGCCGATATC
GACCATTTCG CCACCGCGCG CGGCAACATC GATCTGCCGC CGGAACTCGC CGCGATTCCG
CGGCCCAGGC TCGGCTATGT CGGGGTGATC TCCGACTTCA AGATCGATCT GGAACTGTTG
CAGACGCTGG CCGTGGCCCA TCCCGATTGG CATTTCGTCT TCATCGGCGA CGAACGCGAG
GGCCAGCACA GCGACGTCGT GACGCGCATG GCGCAGCTGT CGAACGTGCA TTTTCTCGGC
TGGCGATCCT ATCAAGACCT GCCGCGTTAT CTTGCCGGCT TCGATGTCGG CCTGCTGCCG
CAGCTGATCA ACGACTACAC CCGCGCGATG TTTCCGATGA AGTTCTTCGA ATACCTCGCA
GCCGGACTTC CAGTCGTCGC CACCCCGCTG CCGGCGTTGC GCGACCTTGC CGCGGTTCAC
GGCATCGGCG CCGATACAGC AACGTTTGCG CAGGCGATTT CCGCAGCTCT CGATGGCCGC
GGGCCGCAGC CGCTACCGAT CGACGACCCG CTGCTACAGG CCAATTCCTG GGACGCGCGG
CTCGACCAGA TGTTGGCAAT GATCGAGGCA ACCTCGGCCG CGTCGCCGCG CGGTTAG
 
Protein sequence
MTTRRADVVM FSTADWASQY WTNKQHTAAH LAARGHRVLY VETVGLRRPG LNSMDVGRIW 
ARLRRGLKPI AEVRDNLWVL SPLTVPLGQR YAPIAAFNRW QLRARIGGWL RKHRIGRPLI
WTYHPYMLEA AEALDPSAMV YHCVDDIGAV PGVDRAAYDA AEQRLLRRVD LAFTTSGHLQ
QRCAAIAGER ARYFGNVADI DHFATARGNI DLPPELAAIP RPRLGYVGVI SDFKIDLELL
QTLAVAHPDW HFVFIGDERE GQHSDVVTRM AQLSNVHFLG WRSYQDLPRY LAGFDVGLLP
QLINDYTRAM FPMKFFEYLA AGLPVVATPL PALRDLAAVH GIGADTATFA QAISAALDGR
GPQPLPIDDP LLQANSWDAR LDQMLAMIEA TSAASPRG