Gene RPC_4719 details


Gene Information

Locus tag: RPC_4719
Symbol:
ID: 3972695
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 5280285
End bp: 5281325
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 67%
IMG OID: 637927831
Product: glycosyl transferase family protein
Protein accession: YP_534560
Protein GI: 90426190
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
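
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers are consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5280285
end_bp = 5281325
gene_length_bp = 1041
protein_length_aa = 346

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)
residues = codons - 1

print(span, remainder, residues)
```

If the three printed values agree with the gene length, zero, and the protein length respectively, the fields are internally consistent.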


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.19129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTTG GTTCTGACCT GTCCGCCCTC GCCGCCGGCA ACGGCGCTGC CGCTGCGAAA 
ACGCTGTCAC TGGTGGTGCC GCTGTTCAAT GAGGCGGCGG GCCTGCCGCA GCTGCACGAC
CGGCTGGTGG CGCTCGCCGC CACGCTGCGG CAGCGCTACG GGCTTGGCTG CGAAGTGATC
TATGTCGACG ACGGCAGCGC CGACGCCACG CTGAGCGTGG CGCGCGGTCT GTCGGCGCCG
ACGCTCGACG TCCAGGTGGT GTCGCTGTCG CGCAATTTCG GCAAGGAAGC CGCCTTGATG
GCCGGGCTCG ACCATGCCGG AAACGGCGCG GTGCTGTTCA TGGACGGCGA CGGCCAGCAT
CCGCCGGCGC TGGTCGAACA ATTGGTGCAG CACTGGATCG TCGACAGCTA CGACGTGGTC
TACACCGCCA AGGCGCACCG CGACAACGAA ACCTGGCTGC GCCGCACCGC GGTGCGCGGC
TTCTACATGC TGATCAATTG GGGCGCGCGG CAGAAGATCC CGGAAGACGC CGGCGATTTC
CGGCTGCTGT CGCCGCGCGC CGCCGCGGCG TTGCGGCAAT TGCCGGAGCG CAACCGCTTC
TTCAAGGGAC TGGCGAGCTG GATCGGGTTT CGCCAGATCC GCGTCGACTA TGAGCCGGAG
CCGCGCTCGC ACGGCATCAC CTCGTTCAAC GCCGCACGGC TGGTCGGGCT GTCGATCGAG
GGCCTGACCT CGTTTTCGGT GGCGCCGTTG CGCATCGCCA GCCTGCTCGG CCTGTTGCTC
GCCTTCGTGG CGTTCCTGTT CGGGCTGTCG ATCCTGTGGG AGACCATGGT CAGCGGCAAA
TCGGTGCCGG GCTATCCGTC GCTGGTGGTC GGACTGATGA CGATCGGCGG CGTGCAGCTG
ATCATGATCG GCATCGTCGG CGAGTATATC GGCAAGATCC TCTCCGAATT GAAGGCGCGG
CCGATCTATT TCGTCGCCGA ACACAGCGTC AAGCGCGCCG ACACCACCAC CAACACCGGC
GAACGGACCG CCGCCGAATG A
 
Protein sequence
MILGSDLSAL AAGNGAAAAK TLSLVVPLFN EAAGLPQLHD RLVALAATLR QRYGLGCEVI 
YVDDGSADAT LSVARGLSAP TLDVQVVSLS RNFGKEAALM AGLDHAGNGA VLFMDGDGQH
PPALVEQLVQ HWIVDSYDVV YTAKAHRDNE TWLRRTAVRG FYMLINWGAR QKIPEDAGDF
RLLSPRAAAA LRQLPERNRF FKGLASWIGF RQIRVDYEPE PRSHGITSFN AARLVGLSIE
GLTSFSVAPL RIASLLGLLL AFVAFLFGLS ILWETMVSGK SVPGYPSLVV GLMTIGGVQL
IMIGIVGEYI GKILSELKAR PIYFVAEHSV KRADTTTNTG ERTAAE
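
The relationship between the two sequences above can be checked programmatically. The sketch below is one possible way to do it, not a method taken from the source: it strips the spaces that group residues in blocks of ten, computes a GC fraction, and translates codons using a 64-character lookup string. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, which holds for elongation codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Hypothetical helper names; not part of the source record.
BASES = "TCAG"
# Amino acids for all 64 codons, with each codon position ordered T, C, A, G.
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        a, b, c = (BASES.index(x) for x in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGATTCTTG GTTCTGACCT GTCCGCCCTC"
print(translate(first_blocks))  # MILGSDLSAL
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field. The sketch does not handle ambiguous bases such as N.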