Gene RPC_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4001 
Symbol 
ID3969347 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp4449338 
End bp4450705 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID637927105 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_533846 
Protein GI90425476 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.443077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTCTCC TGGAAGCTCG TTCAGCCACC CCGCTCGCCG GCCTCAAGGC GCGGTTGCGG 
GCGATGTTCG GCGGCGGCCA CGAGGCCGCG CTGACCAACC GGCTGGCCGG CACCATCTTC
ATCATCCGCG TGGTCAGCGC CGCCTTGGCG TATCTGTCGC AAGTGCTGCT GGCGCGCTGG
ATGGGCGGCG CGGATTACGG CACTTACGTC TATGTCTGGA CCTGGGTGCT GCTACTCGGC
TCGATGCTGG ATTTCGGCAT CGCGATGTCG TGCCAGAAGA TCATTCCGGA ATATCGCGCC
GCCGGCGCCC ACGCCTTGTT GCGCGGTTTT CTGTCCGGCA GCCGCTGGGC CACGCTGGCG
GCCTCGAGCG CGGTGGCGCT GGCGCTCGCC GGGCTGGTGC GGCTGCTGTC GCCGTGGATC
GATCCGCCGG CCGTGGTGCC GCTGTATCTC GGCTGCCTGA CGCTGCCGGC CTTCGTGGTC
GCCAACACCC AGGACGGCAT CGCCCGCTCG CACGACTGGA TGCGGCTCGG CTTGATGCCG
CAATTCATCG TCCGGCAATC GCTGATCATC GGCTTCACCG CCGGCGCCGT GGTGCTCGGC
TTTCAGCTCG GCGCGGTGGC TGCGATGATC GCCAGCTGCG CCGCGGTGTG GATCGCGATG
CTCGGCCAGC TGCTCGCGCT GAACCGCCGG CTTGAGGGCG TGATCGACCC CGGCCCCAAA
GCCTATGAAT TCCGCAGCTG GCTGAAAACC TCGCTGCCGA TCATGATGGT CGAGGGCTTC
TATCTGCTAT TGTCCTATAT CGACGTCCTG GTGCTGCAGC ATTATCGCTC GGCCGAAGAA
GTCGGGGTGT ATTTCGCGGT GATCAAGACG TTGGCGCTGG TGTCGTTCAT CCACTACGCG
ATGTCGGCGG TCACCGCGCA TCGCTTCAGC GAGTATCACA CGAGCGGCGA CAAGGCGCGG
CTCGCCGCCT ATCTCCGCCA CGCCATCACC TGGACGTTCT GGCCGTCGCT GGCCGCCACC
GTCGTGCTGC TGGCGCTGGG CAAGCCGCTG TTGTGGCTGT TCGGGCCGCA ATTCGTCGCC
GGCTACGACA TCATGTTCAT CGCCGCGATC GGCCTCGTGG TGCGCGCCGC GATCGGCCCG
GTGGAACGGC TGCTCAACAT GCTCGGCCAG CAGAACCTCT GCGCGCTGGC CTATGCGCTG
GCGTTCGCGA TCAACCTCGT GCTGTGCATC GCGCTGGTGC CGCGGTTCGG CGGCCACGGC
GCCGCCGCCG CCACCTCGCT GGCACTCACT TTCGAAACCG TGCTGCTGTT CTGGATCACC
CGCCAGCGGC TCGGCCTGCA CGTGCTGGCG TTCGGCAAGC GGGCCTGA
 
Protein sequence
MALLEARSAT PLAGLKARLR AMFGGGHEAA LTNRLAGTIF IIRVVSAALA YLSQVLLARW 
MGGADYGTYV YVWTWVLLLG SMLDFGIAMS CQKIIPEYRA AGAHALLRGF LSGSRWATLA
ASSAVALALA GLVRLLSPWI DPPAVVPLYL GCLTLPAFVV ANTQDGIARS HDWMRLGLMP
QFIVRQSLII GFTAGAVVLG FQLGAVAAMI ASCAAVWIAM LGQLLALNRR LEGVIDPGPK
AYEFRSWLKT SLPIMMVEGF YLLLSYIDVL VLQHYRSAEE VGVYFAVIKT LALVSFIHYA
MSAVTAHRFS EYHTSGDKAR LAAYLRHAIT WTFWPSLAAT VVLLALGKPL LWLFGPQFVA
GYDIMFIAAI GLVVRAAIGP VERLLNMLGQ QNLCALAYAL AFAINLVLCI ALVPRFGGHG
AAAATSLALT FETVLLFWIT RQRLGLHVLA FGKRA