Gene RPB_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0171 
Symbol 
ID3907776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp185450 
End bp186520 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID637882053 
Productsecretion protein HlyD 
Protein accessionYP_483794 
Protein GI86747298 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGTA GCTGGATGCG TTGGGTCGTG ATCGTCGCCG TGGTGGCGGT TGCGGGTGGA 
GGTTACTTCG CGTGGCGAAC GTTCGGCGCC AAGGGGTTGC CGCCCGGCAT CGCCAGCGGC
AACGGCCGGA TCGAGGCGAC CGAGATCGAC GTTTCCACGA AGTCGGCCGG CCGCATCCGC
GACATTCTCG TGCGCGAGGG CGATTTCGTC ACCGCGGGTC AGGTGCTGGC GCGGATGGAT
ACGGATCAGC TCGAGGCGCA GCGCCGCCAG GCGGAGGCGC AGTTGCGGCG GGCCAGCATC
GGCATCGAGA CCGCGACCAG CCTGGTCACG CAGCGCGAAG CCGAGCGTGA GGCGGCTGTC
GCGGTGATCG CGCAGCGCGA CGCCCAGCTC GACGCGCTGG AGCGCAAGCT GGCGCGCGCC
GAAGCGCTGA TCAAGACCAG CGCGGTGTCG CAGCAGGTGC TGGACGACGA CCGCGCCAAC
GAGCAGGGCG CGAAGGCCGC GGTCGCCGCC GCCAAGGCGC AGCTCGCGGC CAGCGAGGCG
GCGATCAGCT CGGCGAAAGC GCAAGTGATC GACGCCGGCG CGGCGGTCGA CGCCGCCAAG
GCCGCGATCG ACAGCATCAC TGTCGAGATC AACGACAGCA CGTTGAAATC GCCGCGCGAC
GGCCGCGTGC AATATCGCGT CGCCCAGCCC GGCGAAGTGA TCGCCGCCGG CGGGCGCGTG
CTGAATCTGG TCGATCTCAG CGACGTCTAC ATGACCTTCT TCCTGCCGAC CGCGCAGGCC
GGGCAGATCG CGATCGGCGC CGATGTGCGT CTGGTGCTCG ACGCGCTGCC GCAGGTGGTG
ATTCCGGCGA AGGCGACCTT CGTCGCCGAC ACCGCGCAGT TCACGCCGAA GACGGTGGAG
ACCGAAGAGG AACGGCAGAA GCTGATGTTC CGGGTCAAGG CGCACATCCC CCAGGAGCTG
CTGCGCAAGT ACATCCAGCG CGTCAAGACC GGACTGCCGG GCGTGGCCTA TATTCGGCTC
GATCCGAAGG CCGAATGGCC GGCCAATCTC AGCGGCACGC TGGCGCAATG A
 
Protein sequence
MASSWMRWVV IVAVVAVAGG GYFAWRTFGA KGLPPGIASG NGRIEATEID VSTKSAGRIR 
DILVREGDFV TAGQVLARMD TDQLEAQRRQ AEAQLRRASI GIETATSLVT QREAEREAAV
AVIAQRDAQL DALERKLARA EALIKTSAVS QQVLDDDRAN EQGAKAAVAA AKAQLAASEA
AISSAKAQVI DAGAAVDAAK AAIDSITVEI NDSTLKSPRD GRVQYRVAQP GEVIAAGGRV
LNLVDLSDVY MTFFLPTAQA GQIAIGADVR LVLDALPQVV IPAKATFVAD TAQFTPKTVE
TEEERQKLMF RVKAHIPQEL LRKYIQRVKT GLPGVAYIRL DPKAEWPANL SGTLAQ