Gene RPB_3411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3411 
Symbol 
ID3911213 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3898432 
End bp3899523 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID637885314 
Productsecretion protein HlyD 
Protein accessionYP_487018 
Protein GI86750522 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.591463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.695679 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAAC CCACCATGTT ACCGCCGCTT CGGATCATCC TGTTGCTGCT TATGGCCACG 
CCGCTGGCGG CGTGCGGCGA GAAGGCTCAG CAAGCCCGGC CGATCAATCT GGTGAAGACC
GAGATCGTGC ACCTCGAGCC CCGGCAGACG ATTGTTCGTC TCACCGGCGA CGTCCAGGCG
CGCGTGACCA GCGAACTGTC CTTCCGCGTC AGTGGCCGGG TCACCGAGCG GCTGGTCGAT
GTCGGCGCCC ATGTGAATGC CGGCGACGTG CTGGCGCGCA TCGATCCGAC CGAGCAACAG
GCCGACCTGG TCGGCTCGCA GGCCGCCGTC GCCTCGGCGG AGGCGCAACT GCGCCTCGCC
AATGCGACCT TCGACCGGCA GAAATCGCTG ATGGCCAGCG GCTTCACCAC CCGCAGCTCC
TACGACCAGG CGCAGGAGGG ATTGCGGACC GCCGAAGGCT CCCTCGACAA CGCCAAGGCC
CAGCTCGAAA TCGCAAGGGA TGCGCTGACC TATACCGAAC TGCGCGCGAG CGCGTCGGGC
ATCATCACGG CGCGCAACAT CGAGGTCGGG CAGGTCGCCC AATCCGCCCA GTCGGCCTAC
ACGCTGGCGG AGGATGGCGC CCGCGACGCC GTGTTCGACG TCTACGAATC GGTGTTTCTG
ACGCCGCTCC AGGGCGGCAC CATCAAACTC ACGCTGGTGT CGGATCCGTC GGTCACCGCC
ATCGGGCGTC CGCGGGAGAT TTCTCCCACC GTCGACCAGA AGAGCGGCAC CGTTCGGGTC
AAGCTGTCGA TCGAGAATCC GCCGGCCGCG ATGACGCTCG GCAGCGCCGT CACGGGCGAG
GGGCGCAGCC GTTCCGTGGA CAAGATCGTG CTGCCCTGGA GTGCATTGAC CTCCGACCAG
AAAGGGCCGG CCGTCTGGGT GATCGACCCC AAGACCCGCG CCGTGTCGCT CAGGAGCGTC
ACCGTGGAAA GCTACGAAAC CAGCTCGATC ATCGTGGCCG ACGGCCTGAA GGCCGGCGAG
CGCGTCGTCG TCGACGGCGG CAAGATGCTC CGCCCCGCGG AGATCGTCAC TTATGACGGG
GAAAACGCAT GA
 
Protein sequence
MSQPTMLPPL RIILLLLMAT PLAACGEKAQ QARPINLVKT EIVHLEPRQT IVRLTGDVQA 
RVTSELSFRV SGRVTERLVD VGAHVNAGDV LARIDPTEQQ ADLVGSQAAV ASAEAQLRLA
NATFDRQKSL MASGFTTRSS YDQAQEGLRT AEGSLDNAKA QLEIARDALT YTELRASASG
IITARNIEVG QVAQSAQSAY TLAEDGARDA VFDVYESVFL TPLQGGTIKL TLVSDPSVTA
IGRPREISPT VDQKSGTVRV KLSIENPPAA MTLGSAVTGE GRSRSVDKIV LPWSALTSDQ
KGPAVWVIDP KTRAVSLRSV TVESYETSSI IVADGLKAGE RVVVDGGKML RPAEIVTYDG
ENA