Gene RPB_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4301 
Symbol 
ID3912114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4888903 
End bp4889910 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID637886205 
Productsecretion protein HlyD 
Protein accessionYP_487899 
Protein GI86751403 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.404032 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGC GGATTCGCGA CATGATCACG GCCGGTGCCT GCGCCGCCCT CCTGCTCGGC 
GCGGCGGCGA CGCCCGCGGC GGCGGCGACG CTCACGGTGG CGGAGCAGAA GGTCTCGGAC
GAGAAGGCGG TGTTCGCCAC CGTCGAGAGC ATCAGCGTCG TGCCGGCGCG CAGCCGGATC
GGCGGCACCG TGATCGCCCT GAAAGTGCGC GAGGGCGACA GCGTCGCCCG CGGCCAGGAA
ATCGCGACGA TCGGCGACGA CAAGCTGACG CTGCAGATGA ATTCGCTCGA CGCGCAGATG
CAGGCGCTGC TGGCGCAGGC GTCGCAGGCG CAGATCGATT TCGACCGCAC CAGCGGCCTG
GTCGAACGCG GCACGCTGGC GCGCACCAAG CTCGACGAGG CGCGCACCAC GCTCAACGTG
GCCGAGAACA ATCTGCGCGC CAAGACCGCG GAGCGCGCGG TGGTGCAGCA GCAGTTCAAG
GAGGGCCAGG TGCTGGCGCC CGACGACGGC CGCGTGCTGA AGAAGATGGT CGCGGTCGGC
TCGGTGGTGC TGCAGGGCGA TACCATCGTC ACGGTGGCGC AGCAGCACTA CAAGCTGCGG
CTGCGGGTGC CGGAACGGCA CGCGCGGTTC CTCAAACAGG GTGATCGCGT TCGCGTCGAC
GGCGCCGAGT TCGGCGACCA CACGGCGAAG TTCGGCACGA TTGACCTCGT CTACCCGCTG
ATCGAGGACG GCCGCGTCGT CGCCGATGCC TCCGTCGAGG GGCTCGGCCA GTATTTCGTC
GGCGACCGGC TGCGGGTGTG GGTCTCCGGC GGCGAGCGCC CGGCCTTCGT CATTCCGTCG
CGCTACATCA AGACCGAATT CGGCATCGAC TACGTCCAGC TCGGCGAGCC GGGCAAGACC
GTCGCGGTGC CGGTGCAGCG CGGCCGCGAT CATCCCACGC CGGACATGCC GGACGGCCTC
GAGATCCTCT CGGGCCTGCG TAATGGTGAC AGGTTGGTGC AGCCGTGA
 
Protein sequence
MTMRIRDMIT AGACAALLLG AAATPAAAAT LTVAEQKVSD EKAVFATVES ISVVPARSRI 
GGTVIALKVR EGDSVARGQE IATIGDDKLT LQMNSLDAQM QALLAQASQA QIDFDRTSGL
VERGTLARTK LDEARTTLNV AENNLRAKTA ERAVVQQQFK EGQVLAPDDG RVLKKMVAVG
SVVLQGDTIV TVAQQHYKLR LRVPERHARF LKQGDRVRVD GAEFGDHTAK FGTIDLVYPL
IEDGRVVADA SVEGLGQYFV GDRLRVWVSG GERPAFVIPS RYIKTEFGID YVQLGEPGKT
VAVPVQRGRD HPTPDMPDGL EILSGLRNGD RLVQP