Gene RPD_3522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3522 
Symbol 
ID4024036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3910043 
End bp3911320 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content65% 
IMG OID637963726 
Productputative pilus assembly protein cpaE 
Protein accessionYP_570646 
Protein GI91977987 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4963] Flp pilus assembly protein, ATPase CpaE 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.257718 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGCT ACGCACGCCA GAACCAGGAA GAGCCCGCGG CCGGCGCCTC GTCGTCCGCC 
ACGACGCAGG ACGAGCACAT CGCGCCGGCG CCGCGTGTAT CAGTGCAGGC GTTCTGCGAA
TCGGTCGAGA CCGCGGCGGC CGTGCAGGCC GCCGGCGAAG ACCGCCGTCT CACCAAGGCG
CATCTGAAGA TCCAGATGGG CGGCATGATC GCGGCGATCG AAGCCTATCG CTCGGCGCCG
ACCCCGAACG TCATCATCCT CGAGACCGAT CCGCGCAACG ACGTGCTCGC CGGCCTCGAT
CAGCTCGCCA CGGTCTGCGA TCCGGGCACC CGCGTCATCG TGATCGGCAA GGTCAACGAC
GTCACGCTGT ATCGCGAGCT AGTTCGTCGC GGCGTCAGCG ACTACGCGAT CGCGCCGGTC
GATCCGATCG ACGTCGTGCG CTCGATCTGC AATCTGTTTT CGGCACCGGA AGCCAAGGCC
GTCGGCCGCA TCATCGCGAT CGTCGGCGCC AAGGGCGGTG TCGGCGCCTC CACCATCGCG
CACAACGTCG CCTGGGCGAT CGCCCGCGAT CTGGCGCTCG ACTCGGTCGT CGCCGACCTC
GACCTCGCTT TCGGCACCGC CGGCCTCGAC TACAACCAGG ACCCGCCGCA AGGCATCGCC
GAGGCGGTGT TCTCGCCGGA CCGCGTCGAC ACCGCCTTCG TCGATCGTCT GCTGTCGAAA
TGCACCGATC ATCTCAGCCT GCTGGCCGCG CCGGCGACGC TCGATCGGGT CTATGATTTC
GGCGCCGACG CGTTCGACTC GATCTTCGAC ACGCTTCGCG CGACGATGCC CTGCATCGTG
CTCGACGTGC CGCATCAATG GACCGGTTGG GCGAAACGCG CGCTGATCAA TGCCGACGAC
ATCCTGATCG TCGCCGCGCC TGACCTCGCC AATTTGCGCA ATGCCAAGAA TCTGTACGAT
CTGCTGAAGG CGTCGCGGCC GAACGATCGA CCGCCGTTAT ACTGCCTGAA CCAGGTCGGC
GTGCCGAAGC GGCCCGAGAT CAATGCGAGC GAGTTCGCCA AGGCGATCGA GAGCCAGCCG
ATCGTCAGCA TCCCGTTCGA TCCGCAGATG TTCGGTTCGG CCGCCAATAA CGGGCAGATG
ATCGCGGAGA TCGCGGCGTC TCACAAGACC ACCGAGATGT TCCTGCAGAT CGCCCAGCGA
CTGACGGGAC GCGGTGAAGC CAAGAAGCCG AAAGGCGGCT TCCTGTCGCC GATTCTGGAG
AAGCTGCGGG CCAGATAA
 
Protein sequence
MISYARQNQE EPAAGASSSA TTQDEHIAPA PRVSVQAFCE SVETAAAVQA AGEDRRLTKA 
HLKIQMGGMI AAIEAYRSAP TPNVIILETD PRNDVLAGLD QLATVCDPGT RVIVIGKVND
VTLYRELVRR GVSDYAIAPV DPIDVVRSIC NLFSAPEAKA VGRIIAIVGA KGGVGASTIA
HNVAWAIARD LALDSVVADL DLAFGTAGLD YNQDPPQGIA EAVFSPDRVD TAFVDRLLSK
CTDHLSLLAA PATLDRVYDF GADAFDSIFD TLRATMPCIV LDVPHQWTGW AKRALINADD
ILIVAAPDLA NLRNAKNLYD LLKASRPNDR PPLYCLNQVG VPKRPEINAS EFAKAIESQP
IVSIPFDPQM FGSAANNGQM IAEIAASHKT TEMFLQIAQR LTGRGEAKKP KGGFLSPILE
KLRAR