Gene RPC_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3009 
Symbol 
ID3973616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3303625 
End bp3305310 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content66% 
IMG OID637926120 
Producttype II secretion system protein E 
Protein accessionYP_532873 
Protein GI90424503 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.350498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGGA CTGCCCAGGA TTTTGTCGCG CATTTTTGCC GCGAGAATGC TCTAGCGGCG 
GAATTCGCCG GCCGCGGTGC TGTTGCCGCC GGGGCGGCCG ACGCCGACCG CGGCCTGCGT
AAACTCTGGG AGATCAGCGA ATTGTCGGCG AGTGAGTTCG CCGACGCGGT GGCACGGTTT
TACGACCTGC CGCGAACTAC GCTTCCGGAG CTGATTGCAG CGCAGTCGCA GGCGCATCGG
TTCTCCCGCC GTTTCTTGCG GGAGATGGCG GTGTTTCCTT ATCAGTCAGC GACCGGTGAG
CCGATCGTCG CGATTGCCGA TCCGAGCGAC CACGCGTCGA TACGGGCGGC GGCGATCGTG
TTCGGCGCCG CCGTCAGCAC CCAGGTCGCT TCGTTTGAGG ACATCGCGAC GGTGCTCGAT
CAGCGTCTCG GCGAGGATCA CAACTCCGAG GACGAAAGCG TCGCGAGCAT CTCGTCGCAG
GACGACGACA TCGACAATCT GCGCGATCTG GCCAGCGGCG CGCCGGTGGT CCGCGCGGTC
AACGATCTGT TCGAAAGCGC GGTCGAACTG CGGGCCAGCG ACATTCATAT CGAGCCGACG
CGCACCGCGC TGATCGCGCG GATGCGGATC GACGGACTGT TGCGCACCGT GCCGACGCCG
GCCGGGGTGC CGCCGCAAGC GGTGATTTCC CGGATCAAGA TCTTGGCGGG TCTCAACATC
GCGGAACGGC GGCTGCCGCA GGACGGCGCT GCTCGGTTCC GTGCGGCGCG TTCGGAGATC
GACATGCGCG TTGCGATCAT GCCGACGCAA CACGGCGAGT CCGCGGTCAT CCGCTTGCTG
CCGAGGGACC GCGGCCTGCT CTCGATCGAG AAGCTCGGCT TCCTGCCGGG CGACGAAGGC
AAGCTCCGCG GCATGCTGAC GCTGCCACAC GGCATGATCG TGGTGACCGG GCCGACCGGG
AGCGGCAAGA CCACGACGCT TGCCACGGTC CTGTCGGTGC TCAACCAACC GACCCGGAAG
ATTTTGACGA TCGAGGATCC GGTCGAATAC GAAATTCCGG GGATATGCCA ATCCCAGGCC
AAGCCGTCGA TCGGTCTGAC CTTTGCGACC GCGCTGCGCG CCTTCGTCCG CCAGGACCCC
GACGTGATCA TGGTCGGAGA GATCCGCGAC GCCGAAACCG CGCATGTCGC GATCCACGCG
GCGCTGACCG GCCATCTGGT GCTCACCACG CTGCACACCG AAACCGCGGC CGCCGCGGTG
CCGCGGCTGC TCGACCTGGG GGTGGAGGCG TTCTTGCTGC GCTCGACGCT ACGGGCGGTG
ATCGCACAAC GCCTGGTTCG TCAGTTGTGC GATCGATGCA AGGCCGGCCG TGCTTTGACC
GAAGCCGATA TCGAGGTCGA TCCGCGCTAT GCCGCCATTG GGCTCAAACT CGGCGAGACC
ATCTTCGAGC CGGTCGGCTG CGAGCGCTGC GGCGGTACCG GGTATCGCGG CCGCTGTGGC
GTGTTCGAGA TCTTGGAGAT GAGCGAAGAC GTCCGCCAGT TGATCGACCA GCAATCCGAT
TGGGCCTCGA TCGACAAGGT CGCGGTTCGC AACGGCATGA CGACGATGAT CGATGACGGC
CTCGCCAAAT GCCGCTGCGG CATGACCTCG GCGGCCGAGA TTCTTCGCGT CACCACGGTG
CGGTGA
 
Protein sequence
MDRTAQDFVA HFCRENALAA EFAGRGAVAA GAADADRGLR KLWEISELSA SEFADAVARF 
YDLPRTTLPE LIAAQSQAHR FSRRFLREMA VFPYQSATGE PIVAIADPSD HASIRAAAIV
FGAAVSTQVA SFEDIATVLD QRLGEDHNSE DESVASISSQ DDDIDNLRDL ASGAPVVRAV
NDLFESAVEL RASDIHIEPT RTALIARMRI DGLLRTVPTP AGVPPQAVIS RIKILAGLNI
AERRLPQDGA ARFRAARSEI DMRVAIMPTQ HGESAVIRLL PRDRGLLSIE KLGFLPGDEG
KLRGMLTLPH GMIVVTGPTG SGKTTTLATV LSVLNQPTRK ILTIEDPVEY EIPGICQSQA
KPSIGLTFAT ALRAFVRQDP DVIMVGEIRD AETAHVAIHA ALTGHLVLTT LHTETAAAAV
PRLLDLGVEA FLLRSTLRAV IAQRLVRQLC DRCKAGRALT EADIEVDPRY AAIGLKLGET
IFEPVGCERC GGTGYRGRCG VFEILEMSED VRQLIDQQSD WASIDKVAVR NGMTTMIDDG
LAKCRCGMTS AAEILRVTTV R