Gene RPB_1781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1781 
Symbol 
ID3908862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2042291 
End bp2043694 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content67% 
IMG OID637883675 
Producttype II secretion system protein E 
Protein accessionYP_485400 
Protein GI86748904 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.615567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.227871 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCGG CTCCGGAGCC GATGTCACGC GCGCCCGAGG CGCCGTCCGC GGTCGCCTCG 
CCGCCGCTCG CACCGAACCG CGCGCCGCCG CCGCCGCCCC AGGCCGAAGC CCGCCGCTCC
GACAATTACT ACCAGGTCAA GGCGACGATC TTCGGCGCGC TGATCGAGGC GATCGACCTG
GCGCAGCTCG CCAAGCTCGA CGCCGAATCC GCGCGCGAAG AAATCCGCGA CATCGTCAAC
GAGATCATCG CGATCAAGAA CATCGTGATG TCGATCTCCG AGCAGGAGGA ACTGCTCGAC
GACATCTGCA ACGACGTGCT CGGCTACGGC CCGCTGGAGC CGCTGCTGTC GCGCGACGAC
ATCTCCGACA TCATGGTCAA CGGCGCCGGC ACGGTGTTCA TCGAAGTCGC CGGCCGGATC
CAGCGCACCG GCATCCGCTT CCGCGACAAT CAGCAGCTCC TCAACATCTG CCAGCGCATC
GTCAGCCAGG TCGGCCGGCG CGTCGACGAA TCCTCGCCGA TCTGCGACGC CCGCCTCGCC
GACGGCAGCC GCGTCAACGC CATCGTGCCG CCGCTGGCGA TCGACGGCCC CGCGCTCACC
ATCCGCAAAT TCAAGAAGGA CAAGCTGACG CTCGACCAGC TCGTCAAATT CGGCGCGATC
ACGCCGGAGG GCGCGCAGAT CCTGCAGATC ATCGGCCGCA CCCGCTGCAA CGTGCTGATC
TCCGGCGGCA CCGGCTCCGG CAAGACCACG CTGCTGAACT GCCTGACCAA CTACATCGAC
GACGACGAGC GCATCATCAC CTGCGAAGAC GCCGCCGAAC TGCAGCTGCA GCAGCCGCAC
GTGGTGCGCC TCGAAACCCG GCCGCCGAAC ATCGAGGGCG AAGGCCAGGT GACGATGCGC
GAACTGGTGC GCAACTGCCT GCGTATGCGG CCGGAACGGA TCATCGTCGG CGAAGTCCGC
GGACCGGAGG CGTTCGACCT GTTGCAGGCG ATGAACACCG GCCACGACGG TTCGATGGGC
ACGCTGCACG CCAACAACCC GCGCGAGGCG CTGTCGCGCT GCGAGTCCAT GATCACCATG
GGCGGCTTCT CGCTGCCCTC GCGCACCATC CGCGAGATGA TCTGCGCCTC GATCGACGTC
ATCATCCAGG CCGCACGGCT GCGCGACGGC TCGCGCCGGA TCACCCACAT CACCGAGGTG
ATGGGGATGG AAGGCGACAC CATCATCACC CAGGATCTGT TCATCTACGA CATCATCGGC
GAGGACGCCA ACGGCCACAT CGTCGGCCGG CATCGCTCGA CCGGGATCGG CAGGCCGCGG
TTCTGGGAAC GCGCGCGCTA TTACGGCGAG GAAAAGCGGC TCGCCGCGGC GCTCGACGCC
GCCGAAGTCG CAGCCGTCAC CTGA
 
Protein sequence
MSPAPEPMSR APEAPSAVAS PPLAPNRAPP PPPQAEARRS DNYYQVKATI FGALIEAIDL 
AQLAKLDAES AREEIRDIVN EIIAIKNIVM SISEQEELLD DICNDVLGYG PLEPLLSRDD
ISDIMVNGAG TVFIEVAGRI QRTGIRFRDN QQLLNICQRI VSQVGRRVDE SSPICDARLA
DGSRVNAIVP PLAIDGPALT IRKFKKDKLT LDQLVKFGAI TPEGAQILQI IGRTRCNVLI
SGGTGSGKTT LLNCLTNYID DDERIITCED AAELQLQQPH VVRLETRPPN IEGEGQVTMR
ELVRNCLRMR PERIIVGEVR GPEAFDLLQA MNTGHDGSMG TLHANNPREA LSRCESMITM
GGFSLPSRTI REMICASIDV IIQAARLRDG SRRITHITEV MGMEGDTIIT QDLFIYDIIG
EDANGHIVGR HRSTGIGRPR FWERARYYGE EKRLAAALDA AEVAAVT