Gene RPD_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3520 
Symbol 
ID4024034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3907853 
End bp3909295 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content65% 
IMG OID637963724 
Producttype II and III secretion system protein 
Protein accessionYP_570644 
Protein GI91977985 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4964] Flp pilus assembly protein, secretin CpaC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.637726 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTACG GCGCCAATCA CCGATTCATC CGGACTCTGA AGGGCACGGC GATGGCGTTG 
TCCGCCGTCG TCGCGCTGAC GCTGTTTCCG ACCTTGGCGC CGGTGCAGGC CAGCGACTAT
CGCGACACGC CGGCTCGGAT GGGCAACTCC GGCCTCAATG CGCGGCCGCT CGCGCTCGGC
ATCGGCAAGT CGGTCGTGAT CGATCTGCCG CGCGACATCA AGGATGTGCT CGTCGCCGAT
CCGAAGATCG CCAACGCGGT GGTGCGTTCG GCGCAGCGCG CCTACATCAT CGGCGCCGCC
GTCGGCCAGA CCAACATCGT GTTCTTCGAC TCCACCGGCC AGCAGATCGC CGCCTACGAC
ATCGCGGTGA CCCGCGACCT CAACGGCATC CGCACCGCGC TGCGGCAGTC GATCCCCAAT
GCGGACATCC AGGTCGAGGG CCTCGGCGAC GGCGTGATGC TGATAGGGTC GGTGGCGACA
CCGATCGAGG CGCAGCAGGC CGCCGATCTC GCCGCGCGGC TGGCAGGTGA CGCGAGCAAA
GTCGTTAACA ACATCGCGGT CCGCGGCCGC GACCAGGTGA TGCTGAAGAT CACCGTGGCC
GAGGTGCAGC GCGACATCGT CAAGCAGCTC GGCGTCGATC TCACCGCCAG CATGAACTAC
GGCACCTCCG TGGTGAAGTT CAGCAACACC AATCCGTTCA CCCAGTCCGG TGGGCCGCTG
GTGGCGAACA ACGCGCTGAC GACATCGTTC GGCTCGGGGC CGTCAGTGTC AGCGACGCTG
CGCGCGATGG AGAGCGCCGG GGTCGTGCGG ACGCTGGCCG AACCGAACCT CACGGCGATC
TCCGGCGAAC CGGCGAGCTT CCTCGCCGGC GGCGAGTTTC CCGTTCCAAG CGGCGTAACC
TGCACCAACA GCCTTTGCAC GCCGTCGGTG ACGTTCAAGA AGTTCGGTGT TCTGCTCAAC
TTTACCCCGG TGGTTTTGAC CGAGGGCCGG ATCAGCCTGA AGGTCTCGAC CGAAGTCTCC
GAGGTCTCGA GCGACAACTC GATCGTCATC GGCGGCCTGT CGGTGCCCTC GATCAAGACC
CGCCGTATCG AAAGCACGGT GGAAATCCCG TCCGGCGGTT CGCTGGCGAT GGCAGGTTTG
ATCCAGGAAC AGACCAAGCA GGCGATCAAC GGCCTGCCCG GCATGACCCA ACTCCCGATC
CTCGGCACCC TGTTCCGCAG CCGCGACTAC ATCAACCGGC AGACCGAACT GATGGTGATG
GTGACGCCCT ATGTGGTGCG TGCTGTGGCG CAGAAGGATC TGTCGCGGCC CGACGACGGC
TTCGCCGACG CCTCCGATCC ACAGTCGGAT CTGCTCGGCA ATATCAACCG GATCTACGGC
GTCCCCGGAC GCACCGGGCC CGCACAAACC TACCGGGGCC GGTTCGGCTT TATCACCGAC
TGA
 
Protein sequence
MSYGANHRFI RTLKGTAMAL SAVVALTLFP TLAPVQASDY RDTPARMGNS GLNARPLALG 
IGKSVVIDLP RDIKDVLVAD PKIANAVVRS AQRAYIIGAA VGQTNIVFFD STGQQIAAYD
IAVTRDLNGI RTALRQSIPN ADIQVEGLGD GVMLIGSVAT PIEAQQAADL AARLAGDASK
VVNNIAVRGR DQVMLKITVA EVQRDIVKQL GVDLTASMNY GTSVVKFSNT NPFTQSGGPL
VANNALTTSF GSGPSVSATL RAMESAGVVR TLAEPNLTAI SGEPASFLAG GEFPVPSGVT
CTNSLCTPSV TFKKFGVLLN FTPVVLTEGR ISLKVSTEVS EVSSDNSIVI GGLSVPSIKT
RRIESTVEIP SGGSLAMAGL IQEQTKQAIN GLPGMTQLPI LGTLFRSRDY INRQTELMVM
VTPYVVRAVA QKDLSRPDDG FADASDPQSD LLGNINRIYG VPGRTGPAQT YRGRFGFITD