Gene RPC_3008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3008 
Symbol 
ID3973615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3302409 
End bp3303620 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content64% 
IMG OID637926119 
Producttype II secretion system protein 
Protein accessionYP_532872 
Protein GI90424502 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.314245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAATT TCCGCTACCG CGCGCTGACC CAGACTGGCG AGATCGTCTC GGGGTCGATT 
TGCGCGGCGA GTCTGGGAGA GGTGTCGCAA CGGATCGAGT ATCTCGGTCT GGTCGCCATC
GATATGGTCT TGGACAATGC GGCGAGCGGC TGGTCGTTGC AGCTGGGGTC GTTGTCGAAC
CCGCGCCCCG AGGACGTCAC GGTCTTGACG CGGGATCTCG CTCTGCTGAT CAAGGCGGGG
GCCCGGCTCA ACGATGCCCT CGAATTGTTG GCCAATGACA TGGACGTCGG CCGGCTGCGC
CCGGTGGTGC TGAAGATTAG AACGGCGATT CTTTCCGGCG AGAGTTTTGC CGAAGCGCTG
GAACGTCATC CGGCGCAGTT CCCAGCGATG TACGTTGCAT TGATCCGGGT CGGAGAGATG
TCCGGGACGC TCGACCGCAT CCTGGAGACG CTTGGCACCG AGCGTAATCG CGCCGAAGCG
CTGCGCCGGA AAGTGACCGA CGCGTTGCAG TATCCGGCCT TCGTGTTGCT GGCGGCCGGC
GGCGTGCTGA TCTTCTTTGT CTGTTTCGTG TTGCCGCAGT TTTCGTCGGT GCTGCGCGAC
TTCAACGCGA AACTCGATCC GGTCATGGAG ATGTTCATGG CGCTGTCGGA TCTGCTGCGC
GGCCACGGCA TCGAAGTTGC CGCAACCGCC GCTGGGATCA TCATCGGCGG CTGGCTGCTA
TGGCGCAGAC CCGGCGTTCG CGCCGGCACG GTCGCGCAAT TGGCGCGGCT GCCGGTGATT
TACTCGGTGG TCGAGTTTCA CCGCACCGCC TTGTTCTGCC GCAATCTCGG TATTCTGCTC
GGCAGCGGCG TGACGCTGAC CGCGACGCTG CGGATTCTGA CCGACATCAT GTCGGAGACC
GGAAACGTCC CGGTTTGGAC GGCGATGGCC GATCGGGTCC GCCACGGCGG CAAGCTCTGC
GATGCGCTCA CCAACGCAGC AATGCTACCG CCGGTGGCGG TGCGCATGCT TCGGCTGGGC
GAGGAAACCG GGCAACTGCC GACGCTCGCT ACCCGCGTCG CGGAATTCTA CGAGACCAAG
CTGCAGCGCC AGCTCGATCG CTTGGTCGGG GTCATCGGGC CGGCCGCCAT TGTCATGATC
AGCGTCGTGG TCGGCGGGCT GATTGTTTCG GTGATGACCG CATTGTTGTC GGTCACCCAG
GTTGTCGGAT GA
 
Protein sequence
MPNFRYRALT QTGEIVSGSI CAASLGEVSQ RIEYLGLVAI DMVLDNAASG WSLQLGSLSN 
PRPEDVTVLT RDLALLIKAG ARLNDALELL ANDMDVGRLR PVVLKIRTAI LSGESFAEAL
ERHPAQFPAM YVALIRVGEM SGTLDRILET LGTERNRAEA LRRKVTDALQ YPAFVLLAAG
GVLIFFVCFV LPQFSSVLRD FNAKLDPVME MFMALSDLLR GHGIEVAATA AGIIIGGWLL
WRRPGVRAGT VAQLARLPVI YSVVEFHRTA LFCRNLGILL GSGVTLTATL RILTDIMSET
GNVPVWTAMA DRVRHGGKLC DALTNAAMLP PVAVRMLRLG EETGQLPTLA TRVAEFYETK
LQRQLDRLVG VIGPAAIVMI SVVVGGLIVS VMTALLSVTQ VVG