Gene RPD_4319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4319 
Symbol 
ID4024843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4785713 
End bp4786816 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content68% 
IMG OID637964528 
ProductSel1 
Protein accessionYP_571437 
Protein GI91978778 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.754604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.455844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGC TGCGCCCGAC CTCGATCCTA GCGGCGGCGC TGATGCTGCT CGCCACCAGC 
GCATCCGCGC AATTGTCTCT GACGCCGCCG CCGCCCAATC CTTTTCCGAA GCCGATCGAG
CCGGAAAAGC CGAAGCCCAA ACCGAAATCC GACCCGAAGC CGCCCGCGGC CGAGAAGGAC
AAGGCCAAAA AGCCTGCGGC CGACAAGGCC GGCGCGGCGA AGCCCGGCGG CGCGCCGACC
GCCGAGGACG CCGCCAATCT CGACGATCCC AATGTCGACC TGGTGTATGG CGCGTATCAG
CGCGGCTTCT ACAAGACCGC GTTCGAGATC GCGATCAAGC GCGCGCAGGA GCAGAACGAT
CCCAAGGCGA TGACCATGCT GGGCGAGCTC TACGCCAATG CGCTGGGGGT CAAGCGCGAC
TACAGCAAGG CCGTGGAATG GTACAGGCGG GCGGCCGATC TCGGCGATCG CGAGGCGATG
TTCTCGCTGG CGATGGCGCG AATGGCCGGG CGCGGCGGCG CCGCCAGCCG CGAAGAAGCC
GCCAAATGGC TGGCGTCCTC GGCCAAGCTC GGCGAACCGA AGGCGGCGTA TAATCTGGCG
CTGCTGTATC TCGACGGCCA GACCTTCCCG CAGGATATCA AGCGCGCCGC AGAATTGCTG
CGGGTGGCGG CCGACGCCGG AAATTCCGAG GCGCAATATG CGCTGGCGAC CTTCTACAAG
GAGGGCACCG GCGTCGAGAA GAACCTCGAC CAGGCGGTGC GGCTGCTGCA ATCGGCGGCG
CTCGCCGGCA ATGTCCCGGC CCAGGTCGAA TACGCGATCG CGCTCTACAA CGGCACCGGT
ACGGTGAAGA ACGAGCCCGC CGCGGTGGCG ATGCTGCGCA AGGCTGCGCG CGCCAACAAC
CCGATCGCGC AGAACCGGCT GGCGCATGTG CTGCTCAACG GCCAGGGCGC GCCGCGCGAT
CCGGTCGAGG CGATCAAATG GCACCTGGTC GCCAAGACCG CCGGCAAGGG CGACCTGATG
CTCGACGAGG CGCAGGCGCA GCTCAGCGCC GAGGACCGCG CCAAGGCCCA GGACGCCGCG
CGCAAATGGG TCGGCAGCAA GTGA
 
Protein sequence
MKALRPTSIL AAALMLLATS ASAQLSLTPP PPNPFPKPIE PEKPKPKPKS DPKPPAAEKD 
KAKKPAADKA GAAKPGGAPT AEDAANLDDP NVDLVYGAYQ RGFYKTAFEI AIKRAQEQND
PKAMTMLGEL YANALGVKRD YSKAVEWYRR AADLGDREAM FSLAMARMAG RGGAASREEA
AKWLASSAKL GEPKAAYNLA LLYLDGQTFP QDIKRAAELL RVAADAGNSE AQYALATFYK
EGTGVEKNLD QAVRLLQSAA LAGNVPAQVE YAIALYNGTG TVKNEPAAVA MLRKAARANN
PIAQNRLAHV LLNGQGAPRD PVEAIKWHLV AKTAGKGDLM LDEAQAQLSA EDRAKAQDAA
RKWVGSK