Gene RPD_1970 details

Gene Information

Locus tag: RPD_1970
Symbol:
ID: 4022452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2207879
End bp: 2209174
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 68%
IMG OID: 637962163
Product: phage major capsid protein, HK97
Protein accession: YP_569106
Protein GI: 91976447
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
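
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2207879, 2209174)
print(length)                     # 1296
print(protein_length_aa(length))  # 431
```

Both results agree with the Gene Length and Protein Length fields listed above.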


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.141118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.540615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCGT TCTTCATCAT CACCACAAGG GATCACATGA TGACCACCAC CTTCGACCAC 
GCTCCCGAGA CCAAGGCCGG AATCGCCGGC GACGATGCGC AGCAGGTGTA CGACGCGCTG
ATGCGGACGT TCGAGGACTA CAAGGCGGAG AACGATTCCC GGCTGCAGGC GATCGAGAAG
CGCGGCGGCG ATGTCATCGC CGAGGACAAG GTGGCGCGGA TCGACGCCGC GCTCAATGCG
CAGCAGCGCC GGCTCGACGA ACTGGCGCTG AAGCAGGCGC GGCCGCAGCT CGGCGCCGAC
AGCGCCCTGC GCCCCCGCGG CGCGGCCGAG CACAAGAGCG CGTTCGACGC CTATATCCGC
AACGGCGACG CCGCGACGCT GCGGCAGATC GAGACCAAGG CGCTGTCGGT CGGCTCCAAT
CCGGACGGCG GCTATCTGGT GCCGGAAGAG TTGGAGCGCA GCATCGCGGC GCGGCTCAGC
GCGATCTCAC CGATCCGCGG CCTGGCTTCG GTGCGGCAGA TCTCCGGCAG CGTCTACAAG
AAGCCGTTCA TGACCGCGGG TCCTGCGACC GGCTGGGTCG GCGAGGCCGC GGCGCGGCCG
CAGACCAGTT CGCCGACGCT GGACGCGCTG TCGTTCCCGG CGATGGAGCT GTATGCGATG
CCGGCCGCGA CCGCGACGCT GCTCGACGAT GCCGCGGTCA ATCTCGACGA CTGGCTCACC
GGCGAGATCG ACACCGTGTT CGCCGAGCAG GAGGGCGCCG CCTTCGTCTC CGGCGACGGG
ATCAACAAAC CGAAAGGCTT TCTCGCCGCG CCGACGGTGG CGAACGCCGC CTGGAGCTGG
GGCAATCTCG GTTTCGTCGC CACCGGCGCC GCCGGCGCAT TCCCCGCCAG CAATCCGTCC
GACGTGCTGA TCGACCTGAT GTTCGCGCTG AAGCCGGGCT ACCGGCAGAA CGCCAGCTTC
GTGATGAACC GGCGGACGCA AGCCGCGATC CGCAAGTTCA AGGACAATAA CGGCGTCTAT
CTGTGGCAGC CGCCGGCCAC CGCCTCGGGC CGCGCCAGCC TGATCGGCTT CCCGCTGGCC
GACGCCGAGG ACATGCCGGA CATCGCCGCC AATTCGCTCG CCATCGCCTT CGGCGATTTC
CGCCGCGGCT ATTTGATCGT CGACCGCCAG GGCGTCCGCG TCCTGCGCGA CCCGTATTCC
GCCAAGCCCT ACGTGCTGTT CTACACCACC AAGCGGGTCG GCGGCGGGGT GCAGGATTTT
GATGCGATCA AGCTGTTGAA GTTTGGGGGG AGTTGA
 
Protein sequence
MNPFFIITTR DHMMTTTFDH APETKAGIAG DDAQQVYDAL MRTFEDYKAE NDSRLQAIEK 
RGGDVIAEDK VARIDAALNA QQRRLDELAL KQARPQLGAD SALRPRGAAE HKSAFDAYIR
NGDAATLRQI ETKALSVGSN PDGGYLVPEE LERSIAARLS AISPIRGLAS VRQISGSVYK
KPFMTAGPAT GWVGEAAARP QTSSPTLDAL SFPAMELYAM PAATATLLDD AAVNLDDWLT
GEIDTVFAEQ EGAAFVSGDG INKPKGFLAA PTVANAAWSW GNLGFVATGA AGAFPASNPS
DVLIDLMFAL KPGYRQNASF VMNRRTQAAI RKFKDNNGVY LWQPPATASG RASLIGFPLA
DAEDMPDIAA NSLAIAFGDF RRGYLIVDRQ GVRVLRDPYS AKPYVLFYTT KRVGGGVQDF
DAIKLLKFGG S
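
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the tool the database used. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as start codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sketch assumes an ATG start, which holds for this gene.

```python
from itertools import product

# Codon order follows the NCBI genetic code listing (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon and drop a trailing stop codon.

    Assumption: the first codon is ATG. Alternative start codons allowed
    by table 11 (for example GTG or TTG) are not converted to M here.
    """
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAATCCGT TCTTCATCAT CACCACAAGG"
print(translate(first_blocks))  # MNPFFIITTR
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field in this record.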