Gene Rsph17029_1133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1133 
Symbol 
ID4895324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1177187 
End bp1178350 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content66% 
IMG OID640111719 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001043015 
Protein GI126461901 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.308903 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAG GCCCCGATCC GGCCGTGGAG GCGAAAGCCG CAATGGCCGG TTTCCTGAAG 
GAGATCAATC GCTTTCAGGA GGAGGTGAAG AATGTGCTGC AACAACAGGA AGAGCGTTTG
ACCATGCTGG ACCGCAAAAC CATGATCTAC GGGCGCCCGG CGCTGGCGGC CGCGGCCGAC
CAGGAGGCGC CGCATCGAAA GGCGTTCGGG GCCTATCTCC GCTCGGGCGA CGACGATGGT
CTGCGCGGCC TCGTCCTCGA GGGCAAGGCG ATGACGGCGA GCGTCGCCTC GGACGGCGGC
TATCTGGTCG ATCCGCAGAC CTCGGACGCC ATCCGCTCGA TGTTGCTGTC CACGGCCTCG
ATCCGTCAGA TCGCCGGTGT GGTCCATGTG GAAGCCACGA GCTTCGACGT GCTGATCGAC
CGCACTGAGG TGGGGTCGGG CTGGGCCACG GAGGCCGCCA CGATCAGCGA AAGCGCCTCG
CCCACCATCG AGCGGATCTC GATCAAGCTG CACGAACTGT CGGCGATGCC GAAGGCGAGC
CAGCGGCTGC TGGACGACTC GGCCTTCGAC GTCGAGAGCT GGCTGGCGGC CAAGATCGCG
ACGCGCTTCA TGCGGGCCGA GAGCGCGGCC TTCGTCAGCG GCGACGGGAT CGACAAGCCG
CGGGGCTTTC TGGCGCCGGC GAAGGTTGCG AACGCGAGCT GGAGCTGGGG CTCGATCGGC
TATGTCCCCT CGGGTGCGGC GAGCGATTTC CTCGCCACGA ACCCGGCCGA TTGTATCATC
ACCCTGATCT ATTCGCTCGG CGCCGATTAC CGCGCGAATG CGACCTTCGT GATGAATTCG
AAGACCGCGG GCGCGGTGCG GAAGATGAAG GACTCGGACG GCCGCTTCCT GTGGTCGGAC
GGTCTGGCCG CGGCGGAGCC TGCGCGGCTG ATGGGATATC CGGTGCTCCT CTGCGAGGAC
ATGCCGGACA TTGCCGCGGG CGCCTTTGCC ATCGCTTTCG GGGATTTCGC CGCCGGCTAC
ACGATCGCCG AGCGGCCCGA GGTGCGGGTT CTGCGCGATC CGTTCTCGGC CAAGCCCCAT
GTCCTCTTCT ATGCGACGAA GCGCGTGGGA GGCGATGTCA GCGACTATGC GGCGATCAAG
CTCCTGAAGA TCGCGGTGTC CTGA
 
Protein sequence
MSAGPDPAVE AKAAMAGFLK EINRFQEEVK NVLQQQEERL TMLDRKTMIY GRPALAAAAD 
QEAPHRKAFG AYLRSGDDDG LRGLVLEGKA MTASVASDGG YLVDPQTSDA IRSMLLSTAS
IRQIAGVVHV EATSFDVLID RTEVGSGWAT EAATISESAS PTIERISIKL HELSAMPKAS
QRLLDDSAFD VESWLAAKIA TRFMRAESAA FVSGDGIDKP RGFLAPAKVA NASWSWGSIG
YVPSGAASDF LATNPADCII TLIYSLGADY RANATFVMNS KTAGAVRKMK DSDGRFLWSD
GLAAAEPARL MGYPVLLCED MPDIAAGAFA IAFGDFAAGY TIAERPEVRV LRDPFSAKPH
VLFYATKRVG GDVSDYAAIK LLKIAVS