Gene Rsph17025_1078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1078 
Symbol 
ID5084681 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1105610 
End bp1106773 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content66% 
IMG OID640482636 
ProductHK97 family phage major capsid protein 
Protein accessionYP_001167284 
Protein GI146277125 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.506947 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCAG GCCCCGACCC GGCTGGAGAG GCGAAAGCCG CAATGGCCGG CTTCCTGAGG 
GAGATCAAGC TCTTTCAGGA TGAGGTGAAG ACCGTGTTGC AACAACAGGA AGAGCGTTTG
ACCATGCTGG ACCGCAAAAC CATGATCTAC GGGCGCCCGG CGCTCTCGGC CGCTGCCGAC
CAGGAGGCCC CGCATCGCAA GGCCTTTGGG GCCTATCTCC GCTCGGGCGA CGACGACGGG
CTGCGCGGGC TTGTCCTGGA GGGCAAGGCG ATGACGACTG CGGTGGCGGC GGACGGCGGC
TATCTGGTGG ATTCCCAGAC GTCGGACACG ATCCGCTCGA TGCTGCTCTC GACGGCCTCG
ATCCGGCAGA TCGCCGGTGT GGTCAATGTG GAGGCCACGA GCTTCGAGGT TCTCATCGAC
CGTACCGAAG TGGGCTCGGG CTGGGCAACC GAGGCGAGCG CCATCAGCGA GAGTGCGACG
CCCGCGATCG AACGCATCTC GATCAAGCTC CACGAGCTGT CGGCCATGCC GAAGGCGAGC
CAGCGGCTTC TGGACGACGC GGCCTTCGAC GTGGAGGGGT GGCTTGCGGG CAAGATTGCC
ACACGCTTCA TGCGGGCCGA GAGCGCGGCC TTCGTGAGCG GCGACGGCGC TGACAAGCCG
CGCGGGTTCC TCGCGCCTGC GAAGGTGCCG AACGCGTCCT GGACCTGGGG CAACATCGGA
TACATCCCCA CGGGCGCGAC GAATGACTTT CTTGGAACGA ACCCGGCCGA CTGCATCATC
AACCTGATCT ATGCGCTGGG CGCCGATTAC CGCGCCAACG CGACCTTCGT GATGAACTCG
AAGACCGCGG GCGCGGTGCG GAAGATGAAG GACTCGGACG GCCGCTTCCT GTGGTCGGAT
GGTCTGGCCG CGGCCGAGCC CGCGCGGCTG ATGGGCTATC CGGTGCTGCT GTGCGAGGAC
ATGCCGGACA TCGCCGCCAA TGCCTTCGCC ATCGCCTTCG GGGATTTTGC AGCGGGTTAC
ACGATCGCCG AGCGGCCGGA GGTTCGGGTG CTGCGCGATC CGTTCTCGGC CAAGCCGCAC
GTCCTGTTCT ACGCCACGAA GCGGGTCGGC GGTGACGTGA CCGATTATGC GGCGATCAAG
CTGCTGAAGA TCGCGGTGTC CTGA
 
Protein sequence
MSAGPDPAGE AKAAMAGFLR EIKLFQDEVK TVLQQQEERL TMLDRKTMIY GRPALSAAAD 
QEAPHRKAFG AYLRSGDDDG LRGLVLEGKA MTTAVAADGG YLVDSQTSDT IRSMLLSTAS
IRQIAGVVNV EATSFEVLID RTEVGSGWAT EASAISESAT PAIERISIKL HELSAMPKAS
QRLLDDAAFD VEGWLAGKIA TRFMRAESAA FVSGDGADKP RGFLAPAKVP NASWTWGNIG
YIPTGATNDF LGTNPADCII NLIYALGADY RANATFVMNS KTAGAVRKMK DSDGRFLWSD
GLAAAEPARL MGYPVLLCED MPDIAANAFA IAFGDFAAGY TIAERPEVRV LRDPFSAKPH
VLFYATKRVG GDVTDYAAIK LLKIAVS