Gene Rsph17025_1753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_1753 
Symbol 
ID5083659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp1789457 
End bp1790764 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID640483313 
Productphage major capsid protein, HK97 
Protein accessionYP_001167951 
Protein GI146277792 
COG category[R] General function prediction only 
COG ID[COG4653] Predicted phage phi-C31 gp36 major capsid-like protein 
TIGRFAM ID[TIGR01554] phage major capsid protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.357661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGGA TTACGGCCCT GCGAGCCCGC CGCGCGGGCA TCCTCGACCA GATGGAAGCG 
CTGATCGCCT CGGTCCCCGA GGGTGACGAC ATGACCTCCG ATCAGGCGGC GCAGTTCGAC
GCGCTGAAGG CGGACGACGA CAAGGTGGCC GCGGAGCTCA CCCGCGCGGA AGACCTGGAG
CGCCGCCGCG CCGCCGCTGC GCGACCGGCG GCCGCTCTTC CCGGCGTGAC GCCGCCGGCT
GCATCGGTCC CGGCGCAGCC GGCCGAGAAG GGCATCACCT TTGCGCGCAT GGTCCGCACG
ATCGCGGCCG CCGGTGGCAA CGCATATGTT GCCGGACAGA TCGCCGAGGC GAACGGGGAC
AGCGGCCTCT TCGCAAACCA GAACATGGGG TCTGGCGCGG CTGGTGGCTT CCTCGTGCCC
GAGGATGTGT CGGCCGAGGT CATCGAGCTC CTTCGCCCTG CGAGCGTCGT CACGGCGATG
GGGCCGCGCA TCGTGCCGAT GCCGAATGGC AACCTTACCA CCAACCGGCG TGCGACCGGA
GCGACGTTCG GTTACGGCGG CGAGCAGACC GACGCTCCCG CCACCGGCTA CAACTATGGG
CAGGTGAAGC TCTCGGCGAA GAAGCTGCGT GGCATCATCC CCGTTTCGAA CGACCTGCTG
CGCTCCGCCT CCGTGTCGGT TGATCGGATG ATCCGTGATG ATGCGGTCGC CGACGCTGCT
CTGATCCAGG ACCGCTACTT CCTGCGCGGC GCCGGGACGG AGTTCGCGCC CCGCGGCCTT
CGCTATCAGC ATGTCGGGAC TCCGTTCGAA GAGACCCATG TCTTGGCCAT GACCGCTTCG
CCGACGCTGC AGAAGGTGAC CAACGACTTG GCGCGTCTCG AACTGGCGCT TGCGAACGCG
AACGTGGTGC AGACCAACGC CCACTGGATC ATGTCGCCGC GCACCGAGAG CTATCTGGCG
AACCTGCGCG ACGGGAACGG CAACCTTGCC TTCCCGGAAA TGCAGAATGG CGTGCTGCGC
CGGAAGCCCG TTCACGTCAC CACCGAGATC CCGGACAACC TCGGCGTCGG CGGAAACGAA
AGCGAGCTCA TGCTGGCCGA TCCCACGCAC ATCATGGTGG GTGAGCACAT GGGTATCGAA
ATCGCGATGT CCACCGAGGC CGCTTACAAG GATGCGTCCG GCACGATGCA GGCGGCGTTC
TCGCGCGATG AGACGCTGAT GCGGATGATC ATGCAGCACG ACATCGGGCT GCGGCATCTC
GCTGCCGTGG CCATTCTGAC CGGCGTCACC TGGTCGCCGG CCCCCTGA
 
Protein sequence
MDRITALRAR RAGILDQMEA LIASVPEGDD MTSDQAAQFD ALKADDDKVA AELTRAEDLE 
RRRAAAARPA AALPGVTPPA ASVPAQPAEK GITFARMVRT IAAAGGNAYV AGQIAEANGD
SGLFANQNMG SGAAGGFLVP EDVSAEVIEL LRPASVVTAM GPRIVPMPNG NLTTNRRATG
ATFGYGGEQT DAPATGYNYG QVKLSAKKLR GIIPVSNDLL RSASVSVDRM IRDDAVADAA
LIQDRYFLRG AGTEFAPRGL RYQHVGTPFE ETHVLAMTAS PTLQKVTNDL ARLELALANA
NVVQTNAHWI MSPRTESYLA NLRDGNGNLA FPEMQNGVLR RKPVHVTTEI PDNLGVGGNE
SELMLADPTH IMVGEHMGIE IAMSTEAAYK DASGTMQAAF SRDETLMRMI MQHDIGLRHL
AAVAILTGVT WSPAP