Gene Information |
Locus tag | Rsph17025_1078 |
Symbol | |
ID | 5084681 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1105610 |
End bp | 1106773 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640482636 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001167284 |
Protein GI | 146277125 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
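The coordinate and length fields above are related by simple arithmetic, sketched below. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(1105610, 1106773)
print(length)                  # 1164
print(protein_length(length))  # 387
```

Both results agree with the Gene Length and Protein Length rows of the record.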
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.506947 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGCAG GCCCCGACCC GGCTGGAGAG GCGAAAGCCG CAATGGCCGG CTTCCTGAGG GAGATCAAGC TCTTTCAGGA TGAGGTGAAG ACCGTGTTGC AACAACAGGA AGAGCGTTTG ACCATGCTGG ACCGCAAAAC CATGATCTAC GGGCGCCCGG CGCTCTCGGC CGCTGCCGAC CAGGAGGCCC CGCATCGCAA GGCCTTTGGG GCCTATCTCC GCTCGGGCGA CGACGACGGG CTGCGCGGGC TTGTCCTGGA GGGCAAGGCG ATGACGACTG CGGTGGCGGC GGACGGCGGC TATCTGGTGG ATTCCCAGAC GTCGGACACG ATCCGCTCGA TGCTGCTCTC GACGGCCTCG ATCCGGCAGA TCGCCGGTGT GGTCAATGTG GAGGCCACGA GCTTCGAGGT TCTCATCGAC CGTACCGAAG TGGGCTCGGG CTGGGCAACC GAGGCGAGCG CCATCAGCGA GAGTGCGACG CCCGCGATCG AACGCATCTC GATCAAGCTC CACGAGCTGT CGGCCATGCC GAAGGCGAGC CAGCGGCTTC TGGACGACGC GGCCTTCGAC GTGGAGGGGT GGCTTGCGGG CAAGATTGCC ACACGCTTCA TGCGGGCCGA GAGCGCGGCC TTCGTGAGCG GCGACGGCGC TGACAAGCCG CGCGGGTTCC TCGCGCCTGC GAAGGTGCCG AACGCGTCCT GGACCTGGGG CAACATCGGA TACATCCCCA CGGGCGCGAC GAATGACTTT CTTGGAACGA ACCCGGCCGA CTGCATCATC AACCTGATCT ATGCGCTGGG CGCCGATTAC CGCGCCAACG CGACCTTCGT GATGAACTCG AAGACCGCGG GCGCGGTGCG GAAGATGAAG GACTCGGACG GCCGCTTCCT GTGGTCGGAT GGTCTGGCCG CGGCCGAGCC CGCGCGGCTG ATGGGCTATC CGGTGCTGCT GTGCGAGGAC ATGCCGGACA TCGCCGCCAA TGCCTTCGCC ATCGCCTTCG GGGATTTTGC AGCGGGTTAC ACGATCGCCG AGCGGCCGGA GGTTCGGGTG CTGCGCGATC CGTTCTCGGC CAAGCCGCAC GTCCTGTTCT ACGCCACGAA GCGGGTCGGC GGTGACGTGA CCGATTATGC GGCGATCAAG CTGCTGAAGA TCGCGGTGTC CTGA
Protein sequence | MSAGPDPAGE AKAAMAGFLR EIKLFQDEVK TVLQQQEERL TMLDRKTMIY GRPALSAAAD QEAPHRKAFG AYLRSGDDDG LRGLVLEGKA MTTAVAADGG YLVDSQTSDT IRSMLLSTAS IRQIAGVVNV EATSFEVLID RTEVGSGWAT EASAISESAT PAIERISIKL HELSAMPKAS QRLLDDAAFD VEGWLAGKIA TRFMRAESAA FVSGDGADKP RGFLAPAKVP NASWTWGNIG YIPTGATNDF LGTNPADCII NLIYALGADY RANATFVMNS KTAGAVRKMK DSDGRFLWSD GLAAAEPARL MGYPVLLCED MPDIAANAFA IAFGDFAAGY TIAERPEVRV LRDPFSAKPH VLFYATKRVG GDVTDYAAIK LLKIAVS
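The gene and protein sequences above can be cross-checked with a small translation routine. This is a minimal sketch, not the database's own method. It assumes the sequence is already in frame on the coding strand, strips the display spacing, and uses the codon-to-amino-acid assignments of translation table 11. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering; the amino acid
# assignments are those of translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used for display and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in the record above.
prefix = "ATGTCCGCAG GCCCCGACCC GGCTGGAGAG"
print(translate(prefix))  # MSAGPDPAGE
```

Passing the full gene sequence to `translate` and `gc_content` allows the result to be compared with the Protein sequence and GC content rows of the record.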