Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17025_1753 |
Symbol | |
ID | 5083659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 1789457 |
End bp | 1790764 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640483313 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_001167951 |
Protein GI | 146277792 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.357661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCGGA TTACGGCCCT GCGAGCCCGC CGCGCGGGCA TCCTCGACCA GATGGAAGCG CTGATCGCCT CGGTCCCCGA GGGTGACGAC ATGACCTCCG ATCAGGCGGC GCAGTTCGAC GCGCTGAAGG CGGACGACGA CAAGGTGGCC GCGGAGCTCA CCCGCGCGGA AGACCTGGAG CGCCGCCGCG CCGCCGCTGC GCGACCGGCG GCCGCTCTTC CCGGCGTGAC GCCGCCGGCT GCATCGGTCC CGGCGCAGCC GGCCGAGAAG GGCATCACCT TTGCGCGCAT GGTCCGCACG ATCGCGGCCG CCGGTGGCAA CGCATATGTT GCCGGACAGA TCGCCGAGGC GAACGGGGAC AGCGGCCTCT TCGCAAACCA GAACATGGGG TCTGGCGCGG CTGGTGGCTT CCTCGTGCCC GAGGATGTGT CGGCCGAGGT CATCGAGCTC CTTCGCCCTG CGAGCGTCGT CACGGCGATG GGGCCGCGCA TCGTGCCGAT GCCGAATGGC AACCTTACCA CCAACCGGCG TGCGACCGGA GCGACGTTCG GTTACGGCGG CGAGCAGACC GACGCTCCCG CCACCGGCTA CAACTATGGG CAGGTGAAGC TCTCGGCGAA GAAGCTGCGT GGCATCATCC CCGTTTCGAA CGACCTGCTG CGCTCCGCCT CCGTGTCGGT TGATCGGATG ATCCGTGATG ATGCGGTCGC CGACGCTGCT CTGATCCAGG ACCGCTACTT CCTGCGCGGC GCCGGGACGG AGTTCGCGCC CCGCGGCCTT CGCTATCAGC ATGTCGGGAC TCCGTTCGAA GAGACCCATG TCTTGGCCAT GACCGCTTCG CCGACGCTGC AGAAGGTGAC CAACGACTTG GCGCGTCTCG AACTGGCGCT TGCGAACGCG AACGTGGTGC AGACCAACGC CCACTGGATC ATGTCGCCGC GCACCGAGAG CTATCTGGCG AACCTGCGCG ACGGGAACGG CAACCTTGCC TTCCCGGAAA TGCAGAATGG CGTGCTGCGC CGGAAGCCCG TTCACGTCAC CACCGAGATC CCGGACAACC TCGGCGTCGG CGGAAACGAA AGCGAGCTCA TGCTGGCCGA TCCCACGCAC ATCATGGTGG GTGAGCACAT GGGTATCGAA ATCGCGATGT CCACCGAGGC CGCTTACAAG GATGCGTCCG GCACGATGCA GGCGGCGTTC TCGCGCGATG AGACGCTGAT GCGGATGATC ATGCAGCACG ACATCGGGCT GCGGCATCTC GCTGCCGTGG CCATTCTGAC CGGCGTCACC TGGTCGCCGG CCCCCTGA
|
Protein sequence | MDRITALRAR RAGILDQMEA LIASVPEGDD MTSDQAAQFD ALKADDDKVA AELTRAEDLE RRRAAAARPA AALPGVTPPA ASVPAQPAEK GITFARMVRT IAAAGGNAYV AGQIAEANGD SGLFANQNMG SGAAGGFLVP EDVSAEVIEL LRPASVVTAM GPRIVPMPNG NLTTNRRATG ATFGYGGEQT DAPATGYNYG QVKLSAKKLR GIIPVSNDLL RSASVSVDRM IRDDAVADAA LIQDRYFLRG AGTEFAPRGL RYQHVGTPFE ETHVLAMTAS PTLQKVTNDL ARLELALANA NVVQTNAHWI MSPRTESYLA NLRDGNGNLA FPEMQNGVLR RKPVHVTTEI PDNLGVGGNE SELMLADPTH IMVGEHMGIE IAMSTEAAYK DASGTMQAAF SRDETLMRMI MQHDIGLRHL AAVAILTGVT WSPAP
|
| |