Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_0277 |
Symbol | |
ID | 4896429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 300012 |
End bp | 301169 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640110860 |
Product | HK97 family phage major capsid protein |
Protein accession | YP_001042167 |
Protein GI | 126461053 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCACA TGACGAAGGA CCAGCTGCTC GGCAGCACCA AGCTCGACCG GAAGGGCGAG GAGGACGATC CGGCCGGGAT CGTCACGAAG GCGCTCGAGG ATCTGTCGAA GACGCTCAAT GAGCGGATCG GCGACCTCGA GAAGAAAGCC GACACGTCGC CGCTCGTCGC GCGGCTCGAC AAGCTCGAGG CGAAGGTGAA CCGCCCCGGC ACGACCGAGC CGAAGCCCGA GGCTGAGGTC GAGCGCAAGG CGTTCGGCGC CTATCTGCGC TCCGGCCCCG CGGCCCCGGC CGAGGAGCTG AAGGCGCTGA CAGTCTCCAG CGATCCGCAG GGCGGCTATC TGGCGCCCGC CGAAATGTCG ACCGAGTTCA TCCGCGACCT GGTCGAGTTC TCGCCCGTCC GCGGCGTGGC GGCGATCCGC GGCACGGCCG CGCCCTCGGT GATCTACCCG ACCCGTACCG GCATCACGAA TGCGAAGTGG AAGGGCGAGA CGCAGGCGCA GGAGGCCTCC GAGCCGGGCT TCGGTCAGGC CGAGGTCGTG GTGAAGGAGG TCAACACCTA CGTCGATATC TCGAACCAGC TCCTCGCGGA CAGCGCCGGG CAGGCCGAGG CCGAGGTTCG CCTCGCGCTC GCCGAGGACT TCGGCCAGAA GGAGGGCCTC GCCTTCGTGT CCGGTGACGG CGTGCTCGCG CCGGAAGGCT TCATGAACGC GGCCGGCATC TCCTACACCG CCAACGGCCA CGCGACCGAT CTCAAGGCCG ACGCGCTCAT CACCATGCTC TATGCGATCC CGGCGACCCA CCGGAACCGC GGCGCGTGGG CCATGAACGG CACCACGCTC GGCGTCCTGC GGAAGCTGAA GGACGGACAG GGCAACTTCC TGTGGCAGCC GTCCTATCAG GCGGGCCAGC CCGAGACGAT CCTCGGCCGC CCGGTGGTCG AGATGGTGGA CATGCCCGAC CTCGAATCCG GCTCGTTCCC CATCGCCTAT GCGGACTGGT CGGGCTACCG GATCGTGGAC CGCACGAGCC TGAGCATCCT GGTCAACCCC TACATCAAGG CGACCGAGGG CCTGACCCGC ATCCATGCGA CCCGCCGTGT CGGCGGCCGC GTCCTGCAGC CTGCGAAGTT CCGCAAGCTG AAGATGGCCA CCTCGTAA
|
Protein sequence | MRHMTKDQLL GSTKLDRKGE EDDPAGIVTK ALEDLSKTLN ERIGDLEKKA DTSPLVARLD KLEAKVNRPG TTEPKPEAEV ERKAFGAYLR SGPAAPAEEL KALTVSSDPQ GGYLAPAEMS TEFIRDLVEF SPVRGVAAIR GTAAPSVIYP TRTGITNAKW KGETQAQEAS EPGFGQAEVV VKEVNTYVDI SNQLLADSAG QAEAEVRLAL AEDFGQKEGL AFVSGDGVLA PEGFMNAAGI SYTANGHATD LKADALITML YAIPATHRNR GAWAMNGTTL GVLRKLKDGQ GNFLWQPSYQ AGQPETILGR PVVEMVDMPD LESGSFPIAY ADWSGYRIVD RTSLSILVNP YIKATEGLTR IHATRRVGGR VLQPAKFRKL KMATS
|
| |