Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_1805 |
Symbol | |
ID | 3972070 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1962048 |
End bp | 1963298 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637924918 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_531683 |
Protein GI | 90423313 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.539863 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.129536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCT ACGCCGACGA TCACGCCCCG GAAACCAAGG CCGGCATCGC CGTCGACGCG CAGGACGAGC TGCGCGCCAC CTTCGAGGAT TTCAAATCCG CCAATGACGA GCGGCTGGCG GAGCTGGAGC GCCGCCGCGG CGACGTGCTG TTGGAGGAGA AAGTCGACCG CATCAACGCC GCGCTCGACG CCCAGCAACG CAAGCTCGAC GAGCTCGTGC TGCGCCAGGC CCGGCCGCAG CTCGAGGGCC GCGGCAAGAA CGCCGGCGAT GTCGCCGGCC GCGAGCACAA GAGCGCGTTC GAGGCCTACG TCCGTGGCGG CGACGCCGCG CCGCTGCGCG CGCTGGAAAG CAAGGCGATG TCGGTCGGCT CCAATCCGGA CGGCGGCTAT CTGGTGCCGG TGGAGCTGGA AACCGCGATC GGCGAGCGGC TGGCGGTGAT TTCGCCGGTG CGCGGGCTGT CCGCGGTGCG GACGATTTCC GGCAGCGTCT ACAAGAAGCC GTTCATGACC GCGGGCCCGG CCACCGGCTG GGTCGGCGAG ACCGACGCCC GCACCCAGAC CGCCTCGCCG ACGCTCGATG CGCTGTCGTT TCCGGCGATG GAGCTCTACG CCATGCCGGC GGCGACCGCG ACGCTGCTGG AGGACTCGGC CATCAATCTC GACGAGTGGC TGGCCTCCGA GATCGACCAG GTGTTCGCCG AGCAGGAGAG CACCGCCTTC GTCAACGGCG ACGGCATCAA CAAGCCGAAG GGTTTTCTGG CCTATCCGAC GGTGGTCAAC GCGACCTGGA GCTGGGGCAA CATCGGCAGC ATCCTGTCCG GCGCCGCCGG CGGCTTTGCG GCGCAAAATC CCTCCGACGT GCTGGTCGAC CTGATCTACG CGCTGAAGGC CGGCTATCGG CAGAATGCCA GCTTCGTGAT GAACCGCCGC ACCCAGGCCG CGATCCGCAA GTTCAAGGAC TCCACCGGGG TCTATCTGTG GCAGCCGCCG GCGCAGCCCG GCGGCCGCGC CAGCCTGATC GGCTTTCCGC TGGCCGACGC CGAGGACATG CCGGATATCG CGGCGAACTC GCTGTCGATC GCGTTCGGCG ATTTCCGCCG CGGCTACCTG ATCGTCGACC GCCAGGGCGT CCGCGTGCTG CGCGATCCGT ATTCCGCCAA GCCCTACGTG CTGTTCTATA CCACCAAGCG GGTCGGCGGC GGCGTGCAGG ACTTCGACGC CATCAAGCTG CTGAAGTTCG CGGCGGGGTG A
|
Protein sequence | MSIYADDHAP ETKAGIAVDA QDELRATFED FKSANDERLA ELERRRGDVL LEEKVDRINA ALDAQQRKLD ELVLRQARPQ LEGRGKNAGD VAGREHKSAF EAYVRGGDAA PLRALESKAM SVGSNPDGGY LVPVELETAI GERLAVISPV RGLSAVRTIS GSVYKKPFMT AGPATGWVGE TDARTQTASP TLDALSFPAM ELYAMPAATA TLLEDSAINL DEWLASEIDQ VFAEQESTAF VNGDGINKPK GFLAYPTVVN ATWSWGNIGS ILSGAAGGFA AQNPSDVLVD LIYALKAGYR QNASFVMNRR TQAAIRKFKD STGVYLWQPP AQPGGRASLI GFPLADAEDM PDIAANSLSI AFGDFRRGYL IVDRQGVRVL RDPYSAKPYV LFYTTKRVGG GVQDFDAIKL LKFAAG
|
| |