Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1970 |
Symbol | |
ID | 4022452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2207879 |
End bp | 2209174 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637962163 |
Product | phage major capsid protein, HK97 |
Protein accession | YP_569106 |
Protein GI | 91976447 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.141118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.540615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCGT TCTTCATCAT CACCACAAGG GATCACATGA TGACCACCAC CTTCGACCAC GCTCCCGAGA CCAAGGCCGG AATCGCCGGC GACGATGCGC AGCAGGTGTA CGACGCGCTG ATGCGGACGT TCGAGGACTA CAAGGCGGAG AACGATTCCC GGCTGCAGGC GATCGAGAAG CGCGGCGGCG ATGTCATCGC CGAGGACAAG GTGGCGCGGA TCGACGCCGC GCTCAATGCG CAGCAGCGCC GGCTCGACGA ACTGGCGCTG AAGCAGGCGC GGCCGCAGCT CGGCGCCGAC AGCGCCCTGC GCCCCCGCGG CGCGGCCGAG CACAAGAGCG CGTTCGACGC CTATATCCGC AACGGCGACG CCGCGACGCT GCGGCAGATC GAGACCAAGG CGCTGTCGGT CGGCTCCAAT CCGGACGGCG GCTATCTGGT GCCGGAAGAG TTGGAGCGCA GCATCGCGGC GCGGCTCAGC GCGATCTCAC CGATCCGCGG CCTGGCTTCG GTGCGGCAGA TCTCCGGCAG CGTCTACAAG AAGCCGTTCA TGACCGCGGG TCCTGCGACC GGCTGGGTCG GCGAGGCCGC GGCGCGGCCG CAGACCAGTT CGCCGACGCT GGACGCGCTG TCGTTCCCGG CGATGGAGCT GTATGCGATG CCGGCCGCGA CCGCGACGCT GCTCGACGAT GCCGCGGTCA ATCTCGACGA CTGGCTCACC GGCGAGATCG ACACCGTGTT CGCCGAGCAG GAGGGCGCCG CCTTCGTCTC CGGCGACGGG ATCAACAAAC CGAAAGGCTT TCTCGCCGCG CCGACGGTGG CGAACGCCGC CTGGAGCTGG GGCAATCTCG GTTTCGTCGC CACCGGCGCC GCCGGCGCAT TCCCCGCCAG CAATCCGTCC GACGTGCTGA TCGACCTGAT GTTCGCGCTG AAGCCGGGCT ACCGGCAGAA CGCCAGCTTC GTGATGAACC GGCGGACGCA AGCCGCGATC CGCAAGTTCA AGGACAATAA CGGCGTCTAT CTGTGGCAGC CGCCGGCCAC CGCCTCGGGC CGCGCCAGCC TGATCGGCTT CCCGCTGGCC GACGCCGAGG ACATGCCGGA CATCGCCGCC AATTCGCTCG CCATCGCCTT CGGCGATTTC CGCCGCGGCT ATTTGATCGT CGACCGCCAG GGCGTCCGCG TCCTGCGCGA CCCGTATTCC GCCAAGCCCT ACGTGCTGTT CTACACCACC AAGCGGGTCG GCGGCGGGGT GCAGGATTTT GATGCGATCA AGCTGTTGAA GTTTGGGGGG AGTTGA
|
Protein sequence | MNPFFIITTR DHMMTTTFDH APETKAGIAG DDAQQVYDAL MRTFEDYKAE NDSRLQAIEK RGGDVIAEDK VARIDAALNA QQRRLDELAL KQARPQLGAD SALRPRGAAE HKSAFDAYIR NGDAATLRQI ETKALSVGSN PDGGYLVPEE LERSIAARLS AISPIRGLAS VRQISGSVYK KPFMTAGPAT GWVGEAAARP QTSSPTLDAL SFPAMELYAM PAATATLLDD AAVNLDDWLT GEIDTVFAEQ EGAAFVSGDG INKPKGFLAA PTVANAAWSW GNLGFVATGA AGAFPASNPS DVLIDLMFAL KPGYRQNASF VMNRRTQAAI RKFKDNNGVY LWQPPATASG RASLIGFPLA DAEDMPDIAA NSLAIAFGDF RRGYLIVDRQ GVRVLRDPYS AKPYVLFYTT KRVGGGVQDF DAIKLLKFGG S
|
| |