Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_3980 |
Symbol | |
ID | 6411662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4271235 |
End bp | 4272383 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 642713862 |
Product | phage major capsid protein, HK97 family |
Protein accession | YP_001992951 |
Protein GI | 192292346 |
COG category | [R] General function prediction only |
COG ID | [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein |
TIGRFAM ID | [TIGR01554] phage major capsid protein, HK97 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCCC ACTACGAACT GGAACTGAAG GAAGCCGACA CCGACCCGGC CGAACTGGTC ACCAAAGCGC TCGCCGACTT CAAGGATGCC GTCGACGGCC GCTTGACCGC GATCGAAACC AAGGCCGCGA ACGACAACAA GCTGGCCGAT CGGCTTGACC GTATCGAAGC CAAAATGAAC CGGCCGGGCA TCGGTGCCGC GAACGACAAC GACGACCTCG AGCGTAAGGC GTTCATTTCG TTCGCGCGTC GTGGTGTCGA ACGCATGGGC GCCGACGAAC AGAAGGCGTT GACCGTCTCG ACCGATGCGT CCGGCGGCTA CCTCGCCCCG GAGCAGTTCG GCAACGAGCT GATCAAGTTG TTGCGGCAAT ATTCGCCTGT CCGACAGTAT GCCAACGTGG TCAGCATCGG CGCTGCCGAG ATCAAGTATC CCCGCCGCAC CGGCAGCACG GTGGCATCCT GGGTGGATGA AACCGAGGAT CGCAGCGAAA GCGAGCCGAG CTTCGAACAG ATCACGATCG CGCCCTTTGA GCTGGCGACC CATTCGGACG TGTCGACGCA GCTTCTGGAA GACAACGCCT ACAATCTCGA AGGCGAGCTC GCAGCGGACT TCGCCGAGAC TTTCGGCATC AAGGAAGCCG CGGCGTTCGT GAAGGGCTCC GGCGTCAAGC AGCCGACCGG CATCATGACC GCGGCGGGCA TCACCGAAGT CAAGACCGGC GCCGCCGCGA CGTTCCCGAC CTCCAACCCG GCCGACGTGC TGATCGGCAT GTATCACGCG CTGCCCGGCG TTCATGCCCA GAATGGCGTG TGGATGATGA ACCGCACCAC GCTCGGCACC ATCCGCCAGT GGAAGGACGG CAACGGCCGC TATCTGGTCC TCGATCCGAT CTCGGCTGGC GCACCGGTCA CCCTGCTTGG TCGCCCGATC GTCGAAGCGA TCGACATGGA CGACATCGGC GCCAACAAAT ATCCGGTGCT GTTCGGCGAT CTGAAGGGCT ATCGCATCGT CGATCGCGTC GGCCTTTCGG TGCTTCGCGA CCCGTATTCG CTCGCGACCA AGGGTCAGGT TCGGTTCCAC GCCCGGACGC GGGTCGGCGC CGGCCTCACT CACCCCGACC GCTTCATCAA GCTCAAGGTG GCGGCGTAA
|
Protein sequence | MKPHYELELK EADTDPAELV TKALADFKDA VDGRLTAIET KAANDNKLAD RLDRIEAKMN RPGIGAANDN DDLERKAFIS FARRGVERMG ADEQKALTVS TDASGGYLAP EQFGNELIKL LRQYSPVRQY ANVVSIGAAE IKYPRRTGST VASWVDETED RSESEPSFEQ ITIAPFELAT HSDVSTQLLE DNAYNLEGEL AADFAETFGI KEAAAFVKGS GVKQPTGIMT AAGITEVKTG AAATFPTSNP ADVLIGMYHA LPGVHAQNGV WMMNRTTLGT IRQWKDGNGR YLVLDPISAG APVTLLGRPI VEAIDMDDIG ANKYPVLFGD LKGYRIVDRV GLSVLRDPYS LATKGQVRFH ARTRVGAGLT HPDRFIKLKV AA
|
| |