Gene Information |
Locus tag | Dshi_2177 |
Symbol | |
ID | 5713830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 2303499 |
End bp | 2304689 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641268099 |
Product | putative phage portal protein |
Protein accession | YP_001533514 |
Protein GI | 159044720 |
COG category | [S] Function unknown |
COG ID | [COG4695] Phage-related protein |
TIGRFAM ID | [TIGR01537] phage portal protein, HK97 family |
| |
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGTTCA ATCTGTTCCG ACAAAAACAA GACATACCGG CGACGGATCG CGTGCCCGAG GTCAAGGCGT CGGCCACGAG CCGGGTTGTG GCCATGGGCA GCTCCGGGCG GATTGCCTGG ACACCGCGGG ATTCGGGGTC GCTGACGCGC AACGGGTTCG CGGGCAATCC GGTGGGGTTT CGGGCGGTCA AGATGATTGC GGAGGCCGCC GCGGCGCTGC CGCTGGCGTT CCAGGATGCG GAGCGGCGCT ACGAGGCGCA TCCGCTGATC ACGCTGCTGG CGCGGCCCAG CCAGGCGCAG GGGCGGGCGG AGTTCTTCGA GGCGCTCTAT GCGCAGCTCC TGCTGACCGG CGACGGTTTC GTGGAGGCGG TGTTCGCCAA GCCCGAGCTG CCCACGGAGT TGCATGTGCT GCGCTCGGAC CGGGTGCGGA TCATTCCCGG CGCGGATGGC TGGCCGAGCG CCTATGAGTA CTCGGTCGGG GCGCACAAGC ACCGGTTCAT GGTCGAGGAG GGGCGCACGC CGATCTGTCA TCTGCGCACG TTCCATCCGC AGGACGACCA TTACGGGCTG TCCCCGATGC AAGCGGCGGC GACGGCGCTG GATGTCCATA ACGCGGCGAC CCGGTGGTCA AAGGCGCTGT TGGACAATGC GGCGCGGCCG TCGGGCGCGT TGGTCTACAA GGGGTCGGAG GGCGACGATA CGCTCTCGCC CGAGCAGTAC ACGCGGCTGG TCGACGAGAT GGACAGCTAC CACCAGGGTG CGCGCAATGC GGGGCGGCCC ATGTTGCTGG AAGGCGGGCT CGACTGGAAA CCCATGGGGT TCAGCCCCTC GGACATGGAG TTCCAGAAGA CCAAGGAGGC TGCGGCGCGC GAGATCGCGC TGGCCTTCGG GGTTCCGCCG ATGCTGCTGG GGATTCCCGG GGACGCGACC TATGCCAACT ACCAGGAGGC GCACCGGGCG TTCTATCGCC TGACGGTGCT GCCCTTGGCG CAGAAGGTGA CCGCGTCGCT GGGTCATTGG CTGACGGATC TGTCGGGGGA CGCCGTGAAT GTCGCGCCCG ATCTGGACAA GATCCCGGCC CTTGCCGCTG AGCGCGACGC GCAATGGGCG CGGATCGGGA CGGCGAGTTT TCTCACCGAT GCCGAGAAGC GGGTTCTGCT CGGGCTGCCG GCGGAAATGG ATTGCTCATG A
|
Protein sequence | MVFNLFRQKQ DIPATDRVPE VKASATSRVV AMGSSGRIAW TPRDSGSLTR NGFAGNPVGF RAVKMIAEAA AALPLAFQDA ERRYEAHPLI TLLARPSQAQ GRAEFFEALY AQLLLTGDGF VEAVFAKPEL PTELHVLRSD RVRIIPGADG WPSAYEYSVG AHKHRFMVEE GRTPICHLRT FHPQDDHYGL SPMQAAATAL DVHNAATRWS KALLDNAARP SGALVYKGSE GDDTLSPEQY TRLVDEMDSY HQGARNAGRP MLLEGGLDWK PMGFSPSDME FQKTKEAAAR EIALAFGVPP MLLGIPGDAT YANYQEAHRA FYRLTVLPLA QKVTASLGHW LTDLSGDAVN VAPDLDKIPA LAAERDAQWA RIGTASFLTD AEKRVLLGLP AEMDCS
|
| |
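The length fields in this record can be cross-checked against the coordinates and sequence. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TGA). Both assumptions agree with the values listed here, but neither is stated in the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(2303499, 2304689)
    print(length)                  # 1191, matching "Gene Length"
    print(protein_length(length))  # 396, matching "Protein Length"
```

Passing the full gene sequence (with its 10-base blocks) to `gc_percent` gives a value to compare against the rounded GC content field.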