Gene Dshi_2177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2177 
Symbol 
ID5713830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2303499 
End bp2304689 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID641268099 
Productputative phage portal protein 
Protein accessionYP_001533514 
Protein GI159044720 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTTCA ATCTGTTCCG ACAAAAACAA GACATACCGG CGACGGATCG CGTGCCCGAG 
GTCAAGGCGT CGGCCACGAG CCGGGTTGTG GCCATGGGCA GCTCCGGGCG GATTGCCTGG
ACACCGCGGG ATTCGGGGTC GCTGACGCGC AACGGGTTCG CGGGCAATCC GGTGGGGTTT
CGGGCGGTCA AGATGATTGC GGAGGCCGCC GCGGCGCTGC CGCTGGCGTT CCAGGATGCG
GAGCGGCGCT ACGAGGCGCA TCCGCTGATC ACGCTGCTGG CGCGGCCCAG CCAGGCGCAG
GGGCGGGCGG AGTTCTTCGA GGCGCTCTAT GCGCAGCTCC TGCTGACCGG CGACGGTTTC
GTGGAGGCGG TGTTCGCCAA GCCCGAGCTG CCCACGGAGT TGCATGTGCT GCGCTCGGAC
CGGGTGCGGA TCATTCCCGG CGCGGATGGC TGGCCGAGCG CCTATGAGTA CTCGGTCGGG
GCGCACAAGC ACCGGTTCAT GGTCGAGGAG GGGCGCACGC CGATCTGTCA TCTGCGCACG
TTCCATCCGC AGGACGACCA TTACGGGCTG TCCCCGATGC AAGCGGCGGC GACGGCGCTG
GATGTCCATA ACGCGGCGAC CCGGTGGTCA AAGGCGCTGT TGGACAATGC GGCGCGGCCG
TCGGGCGCGT TGGTCTACAA GGGGTCGGAG GGCGACGATA CGCTCTCGCC CGAGCAGTAC
ACGCGGCTGG TCGACGAGAT GGACAGCTAC CACCAGGGTG CGCGCAATGC GGGGCGGCCC
ATGTTGCTGG AAGGCGGGCT CGACTGGAAA CCCATGGGGT TCAGCCCCTC GGACATGGAG
TTCCAGAAGA CCAAGGAGGC TGCGGCGCGC GAGATCGCGC TGGCCTTCGG GGTTCCGCCG
ATGCTGCTGG GGATTCCCGG GGACGCGACC TATGCCAACT ACCAGGAGGC GCACCGGGCG
TTCTATCGCC TGACGGTGCT GCCCTTGGCG CAGAAGGTGA CCGCGTCGCT GGGTCATTGG
CTGACGGATC TGTCGGGGGA CGCCGTGAAT GTCGCGCCCG ATCTGGACAA GATCCCGGCC
CTTGCCGCTG AGCGCGACGC GCAATGGGCG CGGATCGGGA CGGCGAGTTT TCTCACCGAT
GCCGAGAAGC GGGTTCTGCT CGGGCTGCCG GCGGAAATGG ATTGCTCATG A
 
Protein sequence
MVFNLFRQKQ DIPATDRVPE VKASATSRVV AMGSSGRIAW TPRDSGSLTR NGFAGNPVGF 
RAVKMIAEAA AALPLAFQDA ERRYEAHPLI TLLARPSQAQ GRAEFFEALY AQLLLTGDGF
VEAVFAKPEL PTELHVLRSD RVRIIPGADG WPSAYEYSVG AHKHRFMVEE GRTPICHLRT
FHPQDDHYGL SPMQAAATAL DVHNAATRWS KALLDNAARP SGALVYKGSE GDDTLSPEQY
TRLVDEMDSY HQGARNAGRP MLLEGGLDWK PMGFSPSDME FQKTKEAAAR EIALAFGVPP
MLLGIPGDAT YANYQEAHRA FYRLTVLPLA QKVTASLGHW LTDLSGDAVN VAPDLDKIPA
LAAERDAQWA RIGTASFLTD AEKRVLLGLP AEMDCS