Gene Dshi_3654 details

Gene Information

Locus tag: Dshi_3654
Symbol:
ID: 5714184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009955
Strand:
Start bp: 53399
End bp: 55099
Gene Length: 1701 bp
Protein Length: 566 aa
Translation table: 11
GC content: 53%
IMG OID: 641276572
Product: hypothetical protein
Protein accession: YP_001541868
Protein GI: 159046196
COG category:
COG ID:
TIGRFAM ID:
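
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 53399
end_bp = 55099
gene_length_bp = 1701
protein_length_aa = 566

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A coding sequence should divide evenly into 3-base codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not translated.
translated_codons = codons - 1

print(span, remainder, translated_codons)  # 1701 0 566
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.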


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.773483
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000658966
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCAGATT ACAGTCAACA AGGTGATGAC GCCGAATATA CACTAAGTGC TGAATTTTGG 
GATAAGGGGC AAGCCCTTGG CTTTCCGATC CAGTTTGATC TCGTGCTCAA GCACCTACGA
CAGGACATGA GGGACGACTG GTACTTCGAC TGCCTGCAAT ATGACGATAT ATTTAAGAAT
CCGGACGAGG CCAAACGCAT CGTCCTTTCT CTACTGCAGG AATGGAACGG TGTGTACCTC
GGCACCCGCA GCGTTGTCCG TAACATTCCC AAGAAAGGCT ATGGGGAACG CTACGGTCTG
GAAACCGATT TCTTTGATAG GTTTGTCTAT CAGGCGATCT GCACCTTCCT TATTCCTTAC
TATGACAGTC TGCTCAGCCA CCGAGTGCTA AGCTACCGCC ACGACTCTGC GCCGCAGAAC
TCCAAATACT TGTTCAAGAA CAAGATCGAT CGATGGTTCA CATTTGAGGG GATAACTCTT
ACGTTCGCCC GATCGAACCA GCATCTTCTG GTGACCGATC TCAGCAATTT TTTCGAGAAC
ATCTCGCGAG AGCAGATCAT CGCAGCCTTG GAGAAGGCTA TACCAGAGAT CGTGGCGACG
GGCCCTGAAA AGCTCCAAAT TCGCAATGCT ATCCGTACGT TGGACAGGCT GCTCGAACAA
TGGACGTTCA GCCGCGACCA TGGTCTACCA CAGAACCGGG ACGCATCTTC GTTTCTTTCC
AACATCCTGC TCTCTTCCGT TGATCGCGAA ATGGCCAAGA AGGGTTACGA TTACTACAGA
TACGTGGACG ACATCCGCGT TTTGGCCGAC ACGGAGATCC ACGCGCGCCG CGCACTCCAG
GATATTATCA GGGAGCTACG GAAGGTCGGC CTGAACATCA ACGCGAGTAA GACCGAGATA
TTGCCGCCTA ACGCTTCACT AGAGAAGTTG GTCGCCCACT TCCCTTCGCA AGACAGCGCG
ACCACCGCCA TTAATCAGAT GTGGCAATCT CGGAGCCGCA GAATTGTGAC CCGTTCTGTT
GAATACATCT TTGGTATCCT GACCAGCTGC ATTGAAGCTG GTGACACTCA AACGCGACAG
TTCCGCTTCG CGGTAAACCG CGTGGCCCAG ATTGTGGAAT CTGGCCTCTT CGACGTCGGC
GACGCACTAT CAGCTTCACT GCTCGATACG CTTTCCCGCT CGCTTTCGGA GCATGCTGTA
TCCACAGATC AATACTGCCG GCTTATTTCA ACACTAGACC GAGATGGCCA ATGCCTTCCG
GCGCTGGAGG GGTTTCTGTT GGCGGAGGAC CGAGCCATCC ACGATTGGCA GAACTACAAC
ATTTGGATGC TGCTTGGGTC GAAGAGACAC CGATCGGATA GGTTGGTTGA TCTCGCCGCT
AGGAAGCTCC ACGAGGACAT CAGGTCCGGC GAGGCTGCTG CCATCTTAAT CTGGCTGCGG
TGTGTCGGTG AGACGGCACT TATCCGCGGA TGCATCGAAA AGTTTAGCGA ACTGCCTTAT
CAGAACGCCC GCTATCTCTT GATTGCCTCC TCGGTGCTTC ACAAAGATGA CCTCAAGCCG
CTCTATGGCC TTGTGCCTAT CTGCCTCAAG GGCACAGGGC CAAGAGCCGA GCGTTACACC
AGCGAGGAAG GTCTTCCCTT TGCAAAACGG GAAGCGCCCG ATCTGCTAAA CCTTGTCGAC
GAGGTCAGTG AGTATGATTG A
 
Protein sequence
MSDYSQQGDD AEYTLSAEFW DKGQALGFPI QFDLVLKHLR QDMRDDWYFD CLQYDDIFKN 
PDEAKRIVLS LLQEWNGVYL GTRSVVRNIP KKGYGERYGL ETDFFDRFVY QAICTFLIPY
YDSLLSHRVL SYRHDSAPQN SKYLFKNKID RWFTFEGITL TFARSNQHLL VTDLSNFFEN
ISREQIIAAL EKAIPEIVAT GPEKLQIRNA IRTLDRLLEQ WTFSRDHGLP QNRDASSFLS
NILLSSVDRE MAKKGYDYYR YVDDIRVLAD TEIHARRALQ DIIRELRKVG LNINASKTEI
LPPNASLEKL VAHFPSQDSA TTAINQMWQS RSRRIVTRSV EYIFGILTSC IEAGDTQTRQ
FRFAVNRVAQ IVESGLFDVG DALSASLLDT LSRSLSEHAV STDQYCRLIS TLDRDGQCLP
ALEGFLLAED RAIHDWQNYN IWMLLGSKRH RSDRLVDLAA RKLHEDIRSG EAAAILIWLR
CVGETALIRG CIEKFSELPY QNARYLLIAS SVLHKDDLKP LYGLVPICLK GTGPRAERYT
SEEGLPFAKR EAPDLLNLVD EVSEYD
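
Both sequences above are printed in blocks of 10 letters, 60 per line, so the spacing has to be removed before the sequence can be used elsewhere. The sketch below shows one way to do that and to compute a GC fraction for a nucleotide sequence. The helper names `join_blocks` and `gc_fraction` are hypothetical, and this is not necessarily how the GC content reported in this record was calculated. Only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that split a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases in `dna` that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in ("G", "C"))
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "GTGTCAGATT ACAGTCAACA AGGTGATGAC GCCGAATATA CACTAAGTGC TGAATTTTGG"

dna = join_blocks(first_line)
print(len(dna))  # 60
print(f"{gc_fraction(dna):.2f}")
```

Pasting the full gene sequence into `join_blocks` in place of `first_line` would allow the stated gene length and GC content to be compared against the computed values.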