Gene Dshi_0856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0856 
Symbol 
ID5710546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp868179 
End bp870593 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content69% 
IMG OID641266766 
Productprotein of unknown function DUF404 
Protein accessionYP_001532202 
Protein GI159043408 
COG category[S] Function unknown 
COG ID[COG2308] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.319945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.446218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGATC CCCTGACTGC GCCCGCGGGT GATCCCGGCC CCTTCGGGGG CCTCTTGGGC 
AGCTATGTCC CGCAACCGGG CGTGGCCGAT GAGCTCTTCG ATGCGACGGG CGCCATCCGC
CCTGTCTGGC GCCCGTTCCT GGAACACCTG GGCCGGATCG ATGCCGAAGG GGTGGCCATG
CGGTTCGACC GGGGCACGCA GTATCTGCGC GATGCCGGGG TCTATTATCG CAAATACGGC
GCGGAGGCTG CGAGCGAGCG CGACTGGCCC CTCAGCCCGA TCCCCGTCCT GCTGTCGCAG
GAGGAATGGA CCGAGATCGC CGCGGGGCTG ATCGCGCGTG CGGACCTGCT GGAACAGGTG
GTCGCCGATC TTTATGGCGA GAACCGGCTG CTGGCCGAGG GACACCTGCC TGCGGAGCTA
ATCGCCCAGA ACCCCGCCTG GCTGCGCCCC ATGGTGGGGG TCAAACCGGC CTCGGGCCAT
TTTTTGAACT TCATCGCCTT CGAGGTCGGG CGCGCGCCCT CGGGCCAGTG GTGGGTGCTG
GGCGACCGGA CGGACGCCCC CTCCGGCGCG GGTTTCGCCC TGGAAAACCG CATCGCGACC
AGCCGGACCT TCCCGAACTT CTATGCCCGC GCCAATGTCC ACAGGCTCGC CGGGTTCTTC
CAGGCGTTCC GCGACACGCT CGTGGGCCAG CGGCAGGACC GCGACGAACC TCTGGCCCTG
CTCACGCCGG GGCAGATGAA CGATGTCTAT TTCGAGCACG CTTACCTTGC GCGCTATCTC
GGGATGCTGC TGGTGGAAGG CGAAGACCTG ACCGTGGACC AGGGGCAGCT GAAGGTGCGC
TCGGTCAACG GGTTGCGCCC GATCTCGGTG CTGTGGCGTC GGCTGGATGC GGAGTTCTGC
GATCCGCTGG AGCTGAACGA GGCCTCGACA CTGGGCACGG CGGGCTTTGT CGACGTGGTG
CGCCGGGGCG GCGTGTCCTG CGTGAACGCG CTTGGGGCCG GGGTGCTGGA AACCCGCGCG
ATGCTGGCCT TTCTGCCGAA GATATCCCGC GCGCTGACCG GGGCACCCCT GGCGCTGCCC
AACATCGCGA CCTGGTGGTG CGGCCAGGCG GCCGAACGCG CCCATGTGCT GGCCCACCGC
GACAAGATGA TGATCAGCTC GGCCTACGCC ACCCGCCTGC CCTATGACGA CCCCGGCGGG
ACCAGCGCGC CGGAGGATTA CAGCGCCGAG GGCAGCGCCC GGGTGGCGAA GCTGCTGGAG
GATCGGGGCG GCGACCTTGT GGGCCAGGAG ATCGTGACGC TCTCCACGAC CCCTGTCTGG
AAGGATGGCA AACTGGCCCC GCGCCCGATG TCCCTGCGCA TCTTCCTCGC GCGCACCGCG
ACGGGTTGGC GGGTGATGCC GGGCGGCTAT GCCCGGGTCG CCTCACGCAA CAACGCATCG
GCCATCGCCA TGCAGGCGGG CGGGTCGGTG TCGGATGTCT GGGTGGTCAG CGACAGTCCG
GTGCCCAGAC CCAGCCTCAT GCCCGCCCCG AGCGGCGATC CGGGGGGGCT GAATTCCGCC
TACAGCCTGC CCAGCCGGGC GGCGGACAAC CTCTTCTGGC TTGGGCGCTA TATCGAGCGA
GCCGAAGGGG CGATGCGGGC CTACCGCGCC TATTACGGAT TGATCAGCGC CGGGGTCGAA
CCCGACGCGG ACCTGCCCGA TTTCATCCTG CGCGCGTTTC TGAACAGCAC GGGCCGCGAC
CCGGTGGCCT TGTCCGAAGG GTTCCAGTCC GCGCTGGAAG CCGCGGTCGA CTGTGCTGCG
CGCATCCGCG ACCAGGTCTC GGTGGACGGG ATGCTGGCAC TGAAGGACCT GCTGAAGGCG
AGCCGGAAGC TGCGCGCAAA CCCCATCGCG GTCGAAGAGG CGGCGGCGGC GGTGGGCATC
CTGCTACGCA AGCTGACCGG GTTTTCCGGG CTGGTGCACG AGAACATGTA CCGCTCGGCG
GGCTGGCGGT TCATGAGTGC TGGCATGTCA GTGGAGCGGG CCAGCACCAT GTGCGCCATC
CTCGCCCGGG TGCTGGACCC CGATGCGCCG GACGGAGCGC TGGATCTTGC GCTGGAGCTG
GGCGACAGCA CCATGGCCCA CCGGGCGCGG TTCCTCAGGT CGGTGAGCCC CGACAGTCTG
CGCCAGATCC TTGCCCTCGA CCCCGACAAT CCCCGCGCGG TCGGCTATCA CCTGGGGCGG
CTCAAGAGCC ATATCGACGC CCTGCCCCAG ACCCGGCGCA GCCCGGGCCT GTCGCCGGTG
GCCCGCGCGG CCTTGCAGGT GCATACGGAC CTCGCCGTGC AGGTGCCCGA CACGCTCACC
CCGGACCGGC TGCGCACCCT GCAGCGGCAG ATCTGGACCC TTTCGGACCT GCTCGCGGCC
ACGTACCTGA CATGA
 
Protein sequence
MPDPLTAPAG DPGPFGGLLG SYVPQPGVAD ELFDATGAIR PVWRPFLEHL GRIDAEGVAM 
RFDRGTQYLR DAGVYYRKYG AEAASERDWP LSPIPVLLSQ EEWTEIAAGL IARADLLEQV
VADLYGENRL LAEGHLPAEL IAQNPAWLRP MVGVKPASGH FLNFIAFEVG RAPSGQWWVL
GDRTDAPSGA GFALENRIAT SRTFPNFYAR ANVHRLAGFF QAFRDTLVGQ RQDRDEPLAL
LTPGQMNDVY FEHAYLARYL GMLLVEGEDL TVDQGQLKVR SVNGLRPISV LWRRLDAEFC
DPLELNEAST LGTAGFVDVV RRGGVSCVNA LGAGVLETRA MLAFLPKISR ALTGAPLALP
NIATWWCGQA AERAHVLAHR DKMMISSAYA TRLPYDDPGG TSAPEDYSAE GSARVAKLLE
DRGGDLVGQE IVTLSTTPVW KDGKLAPRPM SLRIFLARTA TGWRVMPGGY ARVASRNNAS
AIAMQAGGSV SDVWVVSDSP VPRPSLMPAP SGDPGGLNSA YSLPSRAADN LFWLGRYIER
AEGAMRAYRA YYGLISAGVE PDADLPDFIL RAFLNSTGRD PVALSEGFQS ALEAAVDCAA
RIRDQVSVDG MLALKDLLKA SRKLRANPIA VEEAAAAVGI LLRKLTGFSG LVHENMYRSA
GWRFMSAGMS VERASTMCAI LARVLDPDAP DGALDLALEL GDSTMAHRAR FLRSVSPDSL
RQILALDPDN PRAVGYHLGR LKSHIDALPQ TRRSPGLSPV ARAALQVHTD LAVQVPDTLT
PDRLRTLQRQ IWTLSDLLAA TYLT