Gene Dshi_0997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0997 
Symbol 
ID5710513 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1027607 
End bp1029982 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content73% 
IMG OID641266908 
Producthypothetical protein 
Protein accessionYP_001532340 
Protein GI159043546 
COG category[S] Function unknown 
COG ID[COG3002] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0514673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATA CAGCAACCCT GATTTCCGTC GACACCCTGA CCGCCGCCCA GGCGGGTGCG 
GTCCAGGCAA TCCCCCCGGC CTTCCCGCTT TCGGCCACCG TGGCGGTCAA CCCGTTCCTC
GGGCAGGCGG GCCACTCCCT GCCGGACACC GCAGCAGTCC TGGGCAAGAC CGCCGGCTGC
GCCACCACCG CCCCGCGGAG CTGGTTCGCC GCCGAGATCG CGGCGGGCCG GATCACCAAA
CCGGCCGTGG CCGAAGCGCT CGCCGCCGCG GGGCTCGACT GGCCCGTGGA CAAGGTGATC
TCGGCGGCCA GCCGCCAGCG CCCGGCGCCG CAGGCCCTGC CCACGGTCGC CGATCTCGCC
GGTGCCTCCG AGGCCCCCGG CTGGCCCGCC CAGGCAGAGG CACGGATCGC GGCCTGGGTG
GCCGGGCATT TCGATCAGGG CCAGGCGCTC TGGGCCGCTG GTGCGCCCTC CGGCACCTAT
GCCGACTGGC TGGCCTTCGC CACCACCGAC ATGACCCCCG ATCTCGCCGG GCTTCCGGGC
TTCCGGGCCT GGCTCAAGGC CCTGCCGTCC GACCCGACCG AGGCGCTTTT GGCCGCGGTC
AACACGCTGG GGCTGACCGA GGCGGCGCTG CCGCTCTACT TCCACCGGCT TGCCATGTCG
CTGGGCGGCT GGGCGCAGGC CGCGCGCTAC CGGCTCTGGC AGGCGGAGCT GGCGGGCCAG
ACCGACACCA CGCTGGCCGA GCTGATCGTA ATCCGCGCGG TCTGGGATGC CGGCACGCTG
GCCACAAGGC CCGCCCTGGC CGCACAGTGG GACACCGCGC GCGCCGCCTT CGCCGCCCCG
GTCACGCCCA GCGAGGATGA CCTGATCGAC GCCGTCCTGC AGGACGCGGC GGAGCGGAGC
ACCCAGGCCG ATCTCGCCCA GGCCTTCGCC CCCGTGGCCA AGGCCGAGGC CCGGCCCGCG
CTGCAGGCGG CCTTCTGCAT CGACGTGCGC TCCGAGGTGA TCCGCCGGGC GCTGGAGACC
TGCGATCCGG GCATCGAAAC CCTCGGCTTT GCGGGCTTCT TTGGCCTCAC TGCCGCCCAC
ACCCCCACCG GGTCCTGCAA TTCCGAGGCG CGGCTGCCGG TCCTTCTGAC CGCCGGGGTG
ACGAGCAAGG CGAGCGGCGA CCACGACGCC GCCCGGATCA CCACCCGCGT CACCCGCGCC
TGGGGCCGGT TCCGGCAGGC GGCGGTGTCC TCCTTCGCCT TCGTCGAGGC GGCGGGCCCG
TTCTATGCGG GCAAGCTGGT GCGCGACACG CTGGGCCTGG GCAAGGCCGA CGCGATCCCG
GGCAAGCCGG TCTTCGACCC GCCCCTGCCG GAGGAGGCGC AGATCGACGC GGCGGCCACG
ATCCTGAACG CCATGTCGCT GAAATCCAAC TTCGCGCCGC TGGTGGTGAT CGCGGGCCAT
GGCAGCCATG TGAACAACAA CGCCCATGCC AGCGCGCTGC AATGCGGGGC CTGTGGCGGC
TATGGCGGCG ACGTCAACGC CCGGCTTCTG GCCGACCTGC TGAACCAGCC CCATGTGCGG
GCCGGTCTGG CCGCCAGGGG CATCGCGGTG CCCGAGGACA CGATCTTCGT CGCGGCCCTG
CACGACACCG CGCAGGACGC GATCACGCTC TATGCCGATG ACCTGTCCGA GGCCCACCGG
GCCGCGGCCA CCGCGTCGCT GGCGCAGGCC CGGCAGTGGT GCGCCGAGGC CGGGCGGCTC
GCCCGGTCCG AGCGGCAGCC GAGCCTGCCG GGCGCGACCG AACGCGACGG CATCGCCGCC
CGCGCCCAGA GCTGGGCCGA AACCCGGCCC GAATGGGGGC TGGCGGGCTG CAAGGCCTTC
GTCGTCGCCC CGCGCACCCA GACCGCGCCC GCGCAGCTTG ACGGGCGGGT CTTCCTGCAC
AGCTACGACT GGGCGCAGGA CGAGGGCTTC GGGGTGCTGG AGCTGATCCT GACCGCGCCT
GTGGTGGTCG CGAGCTGGAT CAGCCTCCAG TATTACGGCT CCGTGGTGGC GCCCGAGGTG
TTCGGCGGCG GCTCCAAGCA GGTCCATAAC GTGACCGGCG GGATGGGCGT ACTCGACGGC
GGCACCGGGG CGCTGCGGAT CGGCCTGCCG ATCCAGTCGG TCCATGACGG CGGCAGCTTC
GTGCATGACC CGCTGCGCCT GACCATCGTG GTCAATGCCC CGCAGGAGGC GATCACCGAC
ATCCTCGCGC GCCATGACGG GGTGCGGGCG CTGTTCGACA ACGGCTGGCT GAAGCTTCTG
CGGCTCGAAG CGGATGGCAC CATCTCGGAG CGCTATACCG GCGACCTGAC ATGGGAGGCC
TTCGCGCCGG GCACCGAGGC TGCCCAGGCC GCCTGA
 
Protein sequence
MTHTATLISV DTLTAAQAGA VQAIPPAFPL SATVAVNPFL GQAGHSLPDT AAVLGKTAGC 
ATTAPRSWFA AEIAAGRITK PAVAEALAAA GLDWPVDKVI SAASRQRPAP QALPTVADLA
GASEAPGWPA QAEARIAAWV AGHFDQGQAL WAAGAPSGTY ADWLAFATTD MTPDLAGLPG
FRAWLKALPS DPTEALLAAV NTLGLTEAAL PLYFHRLAMS LGGWAQAARY RLWQAELAGQ
TDTTLAELIV IRAVWDAGTL ATRPALAAQW DTARAAFAAP VTPSEDDLID AVLQDAAERS
TQADLAQAFA PVAKAEARPA LQAAFCIDVR SEVIRRALET CDPGIETLGF AGFFGLTAAH
TPTGSCNSEA RLPVLLTAGV TSKASGDHDA ARITTRVTRA WGRFRQAAVS SFAFVEAAGP
FYAGKLVRDT LGLGKADAIP GKPVFDPPLP EEAQIDAAAT ILNAMSLKSN FAPLVVIAGH
GSHVNNNAHA SALQCGACGG YGGDVNARLL ADLLNQPHVR AGLAARGIAV PEDTIFVAAL
HDTAQDAITL YADDLSEAHR AAATASLAQA RQWCAEAGRL ARSERQPSLP GATERDGIAA
RAQSWAETRP EWGLAGCKAF VVAPRTQTAP AQLDGRVFLH SYDWAQDEGF GVLELILTAP
VVVASWISLQ YYGSVVAPEV FGGGSKQVHN VTGGMGVLDG GTGALRIGLP IQSVHDGGSF
VHDPLRLTIV VNAPQEAITD ILARHDGVRA LFDNGWLKLL RLEADGTISE RYTGDLTWEA
FAPGTEAAQA A