Gene Dshi_1955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1955 
Symbol 
ID5712949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2048152 
End bp2049948 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content72% 
IMG OID641267880 
Productputative short-chain dehydrogenase 
Protein accessionYP_001533297 
Protein GI159044503 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
[COG2910] Putative NADH-flavin reductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00955158 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00174251 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATACC TTGTGACGGA TGCGGCGGGC GCGGTCGGGC ATCGCCTCGC CGCGACCCTT 
GTGGCCGCGG GGCACGAGGT GCTGGCCTTC TCGGACAGCC CCGTCAATGG GGTGGCCAGC
CGGGCCTGGC CCACCCGTGC GGACGTGCTG GCCAGGGCGC TGGCGGGGGT GGACGTGATC
TGCGCGGTGG ACCTGGCGAC GGTCGCGCCG GACGCGGCCC CGGCGCTGCT CAAACGTCTG
GACAAGCTGC GGGTTGCGGC GGGCAAGGCC GGGTGCGGCC GGTTCGTCCT GCTGGGGCGC
GCCGATATCT ACGACCCCGC GCTTCTGGCC GACAAGACGC TGCTGGAACG CTCGGCCACC
GCCGCGCCCG CCGACCTCGC CCCACCTGCG GCGGCGGCCC TCGCGCTGGA GGAGAGCCTG
CGCGCCGGCG CGGGGCTCGA CTGGACAATC CTGCGCGCGC CGATGATCCT CGCCCCCGAC
AGCGCCGAGG CCAAGGCCCT CTTCCGCCGC ATCCTGATCG ACGACGCCCC GTCGCCCGCC
CGGTTCCACC CGGTCGATGC CCGGGACGTG GCCACGGCCC TGGCCACCCT CGGCACGCAC
GACCGCGCAG GGGGGCAGGT GTTCAACATC GCCGCCCCCG CGGCCCTCGC CGCCCGGACC
CTGGACCAGG AGCTGGGCCG GCTCGCGCGG CTGCTGAACG ACGAGGCCAC GCCGCAGGAC
AAGCTCCGCC CGCCCCACCA GGTCGCGGAC CCGGTGCTGG CCACGGACAA GCTCGCGCGC
CTGACCGGCG TCACCCCGAC GCGGCCGATC TGGGTCAGCC TCGCCGAGAC GATGCAGCAG
GTGATGAAGG ACCTGCGCGC CGACGGCACC CTCCCGCCGC TGCCCGAGAA GATCAACCCG
GTCGTCAAGG CAGTGGAGTT GGGGCAAAAA CTGCTCGCCG GCAAGACCGC GGTGATCACC
GGCGTGACCT CCGGCATCGG CCGCGCGACC AGCCTGCTGC TGTCGCGGCT CGGGGCCACG
GTGGTGGGCA TCGCCCGCAA TGCCGAGGCC GGGGAAGCCT GGGAAGCGGC CCTGGCCGAA
GGCCGCCACA CCGTGCCCGG CCATTTCATC CAGGCCGACC TGATGTCCTT TGCCCGGATC
CGCACCCTCG CCGCCGACCT CGCCACGCGC TTTCCGCGGA TCGACATCCT GATCAACAAT
GCGGGCGCGA ACTTCCCCAC CCGGCGCCTG ACCGAGGACG GGGTCGAGGC CACCCTGGCG
ATCAACCATT TCGCGCCCTT CTTGCTGACC AACCTGCTGG CGGGCCCGCT GAAGGCGGCC
CCGGCGGCGC GGATCATCAT GGTGAATTCC GACTGGCACC GCCGCTCCCT GCCGGACATG
CACGACCTGC AGATGGAGAG CGGCTATCGC ACGGGCGAGG CCTATTGCCG GGCCAAGTTC
ATGAATTTGC AGATCACCTA CGCCATGGCC GCCCTGCTGG ACGGCACCAA CGTGACCGTG
AACGCGATCC ATCCCGGGGT GGTCCGCACG GGCATCTCGA CCCGCAACAC CGACGGCGCG
CCCCAGGTCC CCGCCCAGGC CCGCGAACGC GCGCAACAGC GGATGATCTC GCCGGAGTTT
TCCGCCGTCT ACCTCGCCAG CCTCGCCACC GCACCGGACT ACGAAGGCCA GAGCGGGCTC
TACCTCGATA CCGACGAGAT CAAGCAGTCT CACGAGGCCA CCTATGACGA GGATTGGGCC
TGGCAGATCT GGGAATGGAG CGCGCAGATC ACCGGGCTGA CCCCGGCCTC CGCCTGA
 
Protein sequence
MKYLVTDAAG AVGHRLAATL VAAGHEVLAF SDSPVNGVAS RAWPTRADVL ARALAGVDVI 
CAVDLATVAP DAAPALLKRL DKLRVAAGKA GCGRFVLLGR ADIYDPALLA DKTLLERSAT
AAPADLAPPA AAALALEESL RAGAGLDWTI LRAPMILAPD SAEAKALFRR ILIDDAPSPA
RFHPVDARDV ATALATLGTH DRAGGQVFNI AAPAALAART LDQELGRLAR LLNDEATPQD
KLRPPHQVAD PVLATDKLAR LTGVTPTRPI WVSLAETMQQ VMKDLRADGT LPPLPEKINP
VVKAVELGQK LLAGKTAVIT GVTSGIGRAT SLLLSRLGAT VVGIARNAEA GEAWEAALAE
GRHTVPGHFI QADLMSFARI RTLAADLATR FPRIDILINN AGANFPTRRL TEDGVEATLA
INHFAPFLLT NLLAGPLKAA PAARIIMVNS DWHRRSLPDM HDLQMESGYR TGEAYCRAKF
MNLQITYAMA ALLDGTNVTV NAIHPGVVRT GISTRNTDGA PQVPAQARER AQQRMISPEF
SAVYLASLAT APDYEGQSGL YLDTDEIKQS HEATYDEDWA WQIWEWSAQI TGLTPASA