Gene Dshi_2823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2823 
Symbol 
ID5710674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2979098 
End bp2980144 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content68% 
IMG OID641268749 
Productaldo/keto reductase 
Protein accessionYP_001534157 
Protein GI159045363 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.299574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAAAC ACCCCATCGG CCGCGGCGGG CCGGACGTCA CCCGGTTCTG CCTCGGCACC 
ATGACCTTTG GCACCCAGAC CGGGCAGGCC GACGCCCATG CCCAGATCAC CATGGCGCTG
GAGGCTGGGT TGAACATCCT CGACACTGCC GAAATGTACC CGGTCAACCC GGTCTCGGCC
GAGACCGTGG GCCTGACCGA AACTATCATC GGGGCCTGGA ACGCGGCCAA CCCGGGGCGG
CGCGGCGAGT ACGTGCTCGC CACCAAGGTT TCCGGCGAGG GGCTGAAGGC GGTGCGTGAC
GGCGCACCGA TCTCGCGCGC AACCATCGAA ACGGCGGTGG AAGCCTCCCT GCGCCGGTTG
CAGACCGACC ATATCGACAT CTACCAGCTG CACTGGCCGA ACCGGGGCTC CTACCATTTC
CGGCAGAACT GGACCTTCGA TCCGAGTGGG CAGAACAAGT CCGACACGCT CGCCCATATC
GAGGAGGTGC TGGAAACCGT CGACCGCCTC GTGGCCGCGG GCAAGGTCGG CCATATCGGG
CTGAGCAACG AGAGCGCCTG GGGCACCGCC CAATGGCTGC GCGTGGCCGA GACCCACGGC
CTGCCCCGGG TGGTGTCGGT CCAGAACGAG TATTCCATGC TCGCACGGCT CTACGACACC
GATCTGGCGG AGTTGTCGGT CAACGAAGAG GTCGGGCTGC TGGCCTTCTC GCCCCTGGCC
ACGGGGCTGC TGACGGGCAA GTACCGGGGC GGCGCGGTGC CCGAAGGCTC CCGCATGTCG
CTCAACGGCG CGCTGGGCGG GCGGGTGACG GACCGGGTCT GGGGCGCGGT CGACGCCTAT
GCCGCCATCG CCGAGGCCCA CGGGCTCGAC ATGACCCATA TGGCGCTCGC GTGGTGCGCG
CAGCGGCCCT TCATGGGCTC GGTGATCTTC GGCGCGACCA CGCGGGACCA GCTGGCTCAT
ATCCTCGACG GTCTGGACCT GCGCCTGTCG CCGGAGGTGC TGGCCGAGAT CGACGCCGCC
CACAGGGCGC ATCCGATGCC GTTCTAG
 
Protein sequence
MQKHPIGRGG PDVTRFCLGT MTFGTQTGQA DAHAQITMAL EAGLNILDTA EMYPVNPVSA 
ETVGLTETII GAWNAANPGR RGEYVLATKV SGEGLKAVRD GAPISRATIE TAVEASLRRL
QTDHIDIYQL HWPNRGSYHF RQNWTFDPSG QNKSDTLAHI EEVLETVDRL VAAGKVGHIG
LSNESAWGTA QWLRVAETHG LPRVVSVQNE YSMLARLYDT DLAELSVNEE VGLLAFSPLA
TGLLTGKYRG GAVPEGSRMS LNGALGGRVT DRVWGAVDAY AAIAEAHGLD MTHMALAWCA
QRPFMGSVIF GATTRDQLAH ILDGLDLRLS PEVLAEIDAA HRAHPMPF