Gene Dshi_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2039 
Symbol 
ID5713034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2158316 
End bp2159335 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content72% 
IMG OID641267963 
Productmalate/L-lactate dehydrogenase family protein 
Protein accessionYP_001533379 
Protein GI159044585 
COG category[C] Energy production and conversion 
COG ID[COG2055] Malate/L-lactate dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.308465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.10893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTACATGT CCGAGACCGA AACCCTTTCC CTGCCCGACG CGCGCGACCT TCTGTTCCGG 
GCCTTTACGG CCAACGGCGT GCCGGAAGGC GCTGCGCGCA GCACCGCCGA CGCGCTGGTC
GCGGCCGAGG CCGAGGGCCA GGTGGGCCAC GGGTTTTCGC GGCTGGAAGA TTACGTGGCC
CAGGCCCGCA GCGGCAAGAT CGTCGCCGGG GCCGAGGTGA CGATCACCCG CCCTGCCCCG
ACCACACTGC TGGTGGATGC CGGCCACGGG TTCGCCTATC CGGCGCTGGA GCGTGCCATT
GACGAAGGCA TCGCCGTCGC GCGGGAACTT GGCACCGCCG CCATCGCCGT GACCCGCTCG
CACCATTGCG GCGCGCTGTC GATCCATGTG GAGCGGGCGG CAAAAGCCGG GCTGGTGGCG
ATGATGGTGG TCAACGCCCC CGCCGCGATT GCCCCCTGGG GCGGCAAGAC CCCGCTTTTC
GGCACCAACC CCATCGCCTT TGCCACGCCC AGGGCCGGGA GCGCGCCGTT GGTCATCGAC
CTGTCGCTGT CGAAGGTGGC CCGCGGCAAG GTGATGAATG CCAAGAAGGC GGGCAAGCCG
ATCCCCGAAG GCTGGGCGCT CGATGCCGCG GGCAATCCGA CCACGGATGC CGAGGCAGCG
CTCGGCGGCA CCATGGTGCC CATCGGCGAG GCCAAGGGCA CCGCGCTGGC GCTGATGGTC
GAGATCCTGT CCGCGGTGAT GACCGGTGCC GCGCTGAGCA CCGAGGCCGG GTCGTTCTTT
TCCGCTGACG GCCCGCCCCC TGGGGTCGGC CAGTTTCTGA CGCTCTGGCG TCCGCCCGAG
GGGGCGGAGG CGTTCACTGC CCGGCTCGCC CCGCTGCTGG CCCAGATCGA GACGATGGAG
GGCGCCCGCC TGCCGGGCAC ACGCCGTCTG GCCGCCCTGA ACGCCGCGCA GGCACACGGC
ATCGCGGTGC CGCGCGCCTA TCTCGACGGC GCCCGCCGTC TTGCCGCCAC CCATCCCTGA
 
Protein sequence
MYMSETETLS LPDARDLLFR AFTANGVPEG AARSTADALV AAEAEGQVGH GFSRLEDYVA 
QARSGKIVAG AEVTITRPAP TTLLVDAGHG FAYPALERAI DEGIAVAREL GTAAIAVTRS
HHCGALSIHV ERAAKAGLVA MMVVNAPAAI APWGGKTPLF GTNPIAFATP RAGSAPLVID
LSLSKVARGK VMNAKKAGKP IPEGWALDAA GNPTTDAEAA LGGTMVPIGE AKGTALALMV
EILSAVMTGA ALSTEAGSFF SADGPPPGVG QFLTLWRPPE GAEAFTARLA PLLAQIETME
GARLPGTRRL AALNAAQAHG IAVPRAYLDG ARRLAATHP