Gene Sare_4174 details


Gene Information

Locus tag: Sare_4174
Symbol:
ID: 5703962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4741112
End bp: 4742062
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 69%
IMG OID: 641273601
Product: lactate/malate dehydrogenase
Protein accession: YP_001538954
Protein GI: 159039701
COG category: [C] Energy production and conversion
COG ID: [COG0039] Malate/lactate dehydrogenases
TIGRFAM ID: [TIGR01763] malate dehydrogenase, NAD-dependent
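
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon (both assumptions are consistent with the numbers in this record, but are not stated by the source). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one stop codon is included in the length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4741112, 4742062)
print(length, protein_length_aa(length))  # 951 316
```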


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.714924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00171312
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGTAAGA AGGTCACTGT CGTCGGGGCC GGCTTCTACG GCTCCACCAC CGCGCAGCGC 
TTGGCCGAGT ACGACGTCTT CGACACTGTC GTCATCACCG ACATCGTGGA GGGCAAGCCG
GCCGGCCTCG CCTTGGACCT CAACCAGTCG CGAGCCATCG AGGGCTTCGA GACCAAGCTC
GTCGGCGTGA CCACCGGCCC GAACGGCGAG GGCTACGAGG CCATCGAGGG CTCGGACGTC
GTCGTCGTCA CCGCCGGCCT GCCGCGCAAG CCCGGCATGA GCCGGATGGA CCTGCTGGAG
ACCAACGCCA AGATCGTTCG GCAGGTCGCC GAGAACGTCG CCAAGTACGC CCCGAACGCC
GTCGTGATCG TGGTCTCCAA CCCGCTCGAC GAGATGACCG CGCTGGCCCA GATCGCCACC
CAGTTCCCGC ACAACCGGGT GCTCGGCCAG GCCGGCATGC TGGACACCGC CCGGTTCACC
AACTTCGTGG CCGAGGCGCT CGGCGTACCG GTGACGTCGG TGCGGACCCT GACGTTGGGT
TCGCACGGCG ACACCATGGT CCCGGTCCCG TCGAAGAGCA GCGTGGCCGG TAAGCCGCTG
CGCGAGGTCA TGCCCGCCGA GCAGATCGAG GACCTGGTGG TCAAGACCCG CAACGGCGGT
GCCGAGGTGG TGGCCCTGCT GAAGACCGGT TCGGCGTACT ACGCCCCGTC CGCCGCCGCC
GCCCGGATGG CGAAGGCCGT CGCCGAAGAC TCCGGCGAGG TCATGCCGGT CTGCGCCTGG
GTGGACGGTG AGTACGGCAT CTCCGGGGTC TACCTCGGCG TGGAGGCCGA GATCGGTGCG
CAGGGGGTCC GCCGGGTCGT CGAGACCGAC CTGGACGCCG ACGAACTGGC CGCCCTGAAG
GAGGCAGCCG AGGCGGTCCG AGCCAAGCAG GGCGACGTCG CCAGCATGTG A
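
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before any per-base statistic, such as the GC content field, can be recomputed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used for the demonstration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGGGTAAGA AGGTCACTGT CGTCGGGGCC GGCTTCTACG GCTCCACCAC CGCGCAGCGC"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing all sixteen rows of the gene sequence to `clean_sequence` and then to `gc_fraction` would give the value to compare against the GC content field.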
 
Protein sequence
MGKKVTVVGA GFYGSTTAQR LAEYDVFDTV VITDIVEGKP AGLALDLNQS RAIEGFETKL 
VGVTTGPNGE GYEAIEGSDV VVVTAGLPRK PGMSRMDLLE TNAKIVRQVA ENVAKYAPNA
VVIVVSNPLD EMTALAQIAT QFPHNRVLGQ AGMLDTARFT NFVAEALGVP VTSVRTLTLG
SHGDTMVPVP SKSSVAGKPL REVMPAEQIE DLVVKTRNGG AEVVALLKTG SAYYAPSAAA
ARMAKAVAED SGEVMPVCAW VDGEYGISGV YLGVEAEIGA QGVRRVVETD LDADELAALK
EAAEAVRAKQ GDVASM
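
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact NCBI listing (base order TCAG); translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG. The `translate` helper is hypothetical and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGGGTAAGA AGGTCACTGT CGTCGGGGCC GGCTTCTACG GCTCCACCAC CGCGCAGCGC"
print(translate(first_row))  # MGKKVTVVGAGFYGSTTAQR
```

The output matches the first twenty residues of the protein sequence listed above.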