Gene Sare_2412 details

Gene Information

Locus tag: Sare_2412
Symbol:
ID: 5703696
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2777256
End bp: 2778167
Gene Length: 912 bp
Protein Length: 303 aa
Translation table: 11
GC content: 68%
IMG OID: 641271889
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001537260
Protein GI: 159038007
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
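
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and the function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the final codon is the stop codon


length = gene_length(2777256, 2778167)
print(length)                  # 912
print(protein_length(length))  # 303
```

Both values agree with the Gene Length and Protein Length fields of the record.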


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.330008
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCG ACGCACGTTC GCTGAGGGGG AAGGTAGCCC TGGTGGCCGG CGGTACCCGA 
GGTGGCGGCC GGGGTATCGC CGTTGAACTC GGTGCCGCCG GCGCAACGGT GTACGTCACC
GGACGAAGTG GCACCGGTGA ACGCTCTGAC CTCGACCGAC CGGAAACGAT CGAGCAGACC
GCCGAGCAGG TCACCGCCGC AGGCGGTCTG GGAATTCCTG TCCGAACCGA CCACAGCCGC
CCCGAGCAGG TCGAGGCCCT CGTCAACCGG ATCGCCACCG AGCAGGACGG CCAGCTCGAC
GTGGTGGTCA ATTCCGTATG GGGTGGCGAT CCGCTGACCG ACTGGGAACA TCCCCTGTGG
GAGCAGGACC TGGCCACCGG CCTACGGCTG CTGCGGCAGG CGGTGGAAAC CCACATCATC
ACCAGCCGCT TCGCGTTGCC CCTTCTGGTC GCCCGTGGCA GCGGCCTGGT TGTGGAAGTC
ACCGACGGTA ACACCGCCCG CTATCGCGGC ACTCTCTTCT ATGACCTGGC AAAGTCCGCG
GTCATTCGCC TCGCCGTCGC CCAGGCCGCC GAGCTCAAGC CGCATGGCGT GGCGGCGGTA
GCCATCACGC CTGGTTTCCT CCGCTCGGAG GCCCTGCTCG AGCACTTCGG TGTCACCGAA
GCCAACTGGC GCGACGGCGC GGCCCTGGAT CCGAACTTCG CCCATTCCGA GACCCCGGCC
TACCTCGGCC GAGCCGTTGC CGCCCTGGCC GCTGACCCAC ACATCATGGC CAAGTCCGGA
CGTGCCCTGG CCACCTGGGG CCTGTATCAG GAGTACGGTT TCACCGATGC CGACGGCACC
CAACCGGACT TCGCAGCCCA CTGGGCCAAA AACCTGGAGG AACAGCATGG GCCCCTCGGA
GACCCGCTCT AA
 
Protein sequence
MTTDARSLRG KVALVAGGTR GGGRGIAVEL GAAGATVYVT GRSGTGERSD LDRPETIEQT 
AEQVTAAGGL GIPVRTDHSR PEQVEALVNR IATEQDGQLD VVVNSVWGGD PLTDWEHPLW
EQDLATGLRL LRQAVETHII TSRFALPLLV ARGSGLVVEV TDGNTARYRG TLFYDLAKSA
VIRLAVAQAA ELKPHGVAAV AITPGFLRSE ALLEHFGVTE ANWRDGAALD PNFAHSETPA
YLGRAVAALA ADPHIMAKSG RALATWGLYQ EYGFTDADGT QPDFAAHWAK NLEEQHGPLG
DPL
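
The gene and protein sequences above are printed in blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal sketch of how one might do that, compute a GC fraction, and translate the coding sequence. It uses the standard codon assignments, which translation table 11 shares; table 11 differs mainly in which codons may act as start codons, and that is not handled here. The helper names are hypothetical, and only the first line of the gene sequence is used as input.

```python
# Codon table in NCBI order (bases T, C, A, G). The amino-acid assignments
# are the same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display blocking."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence from the record (60 nt).
first_line = clean(
    "ATGACGACCG ACGCACGTTC GCTGAGGGGG AAGGTAGCCC TGGTGGCCGG CGGTACCCGA"
)
print(translate(first_line))  # MTTDARSLRGKVALVAGGTR
print(round(gc_fraction(first_line), 2))
```

The translated fragment matches the first 20 residues of the protein sequence listed in the record.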