Gene Sare_2544 details

Gene Information

Locus tag: Sare_2544
Symbol:
ID: 5706398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2897358
End bp: 2898299
Gene Length: 942 bp
Protein Length: 313 aa
Translation table: 11
GC content: 72%
IMG OID: 641272007
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001537377
Protein GI: 159038124
COG category: [R] General function prediction only
COG ID: [COG0300] Short-chain dehydrogenases of various substrate specificities
TIGRFAM ID:
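
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates (the record does not state the convention) and a protein length that excludes the stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Sare_2544.
length = gene_length_bp(2897358, 2898299)
print(length)                     # 942
print(protein_length_aa(length))  # 313
```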


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.638785
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000422955
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage

Sequence

Gene sequence
GTGCGTAGGT TCGACTTCAG CGCGGCGACC GCCGTGGTCA CCGGCGCTGC CAGCGGTATC 
GGCGCAGCCC TCGCCCATGG CCTGGCCGCC CGCGGTAGCG ACCTGGTCCT GCTCGATCGC
GACGCCGCGC GCCTGGCCAC CGTCGCGGAC GCGATCCGTG CTGGGCACCC CGATCGGCGC
GTCGATCGGG TCGTCGTTGA CCTTGCCGAC GCGGCGGCCA CAGCTCGGGC CGCCGCGCAG
GTTCGCGCCC GCCATCCGCG GATCCGGCTG CTGGTCAACA ACGCAGGCGT GGCCTTGGGC
GGTCGGTTCG ACCAGGTGAC CCTGGACGAG TTCCAGTGGG TGGTCGAAAT CAACTTCCGG
GCGGTCGTGC AGCTCACGCA CGCGTTGCTG CCTGCCCTGA AGGCAGAGCC CGGTTCCCAC
CTGGTGAACG TCTCCAGCGT GTTTGGGCTG ATCGCGCCGC CTGGGCAGGC CGCCTACTCG
GCGACCAAGT TCGCCGTCCG TGGCTTTACC GAGGCCCTGC GCCACGAACT GATCGCCGAT
GGTATCGGTG TCACGTCCGT GCACCCTGGG GGCATCGCCA CCCGGATCAC CGAGAACGCG
CGTATCGGCA GTGGTGTCCG TCGGGATGAC TACGAGGAGG GCCGGCGGAA GTTCGACCGT
CTGCTCAGCA TCCCACCTGC CCGGGCCGCC GGGGTCATCC TGCGTGGCGT GGAACGCCGC
CGGCCTCGCG TCCTTGTCGG CTGGTCGGCG AAGCTGCCCG ACCTGATGGC CCGGATCGCT
CCGGGATCGT CCGGGACGCT GCTACGGGCC GGGATCGGCC GGGGTACCGG TGCGCCGGTT
CGCCGGCTGA CCACCGTGGC CGCGCCCCCG GAGGAGGCCG TGCCCCCGGC GGTGGCGAAC
GACCGGCGCT CCGAGGGAGT GTCGACGGCG GACGAGGCAT GA
 
Protein sequence
MRRFDFSAAT AVVTGAASGI GAALAHGLAA RGSDLVLLDR DAARLATVAD AIRAGHPDRR 
VDRVVVDLAD AAATARAAAQ VRARHPRIRL LVNNAGVALG GRFDQVTLDE FQWVVEINFR
AVVQLTHALL PALKAEPGSH LVNVSSVFGL IAPPGQAAYS ATKFAVRGFT EALRHELIAD
GIGVTSVHPG GIATRITENA RIGSGVRRDD YEEGRRKFDR LLSIPPARAA GVILRGVERR
RPRVLVGWSA KLPDLMARIA PGSSGTLLRA GIGRGTGAPV RRLTTVAAPP EEAVPPAVAN
DRRSEGVSTA DEA
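
The record lists translation table 11, under which an alternative start codon such as GTG is read as methionine when it initiates the CDS. The sketch below illustrates how the gene sequence maps to the protein sequence and how a GC percentage can be computed. It assumes the sequence is given 5' to 3' on the coding strand and begins at the start codon. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical, and only the first three blocks of the gene sequence are used for brevity.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# in NCBI order (bases T, C, A, G at each position, third base varying fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Start codons accepted by table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS that begins at its start codon, stopping at the first stop."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiating codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence from the record.
first_blocks = "GTGCGTAGGT TCGACTTCAG CGCGGCGACC"
print(translate_cds(first_blocks))  # MRRFDFSAAT
```

Applied to the full gene sequence, the same function should reproduce the 313-residue protein sequence listed above, and `gc_percent` can be compared against the GC content reported in the record.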