Gene Sare_2316 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2316 
Symbol 
ID5707160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2658874 
End bp2659845 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content73% 
IMG OID641271794 
Productaldo/keto reductase 
Protein accessionYP_001537165 
Protein GI159037912 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00724126 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAACAGC GACCGCTCGG CCGAAGCGGG CTGGCGGTAT CGCGGCTCGC GCTCGGCACC 
ATGACCTGGG GCCGGGACAC CGATGCCGAC GACGCGGCGG CCCAGCTGAG GAGCTATCTC
GACGCGGGCG GCAACCTGAT CGACACCGCC GACGTCTACG GCGATGGGGA CGCGGAGTCG
GTCATCGGCT CGCTCCTCGG CACACTGGTC CCCCGCGAGG AACTGCTGAT CGCGACCAAG
GCGGGGCTGC GCCCGGGCAA CGGCCGGCGC CGCGACGGCT CCCGCGGTCA CCTGCTGCGT
ACCCTCGACG CCTCGCTGCG CCGGCTCGGC ACCGACCACG TCGACCTGTT CCAGGTGCAC
GGGTACGACC CGGACACACC GCTGGAGGAG AGCCTCGCCG CCCTGGACCA CGCGGTCGCC
AGCGGGCGGG TCCGGTACGT CGGTGTCTCC AACTTCTCCG GCTGGCAGAC CGCCCGCGCC
GCGGCCTGGC AGGCGGCCTG GCCCGGCCGG TCGCCCGTGG TGGCCGCCCA GGTGGAGTAC
TCGCTGCTGG AACGCGGCAT CGAGCGGGAG GTGCTACCGG CCTGCACAGC CCTCGGGCTG
GGCGTGCTGC CCTGGTCACC GTTGGGGCGC GGGGTGCTGA CCGGCAAGTA CCGCAACGGC
CGACCAGCCG ACTCCCGGGC GGCCTCACCG CACTTCGAAC GGTTCGTCGC GACCTACCTG
GAGCCGCGCT GCTCCAGCAT CGTGGAGGCG GTCGCCACCG CAGCGGGTGG TCTCGGTGTC
TCACCGCTGG AGGTCGCGCT GGCCTGGATC CGCGACCGGC CCGGGGTCGT CGCGCCGATC
CTCGGCGCAC GCACCGTGGG GCAGCTGCTC GGCGCGCTCC AGGTCGAACA GATGACCCTG
CCGGAGGAGA TCACCACGGC CCTCAACGAC GTCTCCGCGG TGCCGGTCGG CTACCCGGAA
CGCGACGGCT GA
 
Protein sequence
MQQRPLGRSG LAVSRLALGT MTWGRDTDAD DAAAQLRSYL DAGGNLIDTA DVYGDGDAES 
VIGSLLGTLV PREELLIATK AGLRPGNGRR RDGSRGHLLR TLDASLRRLG TDHVDLFQVH
GYDPDTPLEE SLAALDHAVA SGRVRYVGVS NFSGWQTARA AAWQAAWPGR SPVVAAQVEY
SLLERGIERE VLPACTALGL GVLPWSPLGR GVLTGKYRNG RPADSRAASP HFERFVATYL
EPRCSSIVEA VATAAGGLGV SPLEVALAWI RDRPGVVAPI LGARTVGQLL GALQVEQMTL
PEEITTALND VSAVPVGYPE RDG