Gene Sare_3883 details

Gene Information

Locus tag: Sare_3883
Symbol:
ID: 5706380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4425600
End bp: 4426580
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 66%
IMG OID: 641273308
Product: alcohol dehydrogenase
Protein accession: YP_001538665
Protein GI: 159039412
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.014306
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.116687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGAA TCCGCTTTTA CGCGTACGGC TCGTCCAAAG TCCTCACCCT CCAGGACCTC 
GACAAGCCCG CCGTCGGTGA CGACGACGTG CTGGTCCGGG TGCGGGCGGC TTCGGTCAAC
GTCGTGGACT GGCACACCAT GAGGGGTACG CCGTACATCA TGCGGGCGCG GGGTGGGATG
TCCCGCCCCA AGGTCAACGA GCTGGGCTTC GACCTGGCCG GGCAAGTCGA AGCGGTGGGC
AGGAACGTCA CCACCCTGCG GGTGGGCGAC GAGGTCTTCG GCTGTCAGGA CCTGGAACAC
GCGGGCGTGT TCGCCGAGTA CGTCACCATT CCCCACGATG CGGGAGTGCT GAAGAAGCCG
GTCGGGCTGT CCCTGGAACA GGCGGCTTCC GTGCCGGTGG CGGCACTCAC CGCCTACCAG
GCACTACGTC ACCACGGGCG GCTGCAACCC GGCCACAAGA TCCTGGTCAA CGGTGCGGCA
GGAGGCGTGG GAACCTTCGC CGTGCAGATC GGCAAGGCGC TGGGCGCCGA GGTAACCGCC
GTGTGCAGCA CCAGGAACGT CGAGATGGTC CGCGCTCTGG GTGCCGACCA CGTCATCGAC
TACACCACAG AGGACTTCAC TCACCGCGCG CAACGCCACG ACATCCTCCT CGACAACATC
GGAAACCACC CGCTCTCCGC ATGCCGGCGC GTGCTCACCC CCCGGGGGAC CCTCGTCCTG
AACAGCGGCA CGGGAGGCCC ACTACTCGGA CCCCTGGGCC GGGTACTCCG TGGGCTCACC
CTGTCCTTGT TCGTACGGCA GCGTCTGGTG TTCTTCCTGG CACGCCCCAC CAAGGGCGAT
CTGGAAGCAC TTCGCGACCT GCTCGAATCC GGGAAGGTCA CCCCGGTCAT CGACCGGACA
TATCCCCTCA GCGAGCTGCC CAAGGCGATC AGCTACCTCG AGACAGGGCA CGTCCGGGGA
AAGGTCGTCA TCACCATCTG A
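
The reported lengths and GC content can be checked against the gene sequence above. The following is a minimal sketch, not part of the original record: the helper names (clean, gc_percent, expected_protein_length) are hypothetical, and it assumes the coding sequence includes its terminal stop codon, as the 981 bp / 326 aa figures suggest.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed present)."""
    return cds_length_bp // 3 - 1


# First line of the gene sequence, truncated to three blocks for illustration.
first_blocks = "ATGAAAGGAA TCCGCTTTTA CGCGTACGGC"
print(len(clean(first_blocks)))        # 30
print(expected_protein_length(981))    # 326
```

Pasting the full gene sequence into gc_percent gives a value that can be compared with the GC content listed in the record.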
 
Protein sequence
MKGIRFYAYG SSKVLTLQDL DKPAVGDDDV LVRVRAASVN VVDWHTMRGT PYIMRARGGM 
SRPKVNELGF DLAGQVEAVG RNVTTLRVGD EVFGCQDLEH AGVFAEYVTI PHDAGVLKKP
VGLSLEQAAS VPVAALTAYQ ALRHHGRLQP GHKILVNGAA GGVGTFAVQI GKALGAEVTA
VCSTRNVEMV RALGADHVID YTTEDFTHRA QRHDILLDNI GNHPLSACRR VLTPRGTLVL
NSGTGGPLLG PLGRVLRGLT LSLFVRQRLV FFLARPTKGD LEALRDLLES GKVTPVIDRT
YPLSELPKAI SYLETGHVRG KVVITI
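
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative only and not part of the original record: the names BASES, AMINO_ACIDS, CODON_TABLE and translate are hypothetical. It uses the amino-acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons (not needed here, since the gene begins with ATG).

```python
# Codon table built from the conventional TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    s = "".join(seq.split()).upper()  # drop the spacing between blocks of 10
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAAAGGAA TCCGCTTTTA CGCGTACGGC"))  # MKGIRFYAYG
# The final line of the gene sequence gives the last six residues.
print(translate("AAGGTCGTCA TCACCATCTG A"))           # KVVITI
```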