Gene Sare_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3901 
Symbol 
ID5705839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4441498 
End bp4442424 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content70% 
IMG OID641273326 
Productacetaldehyde dehydrogenase 
Protein accessionYP_001538683 
Protein GI159039430 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4569] Acetaldehyde dehydrogenase (acetylating) 
TIGRFAM ID[TIGR03215] acetaldehyde dehydrogenase (acetylating) 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.59815 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGTCG GGGTGGCAGT GCTCGGTTCC GGGAACATCG GAACCGACTT GATGATCAAG 
GTTTTGCGAC TCAGCGACAG CCTGCGGATG GTCGCCATGG CGGGCATCGA TTCGGGCTCC
GACGGGCTGG CCCGAGCCCG GCGGCTCGGT GTCACCACGA CCGCCGACGG GGTGGCGGGG
CTCGTGACGT TGCCCGAGTT CGCCGACGTG GAGTTGGTCT TCGACGCCAC GTCGGCCGGG
GCCCACCGGC ACCACGACTC CGTGCTGCGT GCCTACGGTC GGATCGTGGT CGACCTGACC
CCCGCCGCGA TCGGGCCGTA CGTGGTGCCG CCGGTCAATC TCGACGAGCA CCTGGCGGAG
ACCAACGTCA ACATGGTCAC CTGTGGTGGG CAGGCGACCG TGCCGATCGT CGCCGCCATC
GGCCGGGTCA CCCCGGTCGC GTACGGGGAG ATCGTCGCCT CGATCGCCTC GAAATCCGCC
GGGCCAGGCA CCCGGGCCAA CATCGACGAG TTCACCGAGA CCACCGCCCG GGCGATCGAG
GTGGTCGGTG GTGCCGATCG GGGCAAGGCC ATCATCGTGC TGAACCCGGC CGACCCGCCG
CTGCTGATGC GGGACACCGT GTACTGCCTC TGCCCGGACA CCGACGCGGA CCGGAGCGCG
ATCATCGCCG CGGTCACCGA CATGGTGGGC GCTGTGCAGG AGTACGTCCC CGGCTACCGG
CTCAAGCAGG AGGTGCAGTT CGACCGGGTG GACAGCTACC TGCCGGCGCT CGGTGGGCAC
CTCACCGGCC TACAGGTCTC GGTTTTCCTG GAGGTCTCCG GTGCCGGGCA CTACCTGCCC
GAGTACGCCG GGAACCTGGA CATCATGACC TCGGCCGCCC TGCGTACCGC AGAGCGGCTG
ATCGGCCGGC GGGCGGTGAC GGCATGA
 
Protein sequence
MSVGVAVLGS GNIGTDLMIK VLRLSDSLRM VAMAGIDSGS DGLARARRLG VTTTADGVAG 
LVTLPEFADV ELVFDATSAG AHRHHDSVLR AYGRIVVDLT PAAIGPYVVP PVNLDEHLAE
TNVNMVTCGG QATVPIVAAI GRVTPVAYGE IVASIASKSA GPGTRANIDE FTETTARAIE
VVGGADRGKA IIVLNPADPP LLMRDTVYCL CPDTDADRSA IIAAVTDMVG AVQEYVPGYR
LKQEVQFDRV DSYLPALGGH LTGLQVSVFL EVSGAGHYLP EYAGNLDIMT SAALRTAERL
IGRRAVTA