Gene Sare_1725 details

Gene Information

Locus tag: Sare_1725
Symbol: gabD1
ID: 5703424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 1996437
End bp: 1997852
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 69%
IMG OID: 641271228
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_001536603
Protein GI: 159037350
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
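
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
START_BP = 1996437
END_BP = 1997852

gene_length = feature_length(START_BP, END_BP)
print(gene_length)                           # 1416
print(expected_protein_length(gene_length))  # 471
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the stated assumptions about the coordinate convention.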


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.450617
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000946553
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGTCCATCG CCACCACAAA CCCCGCCACC GGACGGACAG TCAGGACGTA CGAGCCCTTC 
TCGCCCGAGC AGATCGATGC GGCGATCGAC CGTAGCCACC TCGCGTACCG GAACCTGCGC
GACACCACCG TCGCGCAGCG CGCTGCCTGG CTCGACCGGG CCGCTGACCT ACTCGACGTC
GAGCGCGACG AGGCCGCCCG AATGGCGACC ACGGAGATGG GCAAGACGTA CGCCGCCGCA
CGGGCCGAGG TGACCAAGTG CGCCAGCGCC TGCCGCTTCT ATGCGAGGAA GGCGCCCGAG
TTCCTCGCCG ACGAGCCCGC TGACGCGGCG AGCGTCGGTG CGACCCGGGC GTTCGTGCGG
TACCAACCGA TCGGGCCGGT GCTCGCGGTG ATGCCGTGGA ACTTCCCATT CTGGCAGGTG
CTGCGCTTCG CCGCGCCGGC GCTGATGGCC GGCAACACCG GCCTGCTCAA GCACGCCTCG
AATGTGCCAC AGACCGCTCT CTATCTGGCG GACCTGTTCC GCCGGGCCGG CTTTCCGGAA
GGTGCGTTCG GTGCGCTGCT GGTCGGCTCC GACGCGGTGG AGGCCATTCT GAGTGACCCC
CGGGTCCGCG CCGCGACGCT CACCGGCAGC GAGCGTGCGG GCCGTGCCAT CGCCCAGATC
GCTGGCCGGG AGTTGAAGAA GACCGTGTTG GAACTCGGCG GCAGCGACCC GTTCGTGGTG
ATGCCCTCGG CCGATCTGGA CCGGGCCGCC GAGGTCGCCA CCGTCGCCCG TTGCCAGAAC
AACGGCCAGT CCTGTATCGC CGCGAAGCGC TTCATCGTGC ACACCGACGT GTTCGACGCC
TTCGCGGAGC GGTTCGCCGC GCGCATGTCC GCGCTGCGGG TGGGTGACCC GATGGAGGAC
ACCACCGAGG TGGGTCCGCT CGTCAGCGAA GGAGGCCGTG CTGAGATCAT CGACCAGGTA
CGCGACGCCG TTGACCTGGG TGCGACCATC CTCTGTGGTG GTGAGCGGCC GGAGCGGGAC
GGCTGGTACT ACCCGCCCAC CGTCGTCACC GACCTCACCC CGGAGATGCG GATGTGGACC
GAGGAGGTAT TCGGGCCGGT CGCCGGGCTG TACCGGGTGT CGTCGTACGA CGAGGCGATC
GAGGTTGCCA ACGGCACCGC GTTCGGGCTC GGCGCGAACG CCTGGACTCG AGATCAGCGG
GAACAGGAGC GGTTCGCCAT CGACTTGGAG GCCGGCAACG TCTTCGTCAA CGGTATGACC
ACATCCTTTC CGGAGCTGCC GTTCGGCGGG GTGAAGAACT CCGGGTACGG CCGGGAACTG
TCCGCGCTGG GAATGCGCGA GTTCTGCAAC ACCAAGACCG TGTGGGTCGG TGGTGCGGAC
GATGCCACCT GGTCGGTGGG AACGCACGCC GAGTGA
 
Protein sequence
MSIATTNPAT GRTVRTYEPF SPEQIDAAID RSHLAYRNLR DTTVAQRAAW LDRAADLLDV 
ERDEAARMAT TEMGKTYAAA RAEVTKCASA CRFYARKAPE FLADEPADAA SVGATRAFVR
YQPIGPVLAV MPWNFPFWQV LRFAAPALMA GNTGLLKHAS NVPQTALYLA DLFRRAGFPE
GAFGALLVGS DAVEAILSDP RVRAATLTGS ERAGRAIAQI AGRELKKTVL ELGGSDPFVV
MPSADLDRAA EVATVARCQN NGQSCIAAKR FIVHTDVFDA FAERFAARMS ALRVGDPMED
TTEVGPLVSE GGRAEIIDQV RDAVDLGATI LCGGERPERD GWYYPPTVVT DLTPEMRMWT
EEVFGPVAGL YRVSSYDEAI EVANGTAFGL GANAWTRDQR EQERFAIDLE AGNVFVNGMT
TSFPELPFGG VKNSGYGREL SALGMREFCN TKTVWVGGAD DATWSVGTHA E
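
The gene and protein sequences above are printed in blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute a GC fraction, and translate a coding sequence. All function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the set of permitted start codons, and this sketch does not handle alternative starts (the gene here begins with ATG, so none is needed).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base outermost, third base innermost).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = clean_sequence("ATGTCCATCG CCACCACAAA CCCCGCCACC")
print(translate(prefix))  # MSIATTNPAT
```

The translated prefix matches the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content field and the complete protein sequence.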