Gene Information |
Locus tag | Sare_1725 |
Symbol | gabD1 |
ID | 5703424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1996437 |
End bp | 1997852 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271228 |
Product | succinic semialdehyde dehydrogenase |
Protein accession | YP_001536603 |
Protein GI | 159037350 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
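
The coordinates and lengths listed above agree with one another, which a few lines of arithmetic can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the convention under which the listed numbers match) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1996437
end_bp = 1997852

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1416
print(protein_length)  # 471
```

Both results match the "Gene Length" and "Protein Length" rows.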
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.450617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000946553 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCATCG CCACCACAAA CCCCGCCACC GGACGGACAG TCAGGACGTA CGAGCCCTTC TCGCCCGAGC AGATCGATGC GGCGATCGAC CGTAGCCACC TCGCGTACCG GAACCTGCGC GACACCACCG TCGCGCAGCG CGCTGCCTGG CTCGACCGGG CCGCTGACCT ACTCGACGTC GAGCGCGACG AGGCCGCCCG AATGGCGACC ACGGAGATGG GCAAGACGTA CGCCGCCGCA CGGGCCGAGG TGACCAAGTG CGCCAGCGCC TGCCGCTTCT ATGCGAGGAA GGCGCCCGAG TTCCTCGCCG ACGAGCCCGC TGACGCGGCG AGCGTCGGTG CGACCCGGGC GTTCGTGCGG TACCAACCGA TCGGGCCGGT GCTCGCGGTG ATGCCGTGGA ACTTCCCATT CTGGCAGGTG CTGCGCTTCG CCGCGCCGGC GCTGATGGCC GGCAACACCG GCCTGCTCAA GCACGCCTCG AATGTGCCAC AGACCGCTCT CTATCTGGCG GACCTGTTCC GCCGGGCCGG CTTTCCGGAA GGTGCGTTCG GTGCGCTGCT GGTCGGCTCC GACGCGGTGG AGGCCATTCT GAGTGACCCC CGGGTCCGCG CCGCGACGCT CACCGGCAGC GAGCGTGCGG GCCGTGCCAT CGCCCAGATC GCTGGCCGGG AGTTGAAGAA GACCGTGTTG GAACTCGGCG GCAGCGACCC GTTCGTGGTG ATGCCCTCGG CCGATCTGGA CCGGGCCGCC GAGGTCGCCA CCGTCGCCCG TTGCCAGAAC AACGGCCAGT CCTGTATCGC CGCGAAGCGC TTCATCGTGC ACACCGACGT GTTCGACGCC TTCGCGGAGC GGTTCGCCGC GCGCATGTCC GCGCTGCGGG TGGGTGACCC GATGGAGGAC ACCACCGAGG TGGGTCCGCT CGTCAGCGAA GGAGGCCGTG CTGAGATCAT CGACCAGGTA CGCGACGCCG TTGACCTGGG TGCGACCATC CTCTGTGGTG GTGAGCGGCC GGAGCGGGAC GGCTGGTACT ACCCGCCCAC CGTCGTCACC GACCTCACCC CGGAGATGCG GATGTGGACC GAGGAGGTAT TCGGGCCGGT CGCCGGGCTG TACCGGGTGT CGTCGTACGA CGAGGCGATC GAGGTTGCCA ACGGCACCGC GTTCGGGCTC GGCGCGAACG CCTGGACTCG AGATCAGCGG GAACAGGAGC GGTTCGCCAT CGACTTGGAG GCCGGCAACG TCTTCGTCAA CGGTATGACC ACATCCTTTC CGGAGCTGCC GTTCGGCGGG GTGAAGAACT CCGGGTACGG CCGGGAACTG TCCGCGCTGG GAATGCGCGA GTTCTGCAAC ACCAAGACCG TGTGGGTCGG TGGTGCGGAC GATGCCACCT GGTCGGTGGG AACGCACGCC GAGTGA
Protein sequence | MSIATTNPAT GRTVRTYEPF SPEQIDAAID RSHLAYRNLR DTTVAQRAAW LDRAADLLDV ERDEAARMAT TEMGKTYAAA RAEVTKCASA CRFYARKAPE FLADEPADAA SVGATRAFVR YQPIGPVLAV MPWNFPFWQV LRFAAPALMA GNTGLLKHAS NVPQTALYLA DLFRRAGFPE GAFGALLVGS DAVEAILSDP RVRAATLTGS ERAGRAIAQI AGRELKKTVL ELGGSDPFVV MPSADLDRAA EVATVARCQN NGQSCIAAKR FIVHTDVFDA FAERFAARMS ALRVGDPMED TTEVGPLVSE GGRAEIIDQV RDAVDLGATI LCGGERPERD GWYYPPTVVT DLTPEMRMWT EEVFGPVAGL YRVSSYDEAI EVANGTAFGL GANAWTRDQR EQERFAIDLE AGNVFVNGMT TSFPELPFGG VKNSGYGREL SALGMREFCN TKTVWVGGAD DATWSVGTHA E
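
Both sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the DNA. It relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code (the two differ only in which codons may act as starts). The function names `clean`, `gc_fraction`, and `translate` are hypothetical helpers written for this example, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
prefix = "ATGTCCATCG CCACCACAAA CCCCGCCACC"
print(translate(prefix))  # MSIATTNPAT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content. Ambiguity codes such as `N` are not handled and would raise a `KeyError`.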