Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_3528 |
Symbol | |
ID | 5060002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 4044681 |
End bp | 4046186 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640475782 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001160337 |
Protein GI | 145596040 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR03216] 2-hydroxymuconic semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.757321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGT GGGATGCGCG CCAGATGCAG GGCCAGGCGC CGGGCGGGAC GGCCCTGCTG CCCAACTTCA TCGCTGGCGA GTTCGTCGAC TCGGGTCGCC GTTTCACCAA GCACAGCCCG GTTACCGGGG AGCCGCTCTT CGAGGTCGTC GAGGCGGATC AGGCCGGAGT GGACGACGCG GTGGCCGCCG CGCGTGCGGC GCTGCGTGGC CCGTGGGGTC GGCTCGGCGA ACGAGAGCGC GCCGAGGTGC TGCTCCGGAT CGCCGACGAA CTGGAGCGCC GCTTTGATGA CCTGGTCACC GCTGAGGTGG CGGACACCGG CAAGACGATT TCGCAGGCCC GGACCCTGGA CATCCCGCGC GGCGCGGCCA ACTTCCGTGC CTTCGCCGAG ATCGCGGCGA CCACACCGAC CGAGTCGTTC AGCACCCGTA CCCCATCCGG TGGCCGTGCG CTGAACTACG CGCTCCGCAA GCCAGTCGGC GTGGTCGCGG TGATCGTCCC GTGGAACCTG CCGTTGCTCC TGCTCACCTG GAAGGTCGCG CCCGCGTTGG CGTGTGGCAA CGCGGTGGTG GTCAAGCCCA GTGAGGAGAC ACCCGCCTCG GCGACCCTGC TCGCCGAGGT GATGGCCGCC GCCGGCGTGC CCGAGGGCGT CTTCAACGTC GTGCACGGCT TCGGGCCCGG CTCGGCGGGG GAGTTTCTTA CCTCGCACCC CGACGTCAAC GCGATCACCT TCACCGGCGA GTCAACCACC GGCGGAGCCA TCATGCGGGC CGCTTCCGAC GGGGTGAAGG CGGTCTCCTT CGAGCTGGGT GGCAAGAACG CCGGGCTGGT CTTCGCCGAC GCCGACCTGT CGGCGGCGGT GGCCGGCTCG GTGCGGTCCA GCTTCACCAA CGGTGGCCAG GTCTGCCTCT GCACCGAACG CATCTACGTG CAGCGTCCGG TCTTCGCGGA GTTCACCGCC CGGCTGGCCG AGCGGGCCGC CGAACTGCCG TACGGGTGGC CCGTCGACGA GGCCACGGCA AACATGCCGC TGATCTCGCC CGTTCACCGG GAGAAGGTGC TCGGCCACTA CGAACTGGCC CGAGCCGAGG GGGCGCAGGT CCTCGCCGGC GGCGGTACGC CGCGCTTCGG TGACGCCCGC GACGGCGGCG CGTACATCCA GCCGACGGTG CTCGCCGGGC TCGGCCCGGA CGCGCGAACC AACCGGGAGG AGATCTTCGG CCCGGTCGTG CACGTGGCGC CCTTCGACGA CGAGGAGGAG GCGTACGCGC TGGCCAACGG CACCGAGTAC GGCCTGGCGG CGGCGGTGTG GACCCGAGAC GTGGGCCGGG CCCACCGGGC CGGTGCCCGG TTGGACGCGG GAATCGTCTG GGTCAACACC TGGTACCTGC GTGACCTGCG TACCCCGTTC GGTGGGGTGA AGTCCTCCGG CGTCGGCCGC GAGGGCGGCG TTCATTCGCT GGGTTTCTAT TCCGAGCTCA CCAACGTCTG TGTGGACCTG ACATGA
|
Protein sequence | MTRWDARQMQ GQAPGGTALL PNFIAGEFVD SGRRFTKHSP VTGEPLFEVV EADQAGVDDA VAAARAALRG PWGRLGERER AEVLLRIADE LERRFDDLVT AEVADTGKTI SQARTLDIPR GAANFRAFAE IAATTPTESF STRTPSGGRA LNYALRKPVG VVAVIVPWNL PLLLLTWKVA PALACGNAVV VKPSEETPAS ATLLAEVMAA AGVPEGVFNV VHGFGPGSAG EFLTSHPDVN AITFTGESTT GGAIMRAASD GVKAVSFELG GKNAGLVFAD ADLSAAVAGS VRSSFTNGGQ VCLCTERIYV QRPVFAEFTA RLAERAAELP YGWPVDEATA NMPLISPVHR EKVLGHYELA RAEGAQVLAG GGTPRFGDAR DGGAYIQPTV LAGLGPDART NREEIFGPVV HVAPFDDEEE AYALANGTEY GLAAAVWTRD VGRAHRAGAR LDAGIVWVNT WYLRDLRTPF GGVKSSGVGR EGGVHSLGFY SELTNVCVDL T
|
| |