Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4001 |
Symbol | |
ID | 5704887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4552431 |
End bp | 4553927 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641273426 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001538782 |
Protein GI | 159039529 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00109814 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCTG TACATGTTCC GGGCACTCCG CTGATCGAGG ACGGTCGACT CCTGTCGACA AACCCGGCGA CCGGCGTGGA GGCCGGGCGG TTGCCGGTTG CGTCACCCGC CGATGTCGAG GCGGCCGTCG CACGAGCCCG GGCCGCCAGC GGGTGGTGGG CCGGTCTCGA CGTCGCCGGT CGCCGGCAAC GGCTACTGCG CTGGCGGGCC CGACTGGCAT CCCGGATCGA GGAGCTCGCC GAACTGGTGC ACGTCGAGGG CGGCAAGCCG GTCGCCGAAG CTGTCGTCGA GGTGGTCACC GCCGTCGAGC ACGTCGACTG GGCCGCGCGT AACGCCAAGC GGGTGCTCGG TCCCCGCCGA GTACGATCCC GGCTGATGCT GGCGGAGTTC GCCGCCCACC TGGAATACCA GCCCTACGGG GTGATCGGCG TGATCGGCCC GTGGAACTAC CCCGTGTTCA CCCCGATCGG CTCCATCGCC TACGCCCTTG CCGCCGGCAA TGCCGTGGTG CTCAAGCCCA GTGAGTACAC CCCGGCCGTC GGCCAGTGGC TGGTCGACAG CTTCGCGGAG ATCGTCCATG AGGAGCCGGT GTTCACCGCG GTGCACGGCC TCGGTGACGT CGGCGCGGCA CTGTGCCGAT CCGGAGTCGA CAAGGTGGCG TTCACCGGTT CCACCGCGAC TGGTCGGAAG GTGATGGCCG CCTGCGCCGA ATCACTGACC CCCGTGCTCA TCGAGGCGGG CGGCAAGGAC GCGATGATCG TCGACACGGA CGCCGACCTG GACGCCGCCG CCGAAGCCGC CGTCTGGGGC GGTCTCACCA ACGCCGGTCA GACCTGTATC GGCATCGAAC GGGTGTACGC CGTCGAAGGG GTCTTCGACG GCTTCGTCGA CCGGGTGGTC CAGCGGGCCG AGCGGCTGAC CGTGGGCGCC GACGGCACCG ATATCGGCCC GATCACCATG CCGAGCCAGC TGGAGGTGAT CCGTCGGCAC ATCGACGACG CGTTGGCCCG GGGTGGGCGG GCGGTGCTCG GCGGGGCAGA TGCGGTGCAG CCGCCGTACG TACAGCCGAC CGTGCTGGTG GACGTGCCCG AGGACTCGGT CGCCGTACGC GAGGAGACCT TCGGTCCGAC CCTGACCATC AACCGGGTCC GCGATGTCGA CGAGGCAGTC GCCCGGACCA ACGCCCTGCG CTACGGGCTC GGCGGATCGG TCTTCGGCCG ACGGCGGGCC ATGGCGGTCG CACAGCGCCT GCGCTCCGGG ATGGCCTCGG TGAACTCGGC CCTGACCTTC GCCGGAATGT CCACCCTGCC ATTCGGCGGC GTGGGCAACT CCGGTTTCGG GCGCATCCAC GGCGCGGACG GGCTTCGGGA GTTCGCCCGG CCCAAGGCCA TCACCCGCCG CCGGGCCAGA TCGCTGCTGC CCGCGACGAC CTTCGATCGC ACCGACGCGG ACGTCGCCCG ACTGGTCAAG CTCGTCAAGC TCTTCTACGG TCGCTGA
|
Protein sequence | MTAVHVPGTP LIEDGRLLST NPATGVEAGR LPVASPADVE AAVARARAAS GWWAGLDVAG RRQRLLRWRA RLASRIEELA ELVHVEGGKP VAEAVVEVVT AVEHVDWAAR NAKRVLGPRR VRSRLMLAEF AAHLEYQPYG VIGVIGPWNY PVFTPIGSIA YALAAGNAVV LKPSEYTPAV GQWLVDSFAE IVHEEPVFTA VHGLGDVGAA LCRSGVDKVA FTGSTATGRK VMAACAESLT PVLIEAGGKD AMIVDTDADL DAAAEAAVWG GLTNAGQTCI GIERVYAVEG VFDGFVDRVV QRAERLTVGA DGTDIGPITM PSQLEVIRRH IDDALARGGR AVLGGADAVQ PPYVQPTVLV DVPEDSVAVR EETFGPTLTI NRVRDVDEAV ARTNALRYGL GGSVFGRRRA MAVAQRLRSG MASVNSALTF AGMSTLPFGG VGNSGFGRIH GADGLREFAR PKAITRRRAR SLLPATTFDR TDADVARLVK LVKLFYGR
|
| |