Gene Information |
Locus tag | Sare_0771 |
Symbol | |
ID | 5707441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 860982 |
End bp | 862412 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270290 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001535681 |
Protein GI | 159036428 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
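The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which the listed gene length is consistent with, and that the final codon is an untranslated stop. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 860982
end_bp = 862412

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not code for an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1431 476
```

Both values agree with the Gene Length (1431 bp) and Protein Length (476 aa) rows.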
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.302759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAAT ACGCCCCCGC CCCCGAGTCC CGCTCGGTGG TACGACTCCA GCCCACGTAC GGGCTCTTCG TCGGCGGCGA CTTCGTTGAC CCGACCGACG GCGGCACCTT CAAAACGATC AACCCGGCCT CGGAGGAGGT CCTCGCCGAG ATCGCCGAGG CCAGCGCGGG CGATGTCGAC CGCGCGGTTC GCGCCGCACG GTCCGCGTAC GAGCGGATCT GGGCCCCGAT GCCGGGCCGG GACCGGGCCA AGTACCTGTT CCGAATCGCC CGGATCATCC AGGAGCGCTC CCGCGAGTTG GCGGTGCTGG AGTCGCTGGA CAACGGCAAA CCGATCAAGG AGTCCCGGGA CGTCGACCTC CCCCTGGTCG CCGCCCACTT CTTCTACTAC GCCGGCTGGG CGGACAAGCT CGACCACGCG GGGTTCGGCC CGGGCCCACG GCCACTCGGG GTGGCCGCGC AGGTCATCCC GTGGAACTTC CCCCTGCTCA TGCTGGCGTG GAAGATCGCC CCGGCGCTGG CCGCGGGCAA CACCGTGGTG CTCAAGCCCG CCGAGACGAC CCCGCTCACC GCGTTGCTCT TCGCCGAGAT CTGCCAGCAG GCCGACCTCC CCGGCGGCGT GGTGAACGTC GTCACCGGCG CTGGCGACAC CGGGCGGGCG CTGGTCGAGC ACCACGACGT GGACAAGGTC GCCTTCACCG GCTCCACCGC GGTGGGCAGG GCCATCGCCC GCGCCGTCGC CGGCACCGAA AAGAAGCTCA CCCTGGAGTT GGGTGGCAAG GCCGCCAACA TCGTCTTCGA CGACGCCGCG GTGGACCAGG CCGTCGAGGG CATCGTCAAC GGGATCTTCT TCAACCAGGG GCACGTCTGC TGCGCCGGCT CCCGGCTGCT GCTTCAGGAG TCGGTCGCCG ACCAGGTCAT GGCGTCGCTC AAACGGCGGA TGGCCCAGCT ACGGGTCGGT GACCCGCTGG ACAAGAACAC CGACATCGGG GCGATCAACT CAGCGGCGCA ACTGGCCCGC ATCAGGGAAC TCTCCGACAC CGGCGCAGCC GAGGGCGCGG AGCGCTGGTC ACCGCCGTGC GAGCTGCCCG ACCGGGGCTT CTGGTTCGCA CCGACCGTCT TCACCGGCGT CACCCAGGCC CACCGGATCG CCCGCGAGGA GATCTTCGGT CCGGTGCTGT CCGTGCTGAC CTTCCGCACC CCGGCGGAGG CCGTCGAGAA GGCCAACAAC ACGCCGTACG GGCTCTCCGC CGGGGTGTGG ACGGAGAAGG GCTCCCGGAT GCTGTGGATG GCCGACCGGC TCCGCGCCGG CGTGGTCTGG GCCAACACGT TCAACAGGTT CGACCCCACC TCGCCGTTCG GCGGTTACCA GGAGTCGGGT TACGGTCGCG AGGGCGGCCG GCACGGGCTG GAGGCGTACC TGAATGTCTG A
|
Protein sequence | MFEYAPAPES RSVVRLQPTY GLFVGGDFVD PTDGGTFKTI NPASEEVLAE IAEASAGDVD RAVRAARSAY ERIWAPMPGR DRAKYLFRIA RIIQERSREL AVLESLDNGK PIKESRDVDL PLVAAHFFYY AGWADKLDHA GFGPGPRPLG VAAQVIPWNF PLLMLAWKIA PALAAGNTVV LKPAETTPLT ALLFAEICQQ ADLPGGVVNV VTGAGDTGRA LVEHHDVDKV AFTGSTAVGR AIARAVAGTE KKLTLELGGK AANIVFDDAA VDQAVEGIVN GIFFNQGHVC CAGSRLLLQE SVADQVMASL KRRMAQLRVG DPLDKNTDIG AINSAAQLAR IRELSDTGAA EGAERWSPPC ELPDRGFWFA PTVFTGVTQA HRIAREEIFG PVLSVLTFRT PAEAVEKANN TPYGLSAGVW TEKGSRMLWM ADRLRAGVVW ANTFNRFDPT SPFGGYQESG YGREGGRHGL EAYLNV
|
| |
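The sequence rows can be checked against the GC content and protein fields with a few lines of code. This is a minimal sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code. The gene sequence is used as listed, since it already begins with ATG and ends with TGA.

```python
def clean(seq: str) -> str:
    """Drop the spaces that split the listed sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTTCGAAT ACGCCCCCGC CCCCGAGTCC"
print(translate(first_blocks))  # prints: MFEYAPAPES
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content rows.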