Gene Information |
Locus tag | Saro_1197 |
Symbol | |
ID | 3916494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1245211 |
End bp | 1246557 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443933 |
Product | aldehyde dehydrogenase |
Protein accession | YP_496476 |
Protein GI | 87199219 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
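The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record, but both agree with its numbers. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Start bp / End bp fields of this record.
length = gene_length_bp(1245211, 1246557)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```

Both results match the Gene Length (1347 bp) and Protein Length (448 aa) fields. The strand field does not enter the calculation, since the length of a feature is the same on either strand.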
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.238764 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCCC CGACCGCCGC CGACCTTTCC GCCGACATCG CACGCGTCTT CGCACTCCAG CAGGCGCACA TGTGGGAGGC CAAGGCCTCC ACCGCGGCCG AGCGCAAGGA AAAGCTCGCG CGCCTCAAGG CCGCCGTCGA AGCCCACGCC GACGACATCG TCGCCGCCGT CCTCGAAGAC ACGCGCAAGC CGGTTGGCGA AATCCGCGTG ACCGAAGTCC TCAACGTCAC CGCCAACATC CAGCGCAACA TCGACAATCT CGATGAATGG ATGAAGCCGG TCGAGGTCGC CACCTCGCTC AATCCCGCCG ACCGCGCGCA GATCATCCAC GAAGCGCGCG GCGTCTGCCT GATCCTTGGC CCCTGGAACT TCCCCCTCGG CCTCGCGCTC GGTCCGGTCG CCGCTGCCAT CGCCGCAGGC AACACCTGCA TCGTGAAGCT CACCGACCTC TGCCCCGCCA CCGCAAGGGT GGCCTCGGTG ATCGTCAGGG AAGCGTTCGA CGAAAAGGAT GTGGCTCTGT TCGAAGGCGA CGTCTCGGTC GCCACCGCGC TCCTCGATCT GCCGTTCAAC CACGTCTTCT TCACCGGCTC GCCCCGCGTC GGCAAGATCG TGATGGCCGC TGCCGCAAAG CACCTCACCA GCGTCACGCT CGAACTTGGC GGGAAGTCGC CCGTCATCGT CGACGATAGC GCCGACATCG ATCAGGTCGC CGCCCAGCTC GCCGCGGCCA AGCAGTTCAA CGGCGGGCAG GCCTGCATCA GCCCGGACTA CGTCTTCGTG AAGGAAGACA AGAAGGCCGC GCTGGTCGAA GGCTTCCGGG CCAACGTGCA GAAGAACCTC TATGACGATG CCGGCAACCT GAAGAAGGAC AGCATCGCCC AGGTGGTCAA CAAGGCGAAC TTCGACCGCG TGAAGGCCAT GTTCGACGAT GCCGTCGCCA AGGGCGCGAC CGTCGCCGCC GGCGGAACGT TCGAAGCCGA TGACCTCACC ATCCATCCGA CCATGCTGAC CGGCGTCACC CCGCAGATGA CCATCCTCCA GGACGAAATC TTCGCCCCCG TCATCCCGGT GATGACCTAC GACACGCTCG ACCAGGCGAT CGGCTACATC GAAGCCCGCG ACAAGCCGCT CGCACTCTAT GTCTACAGCA AGGACGAAGC GAACGTCGAA AAGGTCCTCG CCCGCACCTC GTCGGGCGGT GTCACGGTGA ATGGCGTGTT CTCGCACTAC CTGGAAAACA ACCTGCCGTT CGGCGGCGTC AACACCAGCG GCATGGGCAG CTACCACGGC GTGTTCGGCT TCAAGTGCTT CAGCCACGAA CGGGCTGTCT ACCGCCACCA GCAGTAA
|
Protein sequence | MTAPTAADLS ADIARVFALQ QAHMWEAKAS TAAERKEKLA RLKAAVEAHA DDIVAAVLED TRKPVGEIRV TEVLNVTANI QRNIDNLDEW MKPVEVATSL NPADRAQIIH EARGVCLILG PWNFPLGLAL GPVAAAIAAG NTCIVKLTDL CPATARVASV IVREAFDEKD VALFEGDVSV ATALLDLPFN HVFFTGSPRV GKIVMAAAAK HLTSVTLELG GKSPVIVDDS ADIDQVAAQL AAAKQFNGGQ ACISPDYVFV KEDKKAALVE GFRANVQKNL YDDAGNLKKD SIAQVVNKAN FDRVKAMFDD AVAKGATVAA GGTFEADDLT IHPTMLTGVT PQMTILQDEI FAPVIPVMTY DTLDQAIGYI EARDKPLALY VYSKDEANVE KVLARTSSGG VTVNGVFSHY LENNLPFGGV NTSGMGSYHG VFGFKCFSHE RAVYRHQQ
|
| |
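The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates codons using the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names are hypothetical, and the sketch does not handle alternative start codons, which table 11 permits but which this record (starting with ATG) does not need.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field, used here for brevity.
prefix = "ATGACTGCCC CGACCGCCGC CGACCTTTCC"
print(translate(prefix))    # MTAPTAADLS
print(gc_fraction(prefix))  # 0.7
```

The translated prefix matches the first block of the Protein sequence field. The GC value printed here covers only these 30 bases; applying `gc_fraction` to the full Gene sequence field would be the way to compare against the reported GC content.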