Gene Information |
Locus tag | Arth_3522 |
Symbol | |
ID | 4443832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 3960522 |
End bp | 3962036 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639691346 |
Product | 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
Protein accession | YP_832997 |
Protein GI | 116672064 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase |
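
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 3960522
END_BP = 3962036
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 3962036 - 3960522 + 1 = 1515,
# and 1515 / 3 - 1 = 504.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```
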
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTTCA CTGCCGCAGA AACCACGTCC CATTACGTTC CGCAGGACCT TCCCAGCCAC ATCCAGCACT ACATCAACGG TGAATTCGTT GACTCCGTGG GCGGTAAGAC CTTCGACGTC CTGGATCCGG TTTCCAACAC CAACTACGCC ACCGCCGCCG CCGGCCAGAA GGAAGACATC GACCTCGCCG TCGCCGCCGC CCGCGAAGCC TTCAGCAACG GTCCCTGGCC GAAGATGAAG CCCCGCGAAC GTGCCCGGAT CCTGAACAAA ATCGCCGACG CCGTCGAAGC CCAGGAAGAA CGCCTGGCCG AACTGGAAAC GTTCGACACC GGCCTGCCCA TCACCCAGGC CAAGGGCCAG GCCCTGCGCG CGGCAGAGAA CTTCCGTTTC TTCGCGGACC TGATCGTGGC CCAGTTCGAC GACGCCATGA AGGTCCCGGG CTCGCAGATC AACTACGTGA ACCGCAAGCC GATCGGCGTC GCCGGGCTCA TCACCCCGTG GAACACCCCG TTCATGCTGG AGTCCTGGAA GCTCGCCCCG GCACTGGCCA CCGGCAACAC CGTGGTCCTC AAGCCCGCCG AATTCACGCC GCTCTCCGCC TCGCTGTGGG CCACCATCTT CAAGGACGCA GGCCTGCCCG ACGGCGTGTT CAACTTGGTC AACGGCCTCG GCGAGGAAGC CGGCGACGCC CTGGTCAAGC ACCCGGACGT CCCGCTGATT TCCTTCACCG GCGAGACCAC CACCGGGCAG ACGATCTTCC GCAACGCCGC CGCCAACCTC AAGGGCCTCT CCATGGAACT CGGCGGCAAG TCCCCCTGCG TCGTGTTCGC CGACGCCGAC CTCGACGCCG CAATCGACTC CGCCCTCTTC GGCGTCTTCT CCCTCAACGG CGAACGCTGC ACCGCCGGCT CCCGCATCCT GGTGGAACGC GCCATCTACG ACGAGTTCTG CGAAAGGTAC GCCGCCCGCG CCAAGAACAT CGTCGTCGGC GACCCCCACG ACCCCAAGAC CCAGGTGGGC GCCCTGGTCC ACCCGGAGCA CTTCGCCAAG GTGGCCTCCT ACGTGGAGAT CGGCAAATCA GAAGGCCGGC TCCTCGCCGG CGGCGGACGC CCGGAAGGCC TGCCGGAAGG CAACTACATC GCCCCCACCG TGTTCGCCGA CGTCGCCCCC GACGCGAGGA TCTTCCAGGA GGAAATCTTC GGCCCCGTCG TCGCCATCAC CCCGTTCGAG AACGACGACG AAGCACTCGC CCTCGCGAAC AACACCAAGT ACGGTCTGGC CGCCTACATC TGGACCCAGA ACCTCACCCG CGCCCACAAC TTCTCGCAGA ACGTCGAGGC CGGCATGGTG TGGCTGAACA GCCACAACGT CCGCGACCTC CGCACCCCGT TCGGCGGCGT CAAGGCCTCC GGCCTGGGCC ACGAGGGCGG CTACCGCTCC ATCGATTTCT ACACCGACCA GCAGGCCGTG CACATCACCC TCGGCACTGT CCACACCCCC AAATTCGGCG CCTAG
|
Protein sequence | MTFTAAETTS HYVPQDLPSH IQHYINGEFV DSVGGKTFDV LDPVSNTNYA TAAAGQKEDI DLAVAAAREA FSNGPWPKMK PRERARILNK IADAVEAQEE RLAELETFDT GLPITQAKGQ ALRAAENFRF FADLIVAQFD DAMKVPGSQI NYVNRKPIGV AGLITPWNTP FMLESWKLAP ALATGNTVVL KPAEFTPLSA SLWATIFKDA GLPDGVFNLV NGLGEEAGDA LVKHPDVPLI SFTGETTTGQ TIFRNAAANL KGLSMELGGK SPCVVFADAD LDAAIDSALF GVFSLNGERC TAGSRILVER AIYDEFCERY AARAKNIVVG DPHDPKTQVG ALVHPEHFAK VASYVEIGKS EGRLLAGGGR PEGLPEGNYI APTVFADVAP DARIFQEEIF GPVVAITPFE NDDEALALAN NTKYGLAAYI WTQNLTRAHN FSQNVEAGMV WLNSHNVRDL RTPFGGVKAS GLGHEGGYRS IDFYTDQQAV HITLGTVHTP KFGA
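
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The amino acid assignments are those of NCBI translation table 11, which assigns the same amino acids as the standard code and differs only in which codons may act as initiators. This gene starts with ATG, so no special handling of alternative start codons is included.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Any trailing bases that do not form a full codon are ignored.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGACGTTCA CTGCCGCAGA AACCACGTCC"))  # MTFTAAETTS
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields of this record.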