Gene Arth_3522 details

Gene Information

Locus tag: Arth_3522
Symbol:
ID: 4443832
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3960522
End bp: 3962036
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 66%
IMG OID: 639691346
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_832997
Protein GI: 116672064
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
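
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3960522
end_bp = 3962036

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1515
print(gene_length % 3)  # 0, the length is a whole number of codons
print(protein_length)   # 504
```

Under these assumptions the computed values agree with the reported gene length of 1515 bp and protein length of 504 aa.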


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTTCA CTGCCGCAGA AACCACGTCC CATTACGTTC CGCAGGACCT TCCCAGCCAC 
ATCCAGCACT ACATCAACGG TGAATTCGTT GACTCCGTGG GCGGTAAGAC CTTCGACGTC
CTGGATCCGG TTTCCAACAC CAACTACGCC ACCGCCGCCG CCGGCCAGAA GGAAGACATC
GACCTCGCCG TCGCCGCCGC CCGCGAAGCC TTCAGCAACG GTCCCTGGCC GAAGATGAAG
CCCCGCGAAC GTGCCCGGAT CCTGAACAAA ATCGCCGACG CCGTCGAAGC CCAGGAAGAA
CGCCTGGCCG AACTGGAAAC GTTCGACACC GGCCTGCCCA TCACCCAGGC CAAGGGCCAG
GCCCTGCGCG CGGCAGAGAA CTTCCGTTTC TTCGCGGACC TGATCGTGGC CCAGTTCGAC
GACGCCATGA AGGTCCCGGG CTCGCAGATC AACTACGTGA ACCGCAAGCC GATCGGCGTC
GCCGGGCTCA TCACCCCGTG GAACACCCCG TTCATGCTGG AGTCCTGGAA GCTCGCCCCG
GCACTGGCCA CCGGCAACAC CGTGGTCCTC AAGCCCGCCG AATTCACGCC GCTCTCCGCC
TCGCTGTGGG CCACCATCTT CAAGGACGCA GGCCTGCCCG ACGGCGTGTT CAACTTGGTC
AACGGCCTCG GCGAGGAAGC CGGCGACGCC CTGGTCAAGC ACCCGGACGT CCCGCTGATT
TCCTTCACCG GCGAGACCAC CACCGGGCAG ACGATCTTCC GCAACGCCGC CGCCAACCTC
AAGGGCCTCT CCATGGAACT CGGCGGCAAG TCCCCCTGCG TCGTGTTCGC CGACGCCGAC
CTCGACGCCG CAATCGACTC CGCCCTCTTC GGCGTCTTCT CCCTCAACGG CGAACGCTGC
ACCGCCGGCT CCCGCATCCT GGTGGAACGC GCCATCTACG ACGAGTTCTG CGAAAGGTAC
GCCGCCCGCG CCAAGAACAT CGTCGTCGGC GACCCCCACG ACCCCAAGAC CCAGGTGGGC
GCCCTGGTCC ACCCGGAGCA CTTCGCCAAG GTGGCCTCCT ACGTGGAGAT CGGCAAATCA
GAAGGCCGGC TCCTCGCCGG CGGCGGACGC CCGGAAGGCC TGCCGGAAGG CAACTACATC
GCCCCCACCG TGTTCGCCGA CGTCGCCCCC GACGCGAGGA TCTTCCAGGA GGAAATCTTC
GGCCCCGTCG TCGCCATCAC CCCGTTCGAG AACGACGACG AAGCACTCGC CCTCGCGAAC
AACACCAAGT ACGGTCTGGC CGCCTACATC TGGACCCAGA ACCTCACCCG CGCCCACAAC
TTCTCGCAGA ACGTCGAGGC CGGCATGGTG TGGCTGAACA GCCACAACGT CCGCGACCTC
CGCACCCCGT TCGGCGGCGT CAAGGCCTCC GGCCTGGGCC ACGAGGGCGG CTACCGCTCC
ATCGATTTCT ACACCGACCA GCAGGCCGTG CACATCACCC TCGGCACTGT CCACACCCCC
AAATTCGGCG CCTAG
 
Protein sequence
MTFTAAETTS HYVPQDLPSH IQHYINGEFV DSVGGKTFDV LDPVSNTNYA TAAAGQKEDI 
DLAVAAAREA FSNGPWPKMK PRERARILNK IADAVEAQEE RLAELETFDT GLPITQAKGQ
ALRAAENFRF FADLIVAQFD DAMKVPGSQI NYVNRKPIGV AGLITPWNTP FMLESWKLAP
ALATGNTVVL KPAEFTPLSA SLWATIFKDA GLPDGVFNLV NGLGEEAGDA LVKHPDVPLI
SFTGETTTGQ TIFRNAAANL KGLSMELGGK SPCVVFADAD LDAAIDSALF GVFSLNGERC
TAGSRILVER AIYDEFCERY AARAKNIVVG DPHDPKTQVG ALVHPEHFAK VASYVEIGKS
EGRLLAGGGR PEGLPEGNYI APTVFADVAP DARIFQEEIF GPVVAITPFE NDDEALALAN
NTKYGLAAYI WTQNLTRAHN FSQNVEAGMV WLNSHNVRDL RTPFGGVKAS GLGHEGGYRS
IDFYTDQQAV HITLGTVHTP KFGA
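
The gene and protein sequences above are shown in space-separated blocks of ten residues. The sketch below shows one way to check that the two agree: it strips the spacing, translates the DNA codon by codon, and stops at the first stop codon. It assumes the standard codon assignments, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and chosen for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGACGTTCA CTGCCGCAGA AACCACGTCC CATTACGTTC CGCAGGACCT TCCCAGCCAC"
last_line = "AAATTCGGCG CCTAG"

print(translate(first_line))  # MTFTAAETTSHYVPQDLPSH
print(translate(last_line))   # KFGA
```

The two translations match the first twenty and last four residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record and to the reported GC content.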