Gene Arth_2364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2364 
Symbol 
ID4445129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2650585 
End bp2652012 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content69% 
IMG OID639690172 
Productaldehyde dehydrogenase 
Protein accessionYP_831843 
Protein GI116670910 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0648683 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACACTC CCGCCGCCGC CGTCGCCCGA TCCCGCGAGC TTTTCGACAG CGGGGTGAGC 
CGCCCCCTGG ACTGGCGGCT GGAGCAACTG GGCAATCTGC GCAGGATGCT GACGGAACGC
CGTGAGGATT TTGCCGGCGC ACTGCTCAGC GACCTGGGCA AGCACCGGAG TGAATCACAG
ATGACCGAAA TCGGTTTTGT GGCCGCGGAA ACGGCCCATC TGGAACGGCA TCTCGCGGGC
TGGCTTCGGC GCCGCCGGGT GGACGTTCCG CTTGCCATGC AGCCGGCCCG GGCCTGGACC
GAGCTGACTC CGCTCGGTGT GGTGCTGGTG ATCGGCACCT GGAACTACCC GGTCCAGCTG
ACGCTGGCGC CGATGGCCGG CGCCCTGGCG GCCGGCAACA CCGTGGTCGT CAAGCCGAGC
GAGCACGCCC CCGCCACCTC AGCCGCCCTG GTCCGCTGGT TGCCTGAGTA CCTGGGCGGT
GCCGCCGAGG TGGTGCCCGG AGGCATCCCG GCAACCAAGG CACTCCTGGC TGAGCGCTTC
GACCACATCT TCTTCACCGG CGGCCAGGAC GCGGCCCGGG TGGTCATGCG GGCGGCAGCC
GAACACCTGA CGCCGGTGAC CCTGGAACTG GGTGGAAGGT GCCCGGCCTT CGTGGACGGG
ACCGCCGACC TCGAGACCAC CGCCGGACGC CTTGCCTGGG GCCGGTTCAT GAACGCGGGG
CAAACGTGCG TTGCACCTGA CTATGTGCTG GCCGCCCCCG AGGTTCTGGA CGCGCTGGAA
CCGCTGCTTG TGGACGCCAT CACCGCCATG TTCGGAAAGG ACCCGGCGTC CAGCGCTTCC
TATGGCCGGA TCGTGGATGA CCGGCACTTT GAAAGGATCG CCGAACTGGC GGACGGCAGC
ACCGTGGTGC ACGGAGGGCA GCGGGACCCC GGCAGCAGGT ATTTTGCACC GACGCTGTTG
CGCCCCGCCC CCGGCGATGC AGTCCTGGGC GAGGAAATCT TCGGTCCGCT GCTGCCGCTG
GTGCCGGTAT CCGGGCGGGA CGAAGCCATC CGCATGATCA ATTCCGGCTC CAAGCCGCTG
GCCGTGTATG TCTTCAGCGA AGAGGACGCC GTGCGCAGCG CCTTCGCGGC GGAAACGTCG
TCAGGCGCGC TGGCCTACGG AGCACCGGCG GCACACCTCA CGGTTCCGGG CCTCCCGTTC
GGCGGGGTGG GCGGCAGCGG CATGGGGGCA TACCACGGTG AACATTCCGT GCGGACGTTC
TCGCACGAAC GGGCAGGCAT GGACAAGCCG CTGTGGCCGG ACACCCTCGG GCTGGCCTAT
CCGCCTTACG GGACGGCGAA AGACAGGGTG GTCACCGCCC TGCTGTCATT GGCCGGGCGG
GTGCCGGGCC GCCCGAAAGC CGGCCGTCCC GGGGACCGCA AGGCCTAG
 
Protein sequence
MHTPAAAVAR SRELFDSGVS RPLDWRLEQL GNLRRMLTER REDFAGALLS DLGKHRSESQ 
MTEIGFVAAE TAHLERHLAG WLRRRRVDVP LAMQPARAWT ELTPLGVVLV IGTWNYPVQL
TLAPMAGALA AGNTVVVKPS EHAPATSAAL VRWLPEYLGG AAEVVPGGIP ATKALLAERF
DHIFFTGGQD AARVVMRAAA EHLTPVTLEL GGRCPAFVDG TADLETTAGR LAWGRFMNAG
QTCVAPDYVL AAPEVLDALE PLLVDAITAM FGKDPASSAS YGRIVDDRHF ERIAELADGS
TVVHGGQRDP GSRYFAPTLL RPAPGDAVLG EEIFGPLLPL VPVSGRDEAI RMINSGSKPL
AVYVFSEEDA VRSAFAAETS SGALAYGAPA AHLTVPGLPF GGVGGSGMGA YHGEHSVRTF
SHERAGMDKP LWPDTLGLAY PPYGTAKDRV VTALLSLAGR VPGRPKAGRP GDRKA