Gene Arth_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_0233 
Symbol 
ID4447324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp245239 
End bp246711 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content70% 
IMG OID639688029 
Productaldehyde dehydrogenase 
Protein accessionYP_829734 
Protein GI116668801 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACCA CAGCCGGCCA GCTGAACGCA TCAGTCGAGG CCGCCCACGC TGCCTTCGAA 
AAAGCCCGCC TGGCAGGTCC CGGAACACGG GCAGCCTGGC TCGAAGCGGT GGCCGCCGGC
CTGGAAGGCG ACGCTGTGAC CCTCATCGGA ATCGCGGCCG CGGAAACCCA CCTCGCCGAA
CCACGACTCC AGGGCGAGCT GAAGCGCACC GTCTTCCAGC TCCGGCTCTT CGCCGACGAG
ATCCGCCGCG GCGAGCACTT CGACGCGACG ATCGACCATG AGGATGCCGC CTGGGGCATG
GGGCCGCGGC CCGACCTTCG CCGCTACAAC GTGCCGCTCG GCGTGGTCGG CGTCTTTGGG
GCGTCCAACT TCCCGTTCGC CTTCAGTGTG ATGGGCGGTG ACTCCGCGTC GGCCCTGGCG
GCCGGCTGTG CCGTCGTCCA CAAGGCACAC GACGGACACC GGGAACTTGC GGTCCGCACG
GCCGAAACGG TGACCACCGC ACTCGAGGCT GCCGGGGCGC CGTCGGGCCT CTTCTCCCTG
GTCACCGGCC GCCAGGCTGC GGAGGCGCTG GTTGAGCACC CGCTGGTGAA GGCCATCGGG
TTCACGGGTT CGACGGCGGG CGGCCGGGCT TTGTTCGACC GTGCAGCTGC GCGGCCCGAA
CCGATCCCGT TCTTTGGCGA ACTGGGCGGC ATCAATGCCG TTTTCGTTAC CGGCAACGCC
TGGTCCGCGC GCCGCGAGGA GATCCTGGGC GGCTTTGCCG GCTCCTTCAC CCTGGGAATG
GGTCAGTTCT GCACCAAGCC GGGTGTGCTC TTCCTCCCGG CCGGGGAAAC TGAGAAGGTC
CGGGACAGCC TCCGGAAAGC CCTCGCGGAC TTCGCTCCGG CGCCGCTGCT CAGCGAACGG
CTGCACGAAG GGTTCCGGCA GGCAGTTGCC GGGCTTCGGG ACACGGCGGG CGTGCAGGTG
CTGGTGGACG GCGATTTCGC CGAGTCGCCG GCGCCCACCG TCCTGATGAC CACGGCCGAT
GCTGTCCGCC GGGATCCCGG CATCCTCCGC CAGGAGATGT TCGGACCGGC CAGCCTGGTG
GTCGAATACA ACGACGACTC CGAGCTCGCC GCCCTTGCCG GGCTCCTGGA AGGCCAGCTG
ACCACCACCC TGCAGGCCGA AGCGGAGGAT GACGTCGCCG AACTTGCCGG CAGGCTCGCG
GACATCAGCG GACGCCTGCT CTGGAACGGC TGGCCAACGG GGGTGACCGT CAGTTACGCC
CAGCACCACG GCGGGCCGTA CCCGGCCACG ACGTCTGGCA CCACCTCCGT GGGGACGGCC
GCCATCCGGC GGTTCCTCCG GCCGGTGGCC TTCCAGTCCT TTCCGGAGCC GCGGCTGCCG
GAGCCGCTGC AGGATGCGAA CCCGTGGAAC GTCCCGCAAA GGGTCGACGG CGTTTGGCAG
CGGCCGTCCG CACAGCCGGA CGGCCAGCCG TGA
 
Protein sequence
MNTTAGQLNA SVEAAHAAFE KARLAGPGTR AAWLEAVAAG LEGDAVTLIG IAAAETHLAE 
PRLQGELKRT VFQLRLFADE IRRGEHFDAT IDHEDAAWGM GPRPDLRRYN VPLGVVGVFG
ASNFPFAFSV MGGDSASALA AGCAVVHKAH DGHRELAVRT AETVTTALEA AGAPSGLFSL
VTGRQAAEAL VEHPLVKAIG FTGSTAGGRA LFDRAAARPE PIPFFGELGG INAVFVTGNA
WSARREEILG GFAGSFTLGM GQFCTKPGVL FLPAGETEKV RDSLRKALAD FAPAPLLSER
LHEGFRQAVA GLRDTAGVQV LVDGDFAESP APTVLMTTAD AVRRDPGILR QEMFGPASLV
VEYNDDSELA ALAGLLEGQL TTTLQAEAED DVAELAGRLA DISGRLLWNG WPTGVTVSYA
QHHGGPYPAT TSGTTSVGTA AIRRFLRPVA FQSFPEPRLP EPLQDANPWN VPQRVDGVWQ
RPSAQPDGQP