Gene Daro_1008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1008 
Symbol 
ID3568433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1100668 
End bp1102188 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content62% 
IMG OID637679467 
Productaldehyde dehydrogenase 
Protein accessionYP_284234 
Protein GI71906647 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones64 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTACG CATCCCCCGG TACCGCCGGC GCCAAGATCG CCTACAAAGC TCAATACGAC 
AACTTCATTG GCGGCAAGTG GGTCGCCCCG GTCAATGGCC AGTACTTTGA CGTCGTCACT
CCGATTTCCG GCAAGCCTTA CACCAAGGCT GCCCGCTCCA CCGCCGAGGA CATGGAACTG
GCACTGGACG CCGCCCATGC CGCCTTCCCG ACCTGGGGCA AGACCTCTGC CGCCGACCGC
GCCAACGTGC TGCTGAAGAT CGCCGACCGC CTCGAAGCCA ACCTCGAACT GCTCGCCTAT
GCCGAGACGG TCGATAACGG CAAGGCCATT CGCGAAACCC TGAACGCCGA CATCCCGCTC
GCCGTCGACC ACTTCCGTTA CTTCGCCGGC TGCCTGCGCG CCCAGGAAGG CGGCATTTCG
GAAATCGACG AAAACACCAT GGCTTACCAT ATCCATGAGC CCCTCGGCGT TGTCGGCCAG
ATCATCCCGT GGAACTTCCC GATCCTGATG GCGGCCTGGA AACTGGCCCC GGCCATCGGC
GCCGGCAACT GCGTGGTGCT CAAGCCAGCT GAATCGACCC CGATCTCCAT CCTGATCCTG
GCCGAACTGA TCGCCGACAT TCTGCCGGCC GGCGTGCTGA ACATCGTCAA CGGCTATGGC
CGCGAAGCCG GCATGCCGCT CGCCACCAGC AAGCGCATCG CCAAGATCGC CTTCACCGGC
TCGACCTCCA CCGGCCGCGT CATCGCCCAG GCCGCTGCCA ACAACCTGAT TCCGGCCACC
CTGGAACTCG GCGGCAAGTC ACCGAACGTC TTCTTCGCCG ACATCATGGA CAAGGATGAC
AGCTTCCTCG ACAAGGCCGT CGAAGGCATG GTGCTGTTTG CCTTCAACCA GGGCGAAGTT
TGCACCTGCC CGTCGCGCGC CCTGATCCAG GAATCGATCT ACGAGAAGTT CATGGAGCGC
GTCCTGAAGC GTGTGGCTGC CATCAAGCAG ATCAGCCCGC TCGATACCGA CTGCATGATG
GGTGCCCAGT GCTCGCAGGA ACAGATGACC AAGATCCAGT CCTATCTGGA ACTCGGCAAG
CAGGAAGGCG CCGAATGCCT GATCGGTGGC GAACGCGCTC ATCTGGGCGG CGACCTTGAA
GGCGGCTACT ACATCCAGCC GACCATGTTC AAGGGTCACA ACAAGATGCG CATCTTCCAG
GAAGAAATCT TCGGGCCGGT GCTCGCTGTG ACCACCTTCA AGGACGAAGC CGAAGCCCTG
GCCATCGCCA ACGACACCAT CTACGGCCTC GGCGCCGGCG TCTGGAGCCG TAACGGCAAC
GTCGCCTACC GCATGGGTCG CGCCATCCAG GCCGGCCGCG TGTGGACCAA TTGCTACCAC
GCCTACCCGG CGCACGCTGC CTTCGGCGGC TACAAGGAAT CCGGTATCGG CCGCGAGACC
CACAAGGTCA TGCTCGACCA CTACCAGCAA ACGAAGAACC TGCTTGTTTC GTACAGCGAA
ACCAAGCTGG GCTTCTTCTA A
 
Protein sequence
MLYASPGTAG AKIAYKAQYD NFIGGKWVAP VNGQYFDVVT PISGKPYTKA ARSTAEDMEL 
ALDAAHAAFP TWGKTSAADR ANVLLKIADR LEANLELLAY AETVDNGKAI RETLNADIPL
AVDHFRYFAG CLRAQEGGIS EIDENTMAYH IHEPLGVVGQ IIPWNFPILM AAWKLAPAIG
AGNCVVLKPA ESTPISILIL AELIADILPA GVLNIVNGYG REAGMPLATS KRIAKIAFTG
STSTGRVIAQ AAANNLIPAT LELGGKSPNV FFADIMDKDD SFLDKAVEGM VLFAFNQGEV
CTCPSRALIQ ESIYEKFMER VLKRVAAIKQ ISPLDTDCMM GAQCSQEQMT KIQSYLELGK
QEGAECLIGG ERAHLGGDLE GGYYIQPTMF KGHNKMRIFQ EEIFGPVLAV TTFKDEAEAL
AIANDTIYGL GAGVWSRNGN VAYRMGRAIQ AGRVWTNCYH AYPAHAAFGG YKESGIGRET
HKVMLDHYQQ TKNLLVSYSE TKLGFF