Gene Dshi_2442 details

Gene Information

Locus tag: Dshi_2442
Symbol: aldH1
ID: 5714098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2584873
End bp: 2586378
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 71%
IMG OID: 641268365
Product: NADP-dependent fatty aldehyde dehydrogenase
Protein accession: YP_001533777
Protein GI: 159044983
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
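
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2584873, 2586378)
print(length)                     # 1506
print(protein_length_aa(length))  # 501
```

Both results agree with the Gene Length and Protein Length fields of this record.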


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCA CCCCCCACGG AAAACACCTG ATCGCCGGCG CCTGGGTCGG GAGCGATCAG 
ACCTTCGCCT CCGATCCCGC CCACGGGCCC GCCCACGAAT TCTCCGTCGG CACGCCCGCG
CTGGTCGATC AGGCCTGCGC TGCCGCGGAG GACGCCTTTG CCAGCTATGG CTACAGCGAC
GCGGCCACCC GCGCGGCCTT CCTGAACGCC ATCGCCGACG AGATCGACGC CCGCGCCCAG
ATCATCACTG GGATCGGCAC CCAGGAAACC GGCCTGCCCG AAGCCCGCCT GCAAGGGGAG
CGGGGCCGAA CCACCGGGCA GCTGCGCCTC TTTGCCGAAC ATATCCTCAA GGGCGATTGC
CTCGACCGGC GGCATGATCC GGCTCTGCCG GACCGCGCGC CTCTGCCGCG CCCGGACCTG
AAGCTGGTCC AACGCCCCAT CGGCCCGGTC GCGGTGTTCG GCGCCTCGAA CTTCCCGCTC
GCCTTCTCGG TGGCGGGCGG CGACACCGCG GCGGCGCTGG CCGCGGGCTG CCCCGTGGTG
GTCAAGGGCC ACTCGGCCCA TCCCGGCACC GGGGAGATCG TGGCCGAAGC GATCCACGCG
GCCATCGCCA GGACCGGGAT GCCCGCGGGG GTTTTCAGCC TGATCCAGGG CGGCAAGCGC
GACGTGGGCA CAGCCCTCGT CCAACACCCG CTGATCCGCG CGGTGGGCTT CACCGGCTCG
CTCGCCGGGG GGCGGGCGCT CTTCGATCTC TGCGCCGCAC GGCCCGAGCC GATCCCGTTC
TTCGGCGAGC TGGGCTCGGT CAACCCGATG TTCCTGCTGC CCGAGGCGAT CGCGGCGCGT
GGGGCCGAGA TCGGCGCGGG CTGGGCGGGC TCGCTGGCCA TGGGGGCGGG GCAGTTCTGC
ACCAATCCCG GCATCGCGGT GGTGCTGCCG GGCGCCGACG CTTTCGTCGC CGCCGCCGAG
GCCGCCCTGC GCGAGACCGC CGCGCAAACC ATGCTGACGG AGGGGATCGC CGCGGCCTAT
CGCGACGGCG TCGCCCGTCT GGCCGCGCAC CCGCAAACCT CGGAACTGCT GGGCGCCCCC
TGCGATGGGC GCGAGGCGCA CCCCTGTCTC TACCGTGTCG CGGCCAGGGA CTGGCTGGCC
GATCACACCC TGCAAGAGGA GGTCTTCGGG CCGCTCGGCC TGGTGGTGGA GGCGCAGGAT
GCGGCCGAAA TGGCCCGGAT CGCCAGGTCC CTCCAGGGCC AGCTCACCTG TACGCTCCAC
ATGGAGGACG GCGACACCGA CCATGCGAGA TCCCTGGTGC CGCTGCTCGA ACGCAAGGCG
GGCAGGATGC TTGTCAACGG CTTCCCCACG GGCGTCGAGG TTGCCGACAG CATGGTGCAT
GGCGGGCCCT ATCCGGCCTC CACGAATTTC GGTGCGACCT CGGTCGGGAC ACTCTCGATC
CGGCGATTCC TGCGGCCCGT GTGCTATCAG AACATGCCGG ACGCCCTTTT GCCTGCCGAT
TACTGA
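
The GC content reported for this gene can be recomputed from the sequence as printed. The sketch below is a minimal illustration, not the database's own method: it assumes GC content means the fraction of G and C bases over all bases, and it strips the spaces and line breaks used in the 10-base block layout. The name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAGCTTCA"))             # 0.4
print(gc_content("ATGAGCTTCA CCCCCCACGG"))  # 0.65
```

Passing the full gene sequence to the same function gives a value that can be compared with the listed GC content field.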
 
Protein sequence
MSFTPHGKHL IAGAWVGSDQ TFASDPAHGP AHEFSVGTPA LVDQACAAAE DAFASYGYSD 
AATRAAFLNA IADEIDARAQ IITGIGTQET GLPEARLQGE RGRTTGQLRL FAEHILKGDC
LDRRHDPALP DRAPLPRPDL KLVQRPIGPV AVFGASNFPL AFSVAGGDTA AALAAGCPVV
VKGHSAHPGT GEIVAEAIHA AIARTGMPAG VFSLIQGGKR DVGTALVQHP LIRAVGFTGS
LAGGRALFDL CAARPEPIPF FGELGSVNPM FLLPEAIAAR GAEIGAGWAG SLAMGAGQFC
TNPGIAVVLP GADAFVAAAE AALRETAAQT MLTEGIAAAY RDGVARLAAH PQTSELLGAP
CDGREAHPCL YRVAARDWLA DHTLQEEVFG PLGLVVEAQD AAEMARIARS LQGQLTCTLH
MEDGDTDHAR SLVPLLERKA GRMLVNGFPT GVEVADSMVH GGPYPASTNF GATSVGTLSI
RRFLRPVCYQ NMPDALLPAD Y
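
The protein sequence corresponds to a direct translation of the gene sequence. As a minimal sketch, the code below builds a codon table from the standard compact 64-letter amino acid string (translation table 11 uses the same codon assignments as the standard code and differs in its permitted start codons) and translates until the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 6 bases of the gene sequence.
print(translate("ATGAGCTTCA CCCCCCACGG AAAACACCTG"))  # MSFTPHGKHL
print(translate("TACTGA"))                            # Y
```

The two outputs match the first ten residues and the final residue of the protein sequence above.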