Gene Ksed_14840 details


Gene Information

Locus tag: Ksed_14840
Symbol:
ID: 8372992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 1525259
End bp: 1526704
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 73%
IMG OID: 644991756
Product: NAD-dependent aldehyde dehydrogenase
Protein accession: YP_003149274
Protein GI: 256825314
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
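
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1525259
END_BP = 1526704
GENE_LENGTH_BP = 1446
PROTEIN_LENGTH_AA = 481


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose last codon is a stop codon.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1446
print(expected_protein_length(GENE_LENGTH_BP))  # 481
```

Both results agree with the Gene Length and Protein Length fields of the record.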


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.0170872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.129561
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCG AGCCGACGAC CGACCGGGCC GAGCCGACCA CCGACAGCAC GGGCACGACC 
GCGCAGGGCG GCGGCTTCAC CCCGGACCTG CGCGGCGTCC ACACGCTGGC CCGCCGGACC
TGGGAGTCCG GGCGCCTGCG CAGCCTCGAG GCCCGGCGCG AGCAGCTGGA GGGGCTGAAG
CGCCTGGTGC GCGAGGGCGG CGACGAGCTC GCGGCCGCGC TGCAGCAGGA CCTCGGGAAG
TCCCCCACCG AGGCCCGCAC CACCGAGCTG TCGGTGGTGG TGACCGAGGT CGAGTACGTG
CTCAAGCACC TCAAGGGCTG GTTGGAGCCG CGCAAGGCGG CGGTGCCGCT GGCCTTCCAG
CCCGCCAGCG GTCGGGTCCG CCGGGAGCCG CTGGGGTCGG TGCTCATCAT CGGGCCGTGG
AACTACCCCG TGAACCTCGT GCTGATGCCG CTGGTGGGCG CCTTGGCCGG GGGCAACACG
GTCGTGCTCA AGCCCAGCGA GCTCACCCCT GCCACCGCCG AGGCCCTGGC CCGGCTGGTG
CCGCGCTACC TGGACCCGGA GGTCGTGCAG GTGGTGAACG GCGGCGTGCC GGAGAGCACC
GCCCTGCTCG AGCTGCCCTG GGACCACGTC TTCTACACCG GGGGCGAGCG CGTGGGGCGG
ATCGTGATGC GGGCCGCGGC CGAGCACCTG ACGCCGGTGA CCCTGGAGCT CGGCGGCAAG
TCCCCCACCT GGGTGGGCAC CGAGACCGAC CTGCGGACGG CGGCCCGCCG CATCGTGTGG
TCGAAGTTCG TCAACGCCGG GCAGACCTGC GTGGCCCCCG ACCACGTGCT GTGCACCGCC
AGCACCCAGG CCGAGCTGGT GCCCGAGCTG GAGCGTGCGA TCCGCGAGAT GTTCGGGGAC
GACCCGCGCA CCAGCGCGGA CTACGGCCGC ATCGTGAACA CCGAGCACGC CGAGCGGCTG
GCCGGCCTGG TGGACGGCGC GGCGATCGGT GGTGAGGTGG ACGTCGCGGG GCGCTACCTC
TCCCCCACGG TGCTCACCGA CGTCACCGAC GACCACCCGG CCATGGCCGA GGAGATCTTC
GGACCGGTGC TGCCTATCGT CCCGGTGGCC GACGTGCACG ACGCGATCCG CCGCGTCAAC
GCGCGGCCGC ACCCGCTGGC GCTGTACCTG TTCACCGACG ACCTGGACGA GCAGGACCTG
TGGCTGGCCA GCACGCGCTC GGGGGGCGTC GGCATCAACA TGCCCCTGGT GCACGTGGCC
GTGCCGGAGC TGCCCTTCGG TGGCGTCGGC GCCAGCGGCA TGGGCAACTA CCACGGGCTG
GCCTCGCTGG AGACCTTCAC CCACGAGCGC TCCGTGCTCT CCAAGCCGCT GGCCCCGGAC
ACCATGCGGA TCGTCTACCC GCCCTACGGC CCGGTGAAGC AGCGCCTCAT CCGCGCCGTG
CAGTGA
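
The gene sequence is displayed in space-separated blocks of ten bases, so the spacing has to be removed before any calculation. As a rough sketch of how a figure such as the GC content field could be recomputed from the listing, the helpers below (hypothetical names, not taken from the source database) strip whitespace and count G and C bases. Only the first line of the sequence is used here for illustration; the full listing would be pasted in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed.
first_line = (
    "ATGAGCACCG AGCCGACGAC CGACCGGGCC "
    "GAGCCGACCA CCGACAGCAC GGGCACGACC"
)

print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.0%}")
```

Applied to the complete sequence, the cleaned length should equal the Gene Length field, which gives a quick check that nothing was lost when copying.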
 
Protein sequence
MSTEPTTDRA EPTTDSTGTT AQGGGFTPDL RGVHTLARRT WESGRLRSLE ARREQLEGLK 
RLVREGGDEL AAALQQDLGK SPTEARTTEL SVVVTEVEYV LKHLKGWLEP RKAAVPLAFQ
PASGRVRREP LGSVLIIGPW NYPVNLVLMP LVGALAGGNT VVLKPSELTP ATAEALARLV
PRYLDPEVVQ VVNGGVPEST ALLELPWDHV FYTGGERVGR IVMRAAAEHL TPVTLELGGK
SPTWVGTETD LRTAARRIVW SKFVNAGQTC VAPDHVLCTA STQAELVPEL ERAIREMFGD
DPRTSADYGR IVNTEHAERL AGLVDGAAIG GEVDVAGRYL SPTVLTDVTD DHPAMAEEIF
GPVLPIVPVA DVHDAIRRVN ARPHPLALYL FTDDLDEQDL WLASTRSGGV GINMPLVHVA
VPELPFGGVG ASGMGNYHGL ASLETFTHER SVLSKPLAPD TMRIVYPPYG PVKQRLIRAV
Q