Gene Ksed_11010 details

Gene Information

Locus tag: Ksed_11010
Symbol:
ID: 8372609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Kytococcus sedentarius DSM 20547
Kingdom: Bacteria
Replicon accession: NC_013169
Strand:
Start bp: 1127601
End bp: 1128977
Gene Length: 1377 bp
Protein Length: 458 aa
Translation table: 11
GC content: 73%
IMG OID: 644991379
Product: NAD-dependent aldehyde dehydrogenase
Protein accession: YP_003148906
Protein GI: 256824946
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
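
The listed coordinates and lengths are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but it matches the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1127601, 1128977)
print(length, protein_length_aa(length))  # prints: 1377 458
```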


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value: 0.148552
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0213171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTCC TGACAGCCAC CGACCCCACC ACCGGCCGCG TCGTCCGCGA GGTCCCCGCA 
GCCGGCCCCG ACGAGGTGAC CGCCACCCTC GACCGCGCCC AGCAGGCGTT CGCGTCCTGG
CGCGAGCGGA CCGTCGCCGA GCGGGCTGAG GTGCTGCGCG CCGTCGCCGA GCACCTGCGG
GAGCACACCG AGGAGTACGC CCCGCTGATG ACCCAGGAGA TGGGCAAGCC GATCACCGAG
GCCCGCGGCG AGGTCGGCAA GGCCGCGTGG TGCGCCGAGC ACTACGCCGA GCACGCTGCG
GAGTACCTGG CCGACGAGCA CATCGCCTCC GACGCCACGG AGTCCTGGGT GCAGTACCTG
CCGCTGGGCC CGGTGCTGGG GATCCTGCCG TGGAACGCCC CGTTCTGGCT GGCGCTGCGC
TTCGCCGCAC CGGCGCTCAT GGCCGGCAAC ACCTGCGTCA TGAAGCACGA CCCCCACGTG
CCCGGGTGCG CCCAGGCCCT CGAGGCGGCG TTCACCGCCG CCGGGGCGCC CGCCGGGGTC
TTCCAGGCCC TGGTGACCAC CACCGAGAAC ACCGAGCGGG CGATCCGCGA CCCCCGCACC
CGGGCGGTCT CCTTCACCGG CTCCGACCGT GCCGGGGCGA TCGTGGCCTC CGTGGCAGCC
AGCGAGATCA AGCCCGCGGT GCTGGAGCTC GGCGGCTCGG ACCCCTTCAT CGTGCTGGCG
GACGCCGACC TGCCGCGGGC GGCGAAGGTC GCGGCGCAGT CGCGCATCAT CAACGCCGGC
CAGTCCTGCA TCGCCGCGAA GCGGATCATC GTGGAGGGCT CCGTGCACGA CGAGTTCGTG
GAGCTGCTCA CCGAGGAGCT CGCGGGGCTG GTGATGGGCG ACCCGTCGCA GGAGACCACC
CAGGTGGGCC CCATCGCCCG GGAGGACCTG CGGGAGAACC TGCACCGGCA GGTGACCGCC
AGCATCGAGG CCGGGGCCAC CTGCGTGCTC GGAGGAGAGC TGCCCGAGGG CGACGGGTGG
TTCTACCCGG TCACCCTGCT CACCGGCGTC GACGACTCGA TGACGGTGTG CACCGAGGAG
ACCTTTGGTC CGGTGGCCGC GGTGGTCGCC GTGGACGACG CCGAGGCCGC GATCGCCCTG
GCCAACGACA CCCCCTTCGG ACTGGGGGCG GCGATCTGGA CGGAGACCGG GCGCGGCACG
GCGATGGCCC GCCGCATCGA GGCCGGCCAG GTGAGCGTCA ACGGCATCGT GAAGACCGAC
CCCCGGCTGC CCTCCGGCGG CATCAAGCGC TCCGGCTACG GGCGCGAGCT CGGCCCGCAC
GGCATCAAGG AGTTCGTGAA CGCCCAGCAG GTCTGGGTGG GGCCCTCCAC CGCCTGA
 
Protein sequence
MTVLTATDPT TGRVVREVPA AGPDEVTATL DRAQQAFASW RERTVAERAE VLRAVAEHLR 
EHTEEYAPLM TQEMGKPITE ARGEVGKAAW CAEHYAEHAA EYLADEHIAS DATESWVQYL
PLGPVLGILP WNAPFWLALR FAAPALMAGN TCVMKHDPHV PGCAQALEAA FTAAGAPAGV
FQALVTTTEN TERAIRDPRT RAVSFTGSDR AGAIVASVAA SEIKPAVLEL GGSDPFIVLA
DADLPRAAKV AAQSRIINAG QSCIAAKRII VEGSVHDEFV ELLTEELAGL VMGDPSQETT
QVGPIAREDL RENLHRQVTA SIEAGATCVL GGELPEGDGW FYPVTLLTGV DDSMTVCTEE
TFGPVAAVVA VDDAEAAIAL ANDTPFGLGA AIWTETGRGT AMARRIEAGQ VSVNGIVKTD
PRLPSGGIKR SGYGRELGPH GIKEFVNAQQ VWVGPSTA
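
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that underlies the "GC content" field. The helper names are hypothetical, and how the source database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example input: the first displayed line of the gene sequence.
first_line = "ATGACCGTCC TGACAGCCAC CGACCCCACC ACCGGCCGCG TCGTCCGCGA GGTCCCCGCA"
print(f"{gc_fraction(clean_sequence(first_line)):.2f}")
```

Passing the full gene sequence block instead of `first_line` gives the value for the whole gene.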