Gene Sde_2645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSde_2645 
Symbol 
ID3968503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSaccharophagus degradans 2-40 
KingdomBacteria 
Replicon accessionNC_007912 
Strand
Start bp3345939 
End bp3347384 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content48% 
IMG OID637921743 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_528117 
Protein GI90022290 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0237988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.24266 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC ATAACATGAA AAATTTTATC AACGGCGAAT ATATAGCTTC ACAAGCTGAT 
GGCGCTATTG ATGTGCTAAG CCCAAGCACC GGTAAAAAGG TAGGCGATAT TCCCGCAGGA
TGTGTAGAGG ATGCGCAGTT GGCGCTGGAT ACAGCCAACG CAGCTCAAAA GCTGTGGGCA
AAAAAAACGA ACAGAGAGCG CGCAAAAATA TTGCGTGTAT TCGCTGCGAA TATTCGTGCG
GCGGCGGATG ATTTAGCCAA GCTGTTAGTG AGCGAGCAGG GTAAATTACT TTCTGTTGCG
CAAATGGAAG TAGAAGCCAC AGCAACGTTT ATAGAATACG CGTGTGATAA CGCGCTTACT
ATAGAGGGCG ATATTTTACC TTCCGATAAC CCCAACGAAA AAATATATAT CCACAAAGTG
CCACGCGGTG TGGTTGTGGC AATTACCGCT TGGAATTTTC CGTTAGCACT GGCGGGCAGA
AAAATAGGCC CAGCACTTGT TACAGGCAAT GCTATCGTGG TTAAGCCAAC CCAAGAAACG
CCACTTGCAA CATTGGCGTT AGGCGAGCTA GCTAATGCTG CGGGTATTCC CGCCGGCGTA
CTCAATATTG TAAACGGCCG TGGCAGTGTT GTTGGGCAGC ACCTGTGCGA AAGCCCAATA
ACCCGCTTAA TAACCATGAC CGGCAGCACC CCTGCTGGGC AGCGTATTTA CCGCACCAGT
GCCGATCATT TAACGCCAGT AATGCTAGAA CTGGGCGGTA AGGCACCATT TATCGTAATG
GAAGATGCCA ACTTAGAAAG CGCAGTAGAG GCGGCATTTA CTACGCGTTA TGCCAATTGC
GGGCAAGTGT GTACCTGTGC CGAGCGCCTG TATGTACACG AATCTATTTA CCCCGCTTTT
ATGGATAAGC TACTTGAGAA GGTGAAAGCA ATAAAAGTGG GCGACCCAAT GGCTGCCGAT
ACCGATATGG GTCCCAAGGT TAATCAAAGC GAAATAGAAA ATATTGATGC GCTGGTTAAG
AAGGGTATTG AGCAAGGCGC AACCTTGCTG CATGGCGGTA AGCGCGCGCA TGTGCCTGGC
TTTGAAGGTG GCAACTGGTA TGAACCCACA CTGCTAGGTG ATGTGCAGCA AAGTAATATT
CTTGTGCACG AAGAAACGTT TGGGCCTATT TTACCTGTAG TTAAAATTAA CAGTATTGAG
CAGGCTATAG AGTACACCAA CGACAGTGAG TATGGCCTTT CAACGTATTT GTTTACGCAA
AACCTTAAAT ATATTCATCA ATATATTGCC GAGGTTGAGG CCGGTGAGGT GTATGTTAAC
CGCGGTATTG GTGAGCAGCA CCAAGGCTTC CACAACGGTT GGAAGCTAAG CGGCGCAGGC
GGTGAAGATG GTCGTTACGG TTTAGAGCAG TACTTAGAGA AGAAGACAGT GTATTTTGCT
GAATGA
 
Protein sequence
MKIHNMKNFI NGEYIASQAD GAIDVLSPST GKKVGDIPAG CVEDAQLALD TANAAQKLWA 
KKTNRERAKI LRVFAANIRA AADDLAKLLV SEQGKLLSVA QMEVEATATF IEYACDNALT
IEGDILPSDN PNEKIYIHKV PRGVVVAITA WNFPLALAGR KIGPALVTGN AIVVKPTQET
PLATLALGEL ANAAGIPAGV LNIVNGRGSV VGQHLCESPI TRLITMTGST PAGQRIYRTS
ADHLTPVMLE LGGKAPFIVM EDANLESAVE AAFTTRYANC GQVCTCAERL YVHESIYPAF
MDKLLEKVKA IKVGDPMAAD TDMGPKVNQS EIENIDALVK KGIEQGATLL HGGKRAHVPG
FEGGNWYEPT LLGDVQQSNI LVHEETFGPI LPVVKINSIE QAIEYTNDSE YGLSTYLFTQ
NLKYIHQYIA EVEAGEVYVN RGIGEQHQGF HNGWKLSGAG GEDGRYGLEQ YLEKKTVYFA
E