Gene Sbal223_3036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3036 
Symbol 
ID7088945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3596238 
End bp3597701 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content50% 
IMG OID643461920 
Productbetaine aldehyde dehydrogenase 
Protein accessionYP_002358944 
Protein GI217974193 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01804] glycine betaine aldehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0986408 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAG AAGTACAGTA CAACTTTATA GCGGGTCAGT TGCTCGCTAA CGATAGCGGC 
GAAACCTTCG ACGTCGTCAA TCCTGCCACT GGGCAATTGG CGTACCGAGT GCAAGTGGCC
GATGAAAAAA TTCAGCAAGC GGCCATTGAG AGCGCCAAAA AAGGCTTTGC GATTTGGTCT
GCCATGAGTG CCACTGAACG CAGTCGTATT CTGTTAAACG CGGTTGCGTT ATTGCGCGAT
CGTAATGATG AACTCGCGGC AATCGAAGTG CGTGACACAG GTAAACCTTG GCAAGAAGCC
TCAGTGGTTG ATGTGGTCAC AGGCGCAGAT GCCATTGAGT TTTTCGCCCA TATTGCCCCT
GGGCTTGAAG GCGCACAGCA GCAAGTCGGT GGAGATTTTT ATTACACCCG TCGTGAGCCC
TTAGGCATTT GTGCTGGCAT TGGCGCGTGG AATTACCCAT TACAAATCGC CTGTTGGAAG
GCCGCTCCTG CCTTAGCCAG CGGCAATGTG ATGATTTTTA AACCTTCGGA AGAAACGCCG
CTTGGCGCAC TGAAATTAGC GCAACTCTTA AGTGAGGCGG GTTTGCCCGA TGGCGTATTT
AACGTCGTGA TGGGCGATGG CAAGGTTGGC GCTTGGTTAA CGGGACATCC GGATATTGCC
AAAGTGTCTT TCACTGGTGA AGTCGGTACT GGCAAAAAAG TCATGGCCGC GGCGGCAAGT
TCATTAAAAG ACGTCACTAT GGAACTCGGC GGTAAATCAC CGCTTATCGT GTTTGACGAT
GCTGATATCG ACAATGCGGT TTCAGGTGCC ATGCTCGGTA ACTTCTACAC CCAAGGTGAA
GTCTGCACCA ATGGGACGCG AGTGTTTGTC CATGAGTCGG TTTATCAAGA CTTTATTGAG
AAGCTGCTCG CCCGTACCCA AGCCAATATT GTGGTGGGCG ATCCTATGGC GCCAGAGACT
AACTTTGGCG CGTTGATTTC TAAAGATCAT CAGCAAAAAG TGCTCGATTA TATTCAGCAA
GGTATCGATG ACGGCGCAAC TCTGCTCACT GGTGGCACAG CGTTAACGCC TGAAAACGCT
CCCAATGGTT ACTTTGTCGC CCCAACCATT TTTACCGATT GCACCGACGA GATGCGTATT
TGCCAAGAGG AAATCTTTGG CCCTGTGATG TCGGTACTCA CCTTTAAAGA TGAAGCCGAA
GTGATTGCTC GGGCAAACAA TACCGCCATG GGCTTAGCTG CTGGGGTGTT CACCCAAGAT
ATCAGCCGTG CACATCGGGT TATTCATCAG TTACAAGCGG GTATTTGCTG GATCAATGCC
TATGGTGCAT CGCCAGCAGA AATGCCTGTA GGTGGGTATA AGTTGTCGGG TATTGGCCGT
GAAAACGGTA GCGAGACACT CAAGCATTAC ACCCAAGTTA AAGCCGTTTA CGTGGGTCTA
CAGCCCTTAG AAAGCCCATT TTAA
 
Protein sequence
MSVEVQYNFI AGQLLANDSG ETFDVVNPAT GQLAYRVQVA DEKIQQAAIE SAKKGFAIWS 
AMSATERSRI LLNAVALLRD RNDELAAIEV RDTGKPWQEA SVVDVVTGAD AIEFFAHIAP
GLEGAQQQVG GDFYYTRREP LGICAGIGAW NYPLQIACWK AAPALASGNV MIFKPSEETP
LGALKLAQLL SEAGLPDGVF NVVMGDGKVG AWLTGHPDIA KVSFTGEVGT GKKVMAAAAS
SLKDVTMELG GKSPLIVFDD ADIDNAVSGA MLGNFYTQGE VCTNGTRVFV HESVYQDFIE
KLLARTQANI VVGDPMAPET NFGALISKDH QQKVLDYIQQ GIDDGATLLT GGTALTPENA
PNGYFVAPTI FTDCTDEMRI CQEEIFGPVM SVLTFKDEAE VIARANNTAM GLAAGVFTQD
ISRAHRVIHQ LQAGICWINA YGASPAEMPV GGYKLSGIGR ENGSETLKHY TQVKAVYVGL
QPLESPF