Gene Sbal223_3204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_3204 
Symbol 
ID7085817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3798810 
End bp3800300 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content52% 
IMG OID643462088 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_002359112 
Protein GI217974361 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.887195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGA TCACTCACTT TGTTAATGGT CAGCACACTC CTGCCAGTAC CAGAACCCAA 
GATATTTTTG AACCAGCCAC GGGTGAACTA CGTGGCCAAG TCTCACTCGC AAGCGAAACT
GAAGTGGGTG AAGCCATCGC AATCGCTAAA ACCGCCTTTG AAACATGGTC GCAGGTGACG
CCGCTCAACC GTGCTCGAGT GCTATTTAAA TTCAAAGCCT TAGTTGAGCA GAACCTAGAT
GAAATGGCGC AGCTTATCAC CCGCGAGCAC GGCAAAGTGA TCGACGATGC TAAGGGCGAG
TTGATCCGTG GTCTCGAAGT GGTCGAGTTT GCCTGTGGTA TTCCGCACTT GCTTAAAGGT
GAACACACCC AGCAAGTCGG TGGCGGGGTC GATTCTTGGT CAGTTAATCA AGCTTTAGGT
GTTGTCGCAG GCATAGCGCC CTTTAACTTC CCCGTGATGG TTCCCATGTG GATGTTCCCA
ATCGCGATTG CCTGCGGTAA CACCTTTATT ATGAAACCCT CAGAAAAAGA CCCAAGCTCG
GTGATGCGTA TTGCCGATCT GCTTAAAGAA GCGGGTCTTC CCGATGGCGT GTTTAACGTG
ATTAACGGCG ACAAAGAAGC CGTCGATACC TTACTCACCC ATAAAGATGT GCAAGCGGTG
AGCTTTGTAG GCTCAACGCC GATTGCCGAA TACATCTACA GCACAGCCTC TAAACATGGC
AAACGCGTGC AAGCCTTAGG CGGCGCGAAA AACCATATGT TACTCATGCC AGATGCGGAT
TTAGATCAAG CCGTTAGCGC CTTAATGGGC GCAGCTTACG GCAGTGCTGG TGAGCGTTGT
ATGGCGATTT CTGTCGTACT TGCGGTAGGC GATGTGGGTG ACGCGTTAGT GGAAAAACTG
CTACCGCAAA TCCAAACCTT AAAAGTCGGC AACGGCCTAA CGCCAGAGAT GGAAATGGGT
CCGCTGATCT CAAAACAGCA CCTTGCCAAG GTCACCCAAT ATGTTGAAGC CGGTGTGCAA
GAAGGCGCAG CGCTGCTGGT CGATGGCCGT AAACTGAGTG TTGAAGATCA TCAACAGGGT
TATTTCCTCG GCGCCTGTTT GTTCGACCAC GTCACGCCTG AAATGAGCAT CTACCGCGAA
GAAATCTTTG GCCCAGTGCT GGCGATTGTG CGCGTGAAAG ATTACCCAAC GGCGCTTGAG
CTGATTAACC AACACGAATT TGGCAATGGC ACGGCGATTT TCACCCAAAG TGGTGAAGCG
GCGCGACATT TTTGCCACCA CGTGCAAGTC GGTATGGTTG GAGTGAACGT GCCGATCCCG
GTGCCAATGG CGTTCCACAG TTTTGGCGGT TGGAAGCGAT CACTCTTTGG GCCGCTGCAT
ATGCATGGTC CAGATGGCGT GCGTTTTTAT ACCAAACGTA AGGCAATTAC TGCCCGCTGG
CCCGTAGGTA AACAGACTCA AGCCGAGTTT GTGATGCCTA CGATGAAATA G
 
Protein sequence
MLKITHFVNG QHTPASTRTQ DIFEPATGEL RGQVSLASET EVGEAIAIAK TAFETWSQVT 
PLNRARVLFK FKALVEQNLD EMAQLITREH GKVIDDAKGE LIRGLEVVEF ACGIPHLLKG
EHTQQVGGGV DSWSVNQALG VVAGIAPFNF PVMVPMWMFP IAIACGNTFI MKPSEKDPSS
VMRIADLLKE AGLPDGVFNV INGDKEAVDT LLTHKDVQAV SFVGSTPIAE YIYSTASKHG
KRVQALGGAK NHMLLMPDAD LDQAVSALMG AAYGSAGERC MAISVVLAVG DVGDALVEKL
LPQIQTLKVG NGLTPEMEMG PLISKQHLAK VTQYVEAGVQ EGAALLVDGR KLSVEDHQQG
YFLGACLFDH VTPEMSIYRE EIFGPVLAIV RVKDYPTALE LINQHEFGNG TAIFTQSGEA
ARHFCHHVQV GMVGVNVPIP VPMAFHSFGG WKRSLFGPLH MHGPDGVRFY TKRKAITARW
PVGKQTQAEF VMPTMK