Gene Ssed_1043 details

Gene Information

Locus tag: Ssed_1043
Symbol:
ID: 5610174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sediminis HAW-EB3
Kingdom: Bacteria
Replicon accession: NC_009831
Strand:
Start bp: 1237318
End bp: 1238808
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 54%
IMG OID: 640931891
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_001472782
Protein GI: 157374182
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
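
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions (the listed values are consistent with that) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1237318, 1238808)
print(length, protein_length(length))  # prints: 1491 496
```

Both results agree with the Gene Length and Protein Length values listed above.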


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0205992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.342071
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGACGA TCACCCATTT CATTAACGGC AGCCATACTG ACACAAGCGA GCGCACGGGT 
CAGGTTTTCG AACCCGCCAC CGGAGAGCAA ACAGCATCGG TATCCTTGGC CAGTGCGGCC
GAAGTTGCCG GCGCTATCGA GTTAGCCAAG AGAGCACATA AGAGCTGGTC TCAGATCTCA
CCACTCAATC GCGCCAGAGT CCTGTTTAAG TTCAAGGCGC TGGTCGAAAA CAATATCGAT
GAGTTAGCAG AGCTTATCAC CCGGGAACAC GGCAAGGTAT TAGATGATGC CAAGGGCGAG
ATCATTCGAG GCCTGGAGGT CGTTGAGTTT GCGTGCGGCA TTCCACATCT GCTTAAAGGC
GAGCACACCG AGCAGGTAGG CACCGGCGTC GATGCCTGGC ATGTGAATCA ATCGCTCGGC
GTCGTGGCCG GTATTGCTCC GTTCAACTTC CCGGTCATGG TTCCCATGTG GATGTTCCCA
ATCGCGATTG CCAGTGGTAA CACCTTTATC ATGAAGCCAT CGGAGAAAGA TCCGAGCTCT
GTGATGCGCC TGGCCGAACT GCTTAGTGAA GCGGGTCTGC CCGATGGTGT GTTCAATGTC
GTAAACGGTG ATAAAGAGGC GGTCGATACC CTGCTGACCC ATAAAGATAT TCAAGCCGTG
AGTTTCGTTG GCTCAACCCC TATCGCCGAA TATATCTATG AGACGGCATC TAAATACGGT
AAACGTGTAC AGGCACTGGG TGGCGCGAAA AACCATATGT TACTCATGCC CGATGCCGAC
TTAGATCAAG CCGTTGGCGC CTTGATGGGC GCAGCCTATG GTTCAGCCGG CGAGCGTTGC
ATGGCGATAT CTGTGGTACT GGCGGTAGGC AACTCCGGCG ATGCACTGGT TGAAAAACTG
CTGCCACAGA TTAAAGCATT GCGCGTGGGT AACGGAGTCA CTCCCGAGAT GGACATGGGG
CCTTTGATCT CCGCGCAGCA TCTGGCCAAG GTCACCGACT ATGTCGAAAC CGGCGTGGCA
GAAGGTGCCA CCCTCCTCGC CGATGGCCGG GAGTTAACGG TTGCAGATCA CGAGCAGGGC
TATTTTCTCG GTGGCTGCCT GTTCGATAAT GTCACCCCTG AGATGACTAT CTACAAGGAG
GAGATCTTCG GTCCGGTGCT GGCCATCGTC AGAGTCGATG ATTACGCCGA AGCGCTCGAA
CTCATTAACG AACATGAATT TGGCAATGGC ACCGCCATCT TTACTCAAAG CGGCGAGGCG
GCGAGACATT TCTGTCATCA CGTTCAGATT GGTATGGTCG GCGTTAACGT GCCGATCCCC
GTTCCTATGG CTTTCCATAG CTTCGGTGGA TGGAAGCGTT CACTGTTTGG TCCGCTACAT
ATGCATGGGC CCGACGGTGT TCGCTTCTAT ACCAAGCGTA AGGCTATTAC TGCCCGCTGG
CCAAAACCAA AGCATGCTCA GGCTGAGTTT GTCATGCCAA CGATGAAGTA A
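
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a rough sketch of how the GC content field could be recomputed from such a listing; `clean_sequence` and `gc_percent` are hypothetical helper names, and the record does not state how its own GC value was derived or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, exactly as laid out in the record.
first_line = "ATGCAGACGA TCACCCATTT CATTAACGGC AGCCATACTG ACACAAGCGA GCGCACGGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full multi-line gene sequence to `clean_sequence` works the same way, since `str.split()` also discards line breaks.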
 
Protein sequence
MQTITHFING SHTDTSERTG QVFEPATGEQ TASVSLASAA EVAGAIELAK RAHKSWSQIS 
PLNRARVLFK FKALVENNID ELAELITREH GKVLDDAKGE IIRGLEVVEF ACGIPHLLKG
EHTEQVGTGV DAWHVNQSLG VVAGIAPFNF PVMVPMWMFP IAIASGNTFI MKPSEKDPSS
VMRLAELLSE AGLPDGVFNV VNGDKEAVDT LLTHKDIQAV SFVGSTPIAE YIYETASKYG
KRVQALGGAK NHMLLMPDAD LDQAVGALMG AAYGSAGERC MAISVVLAVG NSGDALVEKL
LPQIKALRVG NGVTPEMDMG PLISAQHLAK VTDYVETGVA EGATLLADGR ELTVADHEQG
YFLGGCLFDN VTPEMTIYKE EIFGPVLAIV RVDDYAEALE LINEHEFGNG TAIFTQSGEA
ARHFCHHVQI GMVGVNVPIP VPMAFHSFGG WKRSLFGPLH MHGPDGVRFY TKRKAITARW
PKPKHAQAEF VMPTMK
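
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline. It uses the codon assignments of NCBI translation table 11, which match the standard code for every codon; the table's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. `translate` is an illustrative name, and sequences with ambiguous bases such as N would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGCAGACGA TCACCCATTT CATTAACGGC"))  # MQTITHFING
print(translate("GTCATGCCAA CGATGAAGTA A"))  # VMPTMK
```

The two outputs match the first and last residues of the listed protein sequence.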