Gene Sare_3214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3214 
Symbol 
ID5705437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3705012 
End bp3706502 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content71% 
IMG OID641272645 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001538012 
Protein GI159038759 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0475805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCTGG TCGGGCACTT CGTCGACGGC AAGTGGGTCT CGGGCTCCGC GACGCGGCGC 
GGCGACGTCT TCGACCCGGC CACCGGCGTG CGTACCGCGG AGGTCGAGCT GGCCTCCGCC
GCGGAGGTGG CGGCAGCGGT CGAGGCCGCC GACCGGGCCG CGCGGAGCTG GCGAGACGCC
TCCCTGTCCC GGCGCACGGC GGTGCTCTTC GCCTTCCGGG AGCTGGTCCA CGCCCGGCGT
GACCGGCTCG CCGAGGTGAT CACGGCGGAG CACGGCAAGG TGCTCGCCGA CGCCGGCGGC
GAGGTGCAGC GTGGGCTGGA GGTGATCGAG TACGCCTGCG GCATCCCCGC GGCGATGCGT
GGCGGGTTCA GTGAGAACGT GTCAACCGAT GTCGACTCGT ACAGCCTTCG TCAGCCCCTC
GGAGTGGTGG CGGTGATCAG CCCGTTCAAC TTCCCGGTGA TGGTGCCGCT GTGGTTCGTG
CCGGTCGCGG TGGCCACGGG CAACGCGGTG GTGCTCAAGC CCAGTGAGAA GGACCCAAGC
GCGGCGCTGC TGCTGGCGGA GTGGTTCGCC GAGGCGGGCC TGCCACCCGG GGTGTTCAAC
GTGGTCAACG GTGACAAGCA GGCCGTTGAC GCACTGCTGG ATCATCCGGC GGTGCGGGGG
GTGTCGTTCG TCGGTTCTAC GCCGGTCGCC CGGCATGTGC ACCAGCGTGC GTCGTTGGCC
GGCAAGCGGG TGCAGGCCCT CGGTGGGGCG AAGAACCACA TGGTGGTGTT GCCCGACGCC
GATCTGGAAC TGGCCGCCGA CGCCGCCGTC AACGCGGGGT TCGGGTCGGC GGGGGAGCGC
TGCATGGCGA TCTCCGCGCT GGTCGCGGTG GAGCCGGTCG CGGACGCCCT GGTCGCGAAG
ATCGCCGAGC GGACGGCCGG GCTGCGCACG GGCGACGGCC GGCGTGGTTG TGACATGGGC
CCGCTGGTCA CCGCGGCGCA CGCCGAGCGG GTGCGCTCGT ACGTCGAGGC GGGTGTGGCG
GCTGGCGCGG TGCCGGTGGT CGACGGACGG GACGTGCGTC CCGACGGCGA CCCGAACGGC
TACTGGTTGG GGCCGACGCT GCTCGACCGG GTCACCCCGG AGATGTCGGT GTACACCGAC
GAGATCTTCG GACCGGTGCT GTCGGTGCTC CGGGTCGGTT CGTACGACGA GGCCGTCGAC
CTGGTCAACG CCAACCCGTA CGGCAACGGG ACGGCCGTCT TCACCAACGA CGGGGGCGCC
GCCCGGCGCT ACCAGCACGA GGTGGAGGTG GGCATGGTCG GGATCAACGT GCCGATCCCG
GTGCCCATGG CGTACTACTC CTTCGGTGGC TGGAAGGCGT CGCTCTTCGG CGACCTGCAC
GCGCACGGTG AGGACGGGGT GCGTTTCTTC ACCCGGGGCA AGGTGATCAC CAGCCGTTGG
CTGGATCCCC GCACCGGTGG GGTCAACCTC GGTTTCCCCA CCCAGACCTG A
 
Protein sequence
MKLVGHFVDG KWVSGSATRR GDVFDPATGV RTAEVELASA AEVAAAVEAA DRAARSWRDA 
SLSRRTAVLF AFRELVHARR DRLAEVITAE HGKVLADAGG EVQRGLEVIE YACGIPAAMR
GGFSENVSTD VDSYSLRQPL GVVAVISPFN FPVMVPLWFV PVAVATGNAV VLKPSEKDPS
AALLLAEWFA EAGLPPGVFN VVNGDKQAVD ALLDHPAVRG VSFVGSTPVA RHVHQRASLA
GKRVQALGGA KNHMVVLPDA DLELAADAAV NAGFGSAGER CMAISALVAV EPVADALVAK
IAERTAGLRT GDGRRGCDMG PLVTAAHAER VRSYVEAGVA AGAVPVVDGR DVRPDGDPNG
YWLGPTLLDR VTPEMSVYTD EIFGPVLSVL RVGSYDEAVD LVNANPYGNG TAVFTNDGGA
ARRYQHEVEV GMVGINVPIP VPMAYYSFGG WKASLFGDLH AHGEDGVRFF TRGKVITSRW
LDPRTGGVNL GFPTQT