Gene Anae109_1665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_1665 
Symbol 
ID5375429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp1871474 
End bp1872952 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content76% 
IMG OID640843174 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_001378853 
Protein GI153004528 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01722] methylmalonic acid semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.499432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGCA ACGTCTCCCC AGCGCGGGGC CAGCCCCCGG CGCCGCTCCC GCTGCTCGTG 
GACGGCGCGG CCCGTCCCGC CGAGGCCGCC GAGGCGGCGC CCGTCCTCGA GCCGGCCACC
GGCAGGACGC TCGCGCAGGT CCCGCTCTGC GGCCCGGCCG ACGTCGACAC GGCGGTCCGG
GCCGCCGCGG CGGCGTTCCC GGCCTGGCGC GCCACGCCCG TGCCGGAGCG CGTCCAGGTG
CTGTTCCGCT ACAAGGCGCT CCTCGAGCGG GAGCAGGACG CGCTCGCGGC GTCGGTGTCG
CGCGAGAACG GCAAGCTCCT CGCCGACGCG CGCAACGAGG TCCGCCGCGG GATCGAGGTG
GTCGACTTCG CCTGCGGCAT GCCGACGCTC GCGCAGGGCC GGACGGTGGA GGGGATCGCC
CGCGGCGTCG ACTCGCACAC CTGGCGAGTC CCGGTCGGCG TGGTCGCGGG GATCTGCCCG
TTCAACTTCC CGGCGATGAT CCCGCTGTGG ATGTTCCCCA TCGCCATCGC GGCGGGGAAC
ACCTTCGTCC TGAAGCCGTC CGAGCGGACG CCCATGACCG GGCTGCGGCT CGCCGAGCTG
CTGCACGAGG CGGGGCTGCC CCCCGGCGTC CTCGACGTCG TGCACGGGGG TCGCGACGCG
GTGGACGCCC TGCTCGATCA CCCGCTCGTG CGGGCCGTCT CCTTCGTCGG CTCGGAGGGG
GTGGCCCGCC ACGTCTACGC CCGCGCCGCC GCGAACGGCA AGCGCGTGCA GGCGATGGCG
GGCGCGAAGA ACCACCTGCT CGTGCTGCCC GACGCCGATC TGGAGCTCAC CGTCGCGGCG
GTGATGGGCT CCGCGTTCGG CGCGGCCGGC CAGCGCTGCC TCGCGGGCAG CGTGCTCGTC
GCGGTCGACG GCGCCGCGGA GCCGCTGCTC GAGCGGCTCA CCCGCGAGGC ACGCGCGGCG
CGCGTCGGAG ATCCGTTCGC GGCCGACTCC GCCATGGGTC CGGTCATCCG CGAGGACGCG
CGCGACCGCG TGCGGCGCTT CATCGAGACC GGGCTCGCCG AGGGCGCCGC GCTCCTCGTC
GACGGGCGCG AGGCGGCGGC CGTCGGCGAC GGGTACTTCA TCGGGCCGAC CCTCTTCGAC
GGCGTGCGGC CGGAGTCGGC GCTCGCCCGC GAGGAGATCT TCGGCCCGCT CCTCGCCACG
GTGCGGGCGG GGAGCGTCGA GGAGGCGGTC GCGCTCGCCA ACCGCGCGCG CTACGGCAAC
GCCGCCAGCA TCTTCACCTC GAGCGGCCGC GCCGCCGCCT ACTTCCGCCG CAACGTCGAG
GCCGGGATGA TCGGCGTGAA CGTGGGGGTC GCCGCGCCCA TGGCGTTCTT CCCGTTCGCC
GGCTGGAAGA GCTCGTTCTT CGGCGACCTG CACGCCACCG GCGAGGACGC GGTCCGCTTC
TACACGGAGA CCCGGGTGGT CATCGAGCGA TGGGCCTGA
 
Protein sequence
MASNVSPARG QPPAPLPLLV DGAARPAEAA EAAPVLEPAT GRTLAQVPLC GPADVDTAVR 
AAAAAFPAWR ATPVPERVQV LFRYKALLER EQDALAASVS RENGKLLADA RNEVRRGIEV
VDFACGMPTL AQGRTVEGIA RGVDSHTWRV PVGVVAGICP FNFPAMIPLW MFPIAIAAGN
TFVLKPSERT PMTGLRLAEL LHEAGLPPGV LDVVHGGRDA VDALLDHPLV RAVSFVGSEG
VARHVYARAA ANGKRVQAMA GAKNHLLVLP DADLELTVAA VMGSAFGAAG QRCLAGSVLV
AVDGAAEPLL ERLTREARAA RVGDPFAADS AMGPVIREDA RDRVRRFIET GLAEGAALLV
DGREAAAVGD GYFIGPTLFD GVRPESALAR EEIFGPLLAT VRAGSVEEAV ALANRARYGN
AASIFTSSGR AAAYFRRNVE AGMIGVNVGV AAPMAFFPFA GWKSSFFGDL HATGEDAVRF
YTETRVVIER WA