Gene Mflv_0428 details


Gene Information

Locus tag: Mflv_0428
Symbol:
ID: 4971504
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium gilvum PYR-GCK
Kingdom: Bacteria
Replicon accession: NC_009338
Strand:
Start bp: 445699
End bp: 447177
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 68%
IMG OID: 640454633
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_001131710
Protein GI: 145221032
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
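
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes (this is not stated on the page) that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon (assumption)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the final codon is the stop codon and encodes no residue


# Values taken from the record above.
length = gene_length_bp(445699, 447177)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1479 bp) and Protein Length (492 aa) fields.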


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0260686
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.349012
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCAAA GCACTGCCTT CAGAACCGAA TGGGACAAGT TGTTCATCGG CGGCAAGTGG 
GTCGAGCCGG CCTCCTCGGA GGTGATCGAG GTGCGCTCCC CCGCCACCGG TGACGTGGTC
GGCAAGGTGC CGCTGGCCTC GGCCGCCGAC GTCGACGCCG CGTGCGCCGC CGCCCGCGAG
GCCTTCGACA ACGGCCCGTG GCCGCAGATG TCGCCGACCG AGCGCGCCGA GGTGCTGGGC
CGCGCCGTGA AGCTCATGGA GGAGCGCGCC GACGAGCTGA AGTTCCTGCT GGCCGCCGAG
ACGGGGCAGC CGCCGACGAT CGTCGACATG ATGCAGTACG GCGCCGCGAT GTCGTCGTTC
CAGTTCTACG CCGGCGCCGC CGACAAGTTC ACCTGGCAGG ACATCCGCGA CGGCGTGTAC
GGCCAGACCC TGGTCGTGCG TGAGCCGGTC GGCGTGGTCG GCGCTGTCAC CGCGTGGAAC
GTGCCGTTCT TCCTCGCCGC GAACAAGCTC GGCCCGGCGC TGCTGGCCGG CTGCACGGTG
GTGCTGAAGC CTGCTGCCGA GACCCCGCTG TCGGTCTTCG CGATGGCCGA GATGTTCGTC
GAGGCCGGCC TGCCCGAGGG CGTGCTGTCG ATCGTGCCCG GCGGTCCGGA GACCGGTCAG
GCGCTGACCG CCAACCCGAA CCTGGACAAG TACACGTTCA CCGGGTCCTC GGGTGTGGGC
AAGGAGATCG CGAAGATCGC CGCCGACAAG CTCAAGCCGT GCACCCTGGA GCTCGGCGGC
AAGTCCGCCG CGATCATCCT CGAGGACGCC GACCTGGACT CGACGCTGCC GATGCTGGTG
TTCTCGGGCC TGATGAACTC GGGCCAGGCG TGTGTCGGGC AGACCCGCAT CCTGGCGCCG
CGTTCGCGCT ACGACGAGGT CATAGAGAAA CTCGGGGAAG CTGTCCGCAA TATGGCCCCG
GGCCTGCCGG ACAACCCCGC CGCGATGATC GGCCCGCTGA TCAGCGAGAA GCAGCGCGAC
CGCGTCGAGG GTTACATCAA GAAGGGCATC GAGGAGGGCG CGCGCGTCAT CACCGGTGGT
GGCCGCCCCG AAGGCCTGGA CAGCGGCTGG TTCGTCGAGC CGACCGTCTT CGCCGACGTC
GACAACTCGA TGACCATCGC GCAGGAGGAG ATCTTCGGAC CCGTCCTGTC GGTGATCCCC
TACGAGGACG AGGACGACGC GGTCCGTATC GCCAACGACT CGGTGTACGG GCTGGCCGGT
TCGGTGTACA CCACCGACAA CGACCGGGCG CTCAAGATCG CGCGGCGTAT CCGCACCGGC
ACCTACGCGG TGAACATGTA CGCGTTCGAT CCGTGTGCCC CGTTCGGCGG TTACAAGAAC
TCGGGCATCG GCCGGGAGAA CGGCTGGGAG GGCATCGAGG CCTACTGCGA GCAGAAGAGC
ATCCTGCTGC CGTTCGGGTA CACCCCGCCG GCTTCCTGA
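
The gene sequence is printed in space-separated blocks of 10 nucleotides, so it has to be cleaned before any computation. The sketch below shows one way to strip the layout whitespace and compute a GC fraction that can be compared with the GC content field of the record. The helper names are hypothetical, and the sketch assumes the sequence contains only the unambiguous bases A, C, G and T.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example with the last row of the gene sequence; paste the full block to
# compare the result with the GC content field above.
last_row = clean_sequence("ATCCTGCTGC CGTTCGGGTA CACCCCGCCG GCTTCCTGA")
print(len(last_row), round(100 * gc_fraction(last_row)))
```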
 
Protein sequence
MTQSTAFRTE WDKLFIGGKW VEPASSEVIE VRSPATGDVV GKVPLASAAD VDAACAAARE 
AFDNGPWPQM SPTERAEVLG RAVKLMEERA DELKFLLAAE TGQPPTIVDM MQYGAAMSSF
QFYAGAADKF TWQDIRDGVY GQTLVVREPV GVVGAVTAWN VPFFLAANKL GPALLAGCTV
VLKPAAETPL SVFAMAEMFV EAGLPEGVLS IVPGGPETGQ ALTANPNLDK YTFTGSSGVG
KEIAKIAADK LKPCTLELGG KSAAIILEDA DLDSTLPMLV FSGLMNSGQA CVGQTRILAP
RSRYDEVIEK LGEAVRNMAP GLPDNPAAMI GPLISEKQRD RVEGYIKKGI EEGARVITGG
GRPEGLDSGW FVEPTVFADV DNSMTIAQEE IFGPVLSVIP YEDEDDAVRI ANDSVYGLAG
SVYTTDNDRA LKIARRIRTG TYAVNMYAFD PCAPFGGYKN SGIGRENGWE GIEAYCEQKS
ILLPFGYTPP AS
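
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal pure-Python translator, not the database's own method. It builds the codon table from the compact amino-acid string that translation table 11 shares with the standard code. Table 11 also permits alternative start codons, which this sketch does not handle; that is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a block-formatted coding sequence, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence above.
print(translate("ATGACGCAAA GCACTGCCTT CAGAACCGAA TGGGACAAGT TGTTCATCGG CGGCAAGTGG"))
print(translate("ATCCTGCTGC CGTTCGGGTA CACCCCGCCG GCTTCCTGA"))
```

Each row of the gene sequence holds 60 nucleotides, so every row starts on a codon boundary and can be translated on its own. The first row corresponds to the first 20 residues of the protein sequence, and the last row to its final 12 residues.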