Gene Mflv_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_3083 
Symbol 
ID4974404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp3263379 
End bp3264878 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content69% 
IMG OID640457306 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_001134348 
Protein GI145223670 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.36697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCG ATCCCGACCT GTGCATCGGC GGCGCGTGGC GCCATGCCTG CGACGGTGGC 
ACCCGCGACA TCATCAATCC GGCGAACGGC GCGGTGGCGG CGGTCGTGGA CGAGGCGACT
CCCCAGGATG CTCTCGACGC CGTCGCCGCG GCCAGGGCTG CGTTCGACCA GGGAGGCTGG
CGGGCAACGG CGGTCGCCGA ACGGTCGGCT CTGCTGCAGC GGATCGCCGA TCTGCTGCAG
CGCGACAAGG AGGACCTGGC GCGGCTGGAA ACCGTCGACA CCGGCAAGAC CCTGGCCGAG
AGCCGGATCG ACATCGACGA TGTGACGTCG GTGTTCCGCT ACTACGCCAA CCTGGTCGCC
TCCGAGGCCG AGCGGGTGGT CGACGTCGGC GACCCCGCCG TGATCAGCCG CGTGATCCGG
GAGCCGATCG GGGTGTGCGT GCTGATCGCG CCCTGGAACT ACCCGCTGCT GCAGATGTCG
TGGAAGGTGG CCCCGGCGTT GGCGGCGGGG TGCACGATGG TCGCGAAGCC CAGTGAGGTG
ACGCCGCTGT CGACGATCGC CTTCGCGAAA CTGCTCGACG AGGCGGGGGT GCCACCCGGG
GTGGTCAACC TGATCCAGGG CAGCGGTGCG ACGCTGGGCC CGGCGCTGAC CGACAACCCG
CAGGTCGACT TCATCTCGTT CACAGGAGGT CTCGCGACCG GACGCACCAT CGCACGGACG
GCCGCCGAAC ACGTCACCAA GGTCGCCCTC GAGCTCGGCG GCAAGAATCC GCACCTCGTC
TTCGCCGACA TCAGGGACGC GGGCAGGGAA TCCGGGTGGG ACGCAGCCGT CGACCATGTC
CTGACGGGAG TGTTCCTGCA TTCCGGACAG GTGTGCTCGG CCGGCACCCG GCTGATCATC
GAGGAGTCGA TCGCCGACGA GTTCGTCGCC GCCCTCGCGG CCCGCGCCGC GACGATCAGG
ATGGGTGACG GTCTGGACCC CGCCAGCGAG ACCGGTCCGC TGGTCTCCGC GCAGCACCGC
GACAAGGTGG AAAGCTATGT GCGGCTGGGT ATCTCCGAGG GGGCGCAGCT CATCGCCGGG
GGGTGCCGAC CCGAGGACCC GGTGCTCGCC GGCGGCTACT TCTACCGACC CACCATCTTC
GACCGATGTG ACCGCTCGAT GCGGATCGTG CAGGAGGAGA CGTTCGGCCC GATACTGACG
GTGGAACGAT TCACAGACGA GGCCGAGGCG GTCACCCTCG GCAACGACAC GGAATACGGT
CTCGCCGCCG GTGTCCGGAC CACGGACACG GCGCGGGGCG AACGTGTGGT GCGGGCACTG
CGGCACGGCA CGGTGTGGCT CAACGATTTC GGCTACTACA CCGCAGCGGC GGAGTGGGGC
GGATTCGGCA GGTCCGGTAA CGGCCGCGAA CTGGGTCCGA CGGGCCTTGC GGAATACCAA
GAGATCAAAC ATATCTGGCA CAACACCGCC CCCGCTGCCG CGGGTTGGTT CACAGGCTAG
 
Protein sequence
MDGDPDLCIG GAWRHACDGG TRDIINPANG AVAAVVDEAT PQDALDAVAA ARAAFDQGGW 
RATAVAERSA LLQRIADLLQ RDKEDLARLE TVDTGKTLAE SRIDIDDVTS VFRYYANLVA
SEAERVVDVG DPAVISRVIR EPIGVCVLIA PWNYPLLQMS WKVAPALAAG CTMVAKPSEV
TPLSTIAFAK LLDEAGVPPG VVNLIQGSGA TLGPALTDNP QVDFISFTGG LATGRTIART
AAEHVTKVAL ELGGKNPHLV FADIRDAGRE SGWDAAVDHV LTGVFLHSGQ VCSAGTRLII
EESIADEFVA ALAARAATIR MGDGLDPASE TGPLVSAQHR DKVESYVRLG ISEGAQLIAG
GCRPEDPVLA GGYFYRPTIF DRCDRSMRIV QEETFGPILT VERFTDEAEA VTLGNDTEYG
LAAGVRTTDT ARGERVVRAL RHGTVWLNDF GYYTAAAEWG GFGRSGNGRE LGPTGLAEYQ
EIKHIWHNTA PAAAGWFTG