Gene Mvan_5235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5235 
Symbol 
ID4645250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5605975 
End bp5606895 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content67% 
IMG OID639808710 
Productacetaldehyde dehydrogenase 
Protein accessionYP_956012 
Protein GI120406183 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4569] Acetaldehyde dehydrogenase (acetylating) 
TIGRFAM ID[TIGR03215] acetaldehyde dehydrogenase (acetylating) 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.264485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA AAAAGTCTGT GGCGATCGTG GGGTCGGGCA ACATCAGCAC CGACCTGCTG 
TACAAACTGT TGCGTTCGGA GTGGCTGGAG CCGCGCTGGA TGATCGGCAT CGATCCCGAG
AGCGAGGGGT TGGCGCGGGC GCGCAAGCTC GGTCTGGAGA CCTCGCACGA GGGCGTGGAC
TGGCTGTTGG CCCAGAGCGA GTTGCCTGAC ATGGTGTTCG AGGCCACCAG CGCCTATGTG
CACAAGGCCG CCGCTCCGCG CTACGCCGAG GCGGGCATCC GGGCGATCGA CCTCACCCCG
GCCGCGGTCG GGCCCGGCGT GATCCCGCCG GCCAATCTGC GTGCGCACCT GGACGCACCC
AACGTCAACA TGGTGACCTG TGGCGGCCAG GCCACGATCC CGATGGTGTA CGCGGTGTCG
CGCGTCGTGG AGGTGCCCTA CGCCGAGATC GTGGCGTCGG TGTCGTCGGC GTCGGCCGGC
CCGGGGACGC GGGCCAACAT CGACGAGTTC ACCAAGACCA CCAGCGCCGG GGTGCAGAAC
ATCGGTGGCG CCCAGCGGGG CAAGGCGATC ATCATCCTGA ACCCGGCCGA GCCGCCGATG
ATCATGCGCG ACACCATCTT CTGCGCGATC CCCGAGCACG CGGACCATGC CGCGATCACC
CAGTCGATCA AGGACGTGGT GGCCGAGGTG CAGACCTATG TTCCGGGTTA CCGGCTGCTC
AACGAGCCGC AGTTCGACGA GCCGTCGGTG GTCAACGGCG GCAACCATGT CGTGACGGTC
TTCGTCGAAG TGGAGGGTGC GGGCGACTAT CTGCCGCCGT ACGCTGGAAA CCTGGACATC
ATGACCGCGG CAGCGACGAA GGTCGGCGAA GAGATCGCGA AGGAATCTCT CGCCGCTACC
GCTGGAGGAG CACAGGCATG A
 
Protein sequence
MADKKSVAIV GSGNISTDLL YKLLRSEWLE PRWMIGIDPE SEGLARARKL GLETSHEGVD 
WLLAQSELPD MVFEATSAYV HKAAAPRYAE AGIRAIDLTP AAVGPGVIPP ANLRAHLDAP
NVNMVTCGGQ ATIPMVYAVS RVVEVPYAEI VASVSSASAG PGTRANIDEF TKTTSAGVQN
IGGAQRGKAI IILNPAEPPM IMRDTIFCAI PEHADHAAIT QSIKDVVAEV QTYVPGYRLL
NEPQFDEPSV VNGGNHVVTV FVEVEGAGDY LPPYAGNLDI MTAAATKVGE EIAKESLAAT
AGGAQA