Gene Mvan_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4156 
Symbol 
ID4648915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4467250 
End bp4468764 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content67% 
IMG OID639807623 
Productaldehyde dehydrogenase 
Protein accessionYP_954939 
Protein GI120405110 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.349065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGAAA CAGCGACGAT CTCGGATGAG CGCAGCGGTG CTGGGCAGGC AGCGCAGCGC 
GCGGAACGGC ACCTCCTGAT CGGGGGGCAA CTCCTCGAGA CCGAGCGGAC GTTTCCTTCC
CTCAACCCGG CGACCGGTGA GGTGCTCGGA TATGCGCCTG ACGCCACCGT CGCCGATGCG
GAGGCAGCCG TGGCGGCGGC GCGTCGCGCG TTCGACGCGG GCGTCTGGGC GACCGACACC
GAGCTGCGGA TCCGCTGCCT GGAGCAGTTC CACGCGGCGC TGGTCGAGCA TCGCGATGAG
CTGGCCGCCC TGACCGTCGC CGAGGTCGGC GCCACCCCGG CCCTGTGCCA GGGTGCTCAA
CTCGACCAGC CGATCGAGAT CGTCCGGTAC TACGCAGACC TGCTGAAGAC GTATCCGCTC
ACCGAGGACC TCGGCAACAT CGAGAGCCGG GGGATGCAGC ACCACCGATG GGTGGAGAAA
GAAGCGGGCG GCGTGGTGGC CGCGATCATC GCCTACAACT ATCCCAACCA GTTGGCGCTG
GCCAAGCTCG CGCCGGCGCT GGCCGCCGGC TGCACCGTGG TGCTCAAGTC AGCTCCCGAC
ACCCCGCTGG TCACACTGGC GCTCGGTGAG TTGATCGCCA AGCACACCGA CATCCCGGCC
GGCGTCGTCA ACGTGCTCTC CGGTGCGGAC CCGGCTGTCG GCGCGGCGCT GACCACCAGC
CCGGACGTCG ACATGGTGAC GTTCACCGGG TCGACCCCCA CCGGACGCGC GATCATGGCC
GCCGCGAGCG GAACCCTGAA GAAGGTGTTC CTCGAACTCG GCGGGAAATC GGCCGCCATC
GTCCTCGACG ACGCCGACTT CAACACCGCG GCAATGTTCT CGGCGTTCAG CATGGTCACC
CACGCCGGTC AGGGCTGCGC GTTGACCTCG CGTCTGCTGG TTCCCAAGAA GCACAAGGAC
GAGCTCGTCG AGCTGGTCAA GAACAATTTC GGTCTCGTCC GTTACGGGGA CCCCACCGCG
GCGGGCACTT ATATGGGCCC GCTGATCAGC GAGAAGCAGC GCGACAAGGT CGACGGGATG
GTCAAGCGCG CCGTCGAGGC CGGTGCCACG TTGGTGACCG GTGGCGAGAA GGTCGACCCC
GGTTATTTCT ACACGCCGAC GCTGCTGGCC GACGTCGACC CGGACAGCGA AATCGCCCAG
GAAGAGGTGT TCGGACCGGT CCTGGTCGTG ATCGCCTACG AGGACGACGA CGATGCGGTG
CGGATCGCCA ACAACTCGAT CTACGGGCTC TCGGGTGCGG TGTTCGGCAG CCAGGACCGT
GCGCTCGCGG TGGCGCGCCG AATCCGTACC GGCACCTTCT CCATCAACGG CGGCAACTAC
TTCGCCCCTG ACAGCCCGTT CGGTGGCTAC AAGCAGTCGG GTATCGGCCG CGAGATGGGC
AGGGCCGGAC TCGAAGAGTT CCTGGAGTCC AAGACATACG CAGTGGTCGT GCCGGAGGCG
GGGGGACAAG CATGA
 
Protein sequence
MQETATISDE RSGAGQAAQR AERHLLIGGQ LLETERTFPS LNPATGEVLG YAPDATVADA 
EAAVAAARRA FDAGVWATDT ELRIRCLEQF HAALVEHRDE LAALTVAEVG ATPALCQGAQ
LDQPIEIVRY YADLLKTYPL TEDLGNIESR GMQHHRWVEK EAGGVVAAII AYNYPNQLAL
AKLAPALAAG CTVVLKSAPD TPLVTLALGE LIAKHTDIPA GVVNVLSGAD PAVGAALTTS
PDVDMVTFTG STPTGRAIMA AASGTLKKVF LELGGKSAAI VLDDADFNTA AMFSAFSMVT
HAGQGCALTS RLLVPKKHKD ELVELVKNNF GLVRYGDPTA AGTYMGPLIS EKQRDKVDGM
VKRAVEAGAT LVTGGEKVDP GYFYTPTLLA DVDPDSEIAQ EEVFGPVLVV IAYEDDDDAV
RIANNSIYGL SGAVFGSQDR ALAVARRIRT GTFSINGGNY FAPDSPFGGY KQSGIGREMG
RAGLEEFLES KTYAVVVPEA GGQA