Gene Mvan_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4901 
Symbol 
ID4648834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5251231 
End bp5252313 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content61% 
IMG OID639808372 
Productalcohol dehydrogenase 
Protein accessionYP_955680 
Protein GI120405851 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCATT CAGGAACCCA CAAAGTCAAG GAGACAGCAA TGAAGGCAAT GGTGTATCAC 
GGCGATGGAA AGGCTTCATG GGACGACGTA CCGGATGCCG TCCTCGTCGA TCCGACCGAC
GCGGTGGTTC GGGTCGACGC CGTCACGATC TGCGGGACTG ACCTGCATAT CCTACGTGGC
CATGTCCCCA CCGTGGACAG GGGACGAATC CTCGGCCACG AGGCGGTGGG AACCGTGACC
GCGATCGGTT CAGGCGTGCG GCAGCTTGCG GTCGGTGACC GTGTGTTGAT CTCCTGCATC
AGTTCGTGCG GGAGTTGCCG ATACTGCCGT CGCACGAGCT ATGGCCAATG CAGCGGAGGA
GGTGGTTGGA TCCTGGGGAA TCGCATCGAC GGAACTCAGG CTGAGTTCGT GCGAGTTCCG
TTTGCGGACA ATTCGACACA TCGGGTTCCG GACGGCGTGA GCGACGAAAA CATGATCACG
CTGGCGGATT TGCTTCCGAC CGGGTACGAA GTGGGAGCCA TCAACGGCAG AGTCCGGCCG
GCGGACACAG TCGTTGTCGT GGGTGCCGGG CCGATCGGCC TTGCGGCGAT CATGACGTCC
CAGTTGTTCA GCCCCAGCCG CATCGTGGCC ATCGACCTTG CCGACAGCCG ACTGGATGCT
GCCCGCAAGT TCGGTGCAGA CATCGTGATC AATCCCGACC GCCTAGACCC GGTTGCGGCG
ATCGCCGACT TGACAGGCGG ATTGGGTGTT GACGCGGCCA TGGAAGCAGT CGGGACGGCC
GCAACGTTCG AACTTGCCGT GCAACTCGTC CGTCCGGGCG GACACGTCGC CAACATCGGG
GTGCACGGCG GGCCGGCAAC ACTTCATCTC GAAGACATCT GGATCAGGAA TCTCACCATC
ACCACAGGCC TCGTCGACAC CTATTCGACA CCGACCCTTG TCGACCTTGT CGCCGCGCAC
AGACTCGATA CATCCGCCCT GGTGACGCAC CGCTACCCCT TGGACGAATT CGAGCGCGCC
TATCACGAAT TCAGTAATGC CGGCGAAACG GGAGCACTCA AAGTTCTACT GACACAGAAC
TGA
 
Protein sequence
MFHSGTHKVK ETAMKAMVYH GDGKASWDDV PDAVLVDPTD AVVRVDAVTI CGTDLHILRG 
HVPTVDRGRI LGHEAVGTVT AIGSGVRQLA VGDRVLISCI SSCGSCRYCR RTSYGQCSGG
GGWILGNRID GTQAEFVRVP FADNSTHRVP DGVSDENMIT LADLLPTGYE VGAINGRVRP
ADTVVVVGAG PIGLAAIMTS QLFSPSRIVA IDLADSRLDA ARKFGADIVI NPDRLDPVAA
IADLTGGLGV DAAMEAVGTA ATFELAVQLV RPGGHVANIG VHGGPATLHL EDIWIRNLTI
TTGLVDTYST PTLVDLVAAH RLDTSALVTH RYPLDEFERA YHEFSNAGET GALKVLLTQN