Gene Mvan_1920 details


Gene Information

Locus tag: Mvan_1920
Symbol:
ID: 4648011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 2048639
End bp: 2050141
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 70%
IMG OID: 639805407
Product: putative short chain dehydrogenase
Protein accession: YP_952746
Protein GI: 120402917
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
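
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2048639
end_bp = 2050141

# An inclusive span contains one more base than the plain difference.
gene_length = end_bp - start_bp + 1

# Every codon is three bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1503 500
```

Both values agree with the Gene Length (1503 bp) and Protein Length (500 aa) fields.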


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGAAC TTATTCGAAC GTCGGGGATC CCGTCACCCC TGCTAGTGGA GATGGGGTTC 
ATGGAGAGCC CGTACGCTTC CCGGGTGAAC GCGATCGATC CGGACAAGCT CATCACCTGT
CTGCAGGTGC TGGCCGACAT CGAGGCCCTG CCTCCCGAGC ACCCCGATGC CGTCGCTGTC
CGCCGGGCCA CGGCCGGCAT CTTCAAATCG GTGAGAAAGG CTCGTCGCGC CGCCAAGCGC
GACGCCGTGG CCGCCGCCGA CGACGCCATC ACCGCGGCCA CCGCCACCGG TGCGCCCGGC
CGCATCGACG ACGAGACGCA GGGTCTGCCG CTGGTGTCCA CCACCGTCGG CGCCACCGCC
GGCACGCTGC TGCGTCCGCG CGCCTGCTAC ATCTGCAAGA ACCGGTACAC CGTGGTCGAC
GCGTTCTACC ACCAACTCTG CCCCGACTGC GCTGCCCTGA ACCGGGCCAA GCGCGACGCC
CGCACCGACC TCACCGGCAG GCGCGCCCTG CTCACCGGCG GTCGCGCCAA GATCGGCATG
TACATCGCCC TGCGACTGCT GCGCGACGGC GCCCACACCA CCATCACCAC CCGCTTCCCC
AACGATGCCG TGCGCCGTTT CGCAGCGATG CCCGACAGCG CCGACTGGCT GCACCGACTG
CGGATCGTCG GTATCGACCT GCGGGACCCG GCGCAGGTGG TCGCCCTCGC CGACGCCGTG
GCCGCGCAGG GCCCGCTGGA CATCCTGATC AACAACGCCG CGCAGACCGT GCGCCGGTCG
CCCGGTGCCT ATGCGGCACT CGTCGAGACG GAACGCACGC CGCCGCCGGA GATCGTGGAC
GTGCTGACGT TCGACCGGGT CAGCGACGCC CACCCGGCCG CGCTCGCCGG CAGCCTGGCC
GCAAACCCCA CTCCGCACCA GGTGGCCGAG CTGGCGCTGA CTGCCCGCAG CGCCTCCCCG
GACCGGATCG CCGCGGGCAC CGCCATCGAC GCGGGCGGCC TGCTGCCCGA CACGGCCCCG
GTGAACAGCT GGACCCAGCG GGTCCACGAG GTCGACGCGA TGGAACTGCT GGAGGTGCAG
CTGTGCAACC AGACCGCGCC GTTCATCCTG GTGAGCCGGC TGCGCCCGGC GATGGCCGCC
GCACCTGCGC GTCGCACCTA CGTCGTGAAT GTCTCCGCGA TGGAGGGTCA GTTCAGCCGG
GCATACAAGG GCCCGGGTCA TCCGCACACC AACATGGCCA AGGCCGCGCT GAACATGCTG
ACCCGCACGA GCGCCGGTGA GATGTTGGAG CGCGACGGCA TTCTGATGAC CGCCGTGGAC
ACCGGCTGGA TCACCGACGA GCGCCCGCAC CCGACGAAGC TGCGGCTCGC AGAGGAGGGG
TTTCACGCCC CGCTGGACCT GGTCGACGGG GCTGCGCGCG TGTACGACCC GATCGTGCGC
GGCGAGGCCG GCGAAGATCT GCACGGCTGC TTTTTGAAGG ACTACTCGCC GTCCAACTGG
TAG
 
Protein sequence
MSELIRTSGI PSPLLVEMGF MESPYASRVN AIDPDKLITC LQVLADIEAL PPEHPDAVAV 
RRATAGIFKS VRKARRAAKR DAVAAADDAI TAATATGAPG RIDDETQGLP LVSTTVGATA
GTLLRPRACY ICKNRYTVVD AFYHQLCPDC AALNRAKRDA RTDLTGRRAL LTGGRAKIGM
YIALRLLRDG AHTTITTRFP NDAVRRFAAM PDSADWLHRL RIVGIDLRDP AQVVALADAV
AAQGPLDILI NNAAQTVRRS PGAYAALVET ERTPPPEIVD VLTFDRVSDA HPAALAGSLA
ANPTPHQVAE LALTARSASP DRIAAGTAID AGGLLPDTAP VNSWTQRVHE VDAMELLEVQ
LCNQTAPFIL VSRLRPAMAA APARRTYVVN VSAMEGQFSR AYKGPGHPHT NMAKAALNML
TRTSAGEMLE RDGILMTAVD TGWITDERPH PTKLRLAEEG FHAPLDLVDG AARVYDPIVR
GEAGEDLHGC FLKDYSPSNW
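
The gene sequence begins with GTG, yet the protein sequence begins with M. This matches how translation table 11 (the bacterial code) is commonly applied: elongation codons follow the standard genetic code, while an alternative start codon such as GTG is read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene sequence. The `translate` helper and the `START_CODONS` set are hypothetical names introduced for illustration, and the set lists only three common bacterial start codons (an assumption, not the full table 11 definition).

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a partial list of start codons; only GTG matters for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A recognized start codon in the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as formatted in the record.
first_30 = "GTGTCCGAAC TTATTCGAAC GTCGGGGATC"
print(translate(first_30))  # prints: MSELIRTSGI
```

The result matches the first ten residues of the protein sequence. Passing the full gene sequence to the same helper should stop at the terminal TAG codon.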