Gene Mvan_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2006 
Symbol 
ID4645329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2143726 
End bp2144574 
Gene Length849 bp 
Protein Length282 aa 
Translation table11 
GC content67% 
IMG OID639805491 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_952829 
Protein GI120403000 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.512917 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGC TGGACAACAA AGTTGCGGTG GTCTCGGGCG CGGCGCGGGG ACAGGGGCGC 
TCGCACGCGG TGAGCCTGGC GGCCGAGGGC GCCCACATCA TCGCGCTCGA CATCTGCGCC
GACCTGGAGG GCAACACCTA TCCGCTGTCG CGCCCGGAGG ATCTGGACGA GACGGCGCGG
CTGGCCGAGA AGGAAGGGGT GCGCGTACAC ACGGCGATCG TCGACGTACG CGAGCGGGCC
GCGGTGTGGA AGGCCGTCGC CGACGGGGTC GACGCGTTGG GCCGCCTCGA CATCGTGGTC
GCCAACGCCG GCATCTGTCC GATGGGCCGC GGCCAGACGA TCCAGTCATG GTCGGACGCC
ATCGACACCG ACCTGGTCGG CGTGTTCAAC GTGATCCAGG CCAGCCTGCC GCACGTCAAC
GACGGCGCGT CGATCATCGC CACCGGCTCG CTGGCCGCTC AGTTGGGCAG TGCCACCAAC
CAGGGGCCGG GCGGTTCGGC ATACAGCCTG GCCAAGCAGG TTGTGGCGCA CTACGTCAAT
GACCTCTCGA TTCAGTTGGC CAAGAGGATG ATCCGGGTCA ACGCGATCCA TCCGACGAAC
GTCAACACCG ACATGCTGCA CAACGAGGGC CTGTACAGAG TGTTCCGACC GGACCTCAAG
GAGCCGACCC GCGAGGAGGC CGAGGAGGCG TTCCCGGCGA TGCAGGCCAT GCCGATTCCG
TACATCGAGC CGCGGGATGT GTCGAATGCG GTGGTGTTCC TGGCCGGCGA CGACTCGCGT
TACATCACCG GGACCCAGCT GCGCATCGAT GCCGGCGGCT ACGTCAAGGC CGTGCCCTGG
AAGGGGTGA
 
Protein sequence
MGKLDNKVAV VSGAARGQGR SHAVSLAAEG AHIIALDICA DLEGNTYPLS RPEDLDETAR 
LAEKEGVRVH TAIVDVRERA AVWKAVADGV DALGRLDIVV ANAGICPMGR GQTIQSWSDA
IDTDLVGVFN VIQASLPHVN DGASIIATGS LAAQLGSATN QGPGGSAYSL AKQVVAHYVN
DLSIQLAKRM IRVNAIHPTN VNTDMLHNEG LYRVFRPDLK EPTREEAEEA FPAMQAMPIP
YIEPRDVSNA VVFLAGDDSR YITGTQLRID AGGYVKAVPW KG