Gene Mvan_0840 details


Gene Information

Locus tag: Mvan_0840
Symbol:
ID: 4646173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 874085
End bp: 875191
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 68%
IMG OID: 639804340
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_951684
Protein GI: 120401855
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
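
The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(874085, 875191)
print(length)                     # 1107
print(protein_length_aa(length))  # 368
```

Both results agree with the Gene Length and Protein Length values listed in the record.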


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.626315
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCCG AGGCCCGGTC TGGAGGCCGT TCCGGGGCCG ACGGCTCGGA CACCCGTGAC 
GATCTGAACT ACCCGAAGGT CGTGCTGGTC ACCGGCGCGT GCCGGTTCCT GGGTGGATAC
CTGACCGCAC GGCTGGCGCA GAACCCCTTG ATCAATCACG TGATCGCGGT CGACGCGATC
GCGCCGAGCA AGGATCTGCT GCGCAGGATG GGTCGCGCGG AGTTCGTCCG GGCCGACATC
CGCAACCCCT TCATCGCCAA GGTCATCCGC AACGGTGACG TCGACACCGT CGTGCACGCC
GCGGCGGCGT CGTACGCGCC CAGGTCGGGC GGCCGCGCGA CGCTGAAGGA ACTGAACGTG
ATGGGCGCGA TTCAGCTGTT CGCGGCCTGC CAGAAGGCGC CTTCGGTGCG GCGGGTGATC
CTCAAGTCGA CGTCGGAGGT GTACGGGTCC AGTTCACGGG ACCCGGTGCT GTTCTCCGAG
AGCAGCAGCC GCCGCAGGCC GCCCGGTGAG GGTTTCGCCA GGGACAGCAT CGACATCGAG
GGTTACGCGC GTGGCCTGGG CCGGCGCCGA CCGGATATCG CGGTGACGAT CCTGCGGTTG
GCCAACATGA TCGGCCCTGC AATGGACACG GCGCTGTCGC GGTATCTGGC GGGTCCGGTC
GTGCCGACGA TCATCGGGCA CGATCCGCGC CTGCAGTTGC TGCACGAGCA GGACGCGCTC
GGTGTGCTGG AGCGCGCGAC GATGGCGGGC AAGGCGGGCA CGTTCAACGT GGGGGCGTCC
GGGGTCATCA TGATGAGCCA GGCGATACGG CGGTCGGGAC GGCTGGCGCT GCCCGTTCCG
CGTTCGGTGT TGGTGGCGGT GGATTCGCTG TGGCGCGCCA CCCGTAACAC CGAACTGGAT
CGAGAGCAGC TCGACTACCT CAGCTACGGC CGCGTCATGG ACACCACCAG GATGCGCACC
GAGCTGGGCT ACACGCCGAA GTGGACGACG GCGGAAGCCT TCGACGACTA TGTGCGCGGA
CGTGGCCTGA CACCGATCGT GGACCCCGAC TGGATCCGGT CGGTGGAGAA TCGCGCGGTC
GCGGCGGCGC AGCGCTGGGG ACGGTAA
 
Protein sequence
MDSEARSGGR SGADGSDTRD DLNYPKVVLV TGACRFLGGY LTARLAQNPL INHVIAVDAI 
APSKDLLRRM GRAEFVRADI RNPFIAKVIR NGDVDTVVHA AAASYAPRSG GRATLKELNV
MGAIQLFAAC QKAPSVRRVI LKSTSEVYGS SSRDPVLFSE SSSRRRPPGE GFARDSIDIE
GYARGLGRRR PDIAVTILRL ANMIGPAMDT ALSRYLAGPV VPTIIGHDPR LQLLHEQDAL
GVLERATMAG KAGTFNVGAS GVIMMSQAIR RSGRLALPVP RSVLVAVDSL WRATRNTELD
REQLDYLSYG RVMDTTRMRT ELGYTPKWTT AEAFDDYVRG RGLTPIVDPD WIRSVENRAV
AAAQRWGR
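
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of removing that spacing and computing a GC fraction of the kind summarized in the GC content field; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGGATTCCG AGGCCCGGTC TGGAGGCCGT TCCGGGGCCG ACGGCTCGGA CACCCGTGAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` in the same way gives a value that can be compared with the GC content reported in the record.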