Gene Mvan_4227 details

Gene Information

Locus tag: Mvan_4227
Symbol:
ID: 4645912
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4534636
End bp: 4535706
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 68%
IMG OID: 639807694
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_955010
Protein GI: 120405181
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
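
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon that is not counted in the protein length. The helper names `span_length` and `codon_count` are hypothetical and are not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 4534636
end_bp = 4535706
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return length_bp // 3


# Coordinates agree with the reported gene length.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp

# Assumption: the final codon is a stop codon, so it adds no amino acid.
protein_ok = codon_count(gene_length_bp) - 1 == protein_length_aa

print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions the record is internally consistent: 1071 bp corresponds to 357 codons, of which 356 encode amino acids.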


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.77425
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGGTA CATCCGGTAC CGTGCTCGTC ACCGGAGGCT TCGGGCTCGT CGGCTCGGCA 
ACGGTCCGTC GACTTGTCGA GTTGGGCCGC AGCGTCGTGG TCGCCGATCT CGACACCCCG
GCCAACCGCG CGTCGGCCGC TCAGCTGCCT GCCGGCGTCA CGGTCCGCTG GACCGATCTC
ACCGACGCCG AACAGACTTC CGCATTGGTT TCCGAGGTCG CGCCCGCGGT GATCATCCAC
CTCGCGGCGA TCATCCCGCC GGCGATCTAC AAGAATCGCG CCCTCGCCCG GCGCGTCAAC
GTCGAAGCGA CCGCGACGCT CGTGCGTATC GCGGAGGCTC AGCCCACTCC CCCGCGTTTC
GTCCAGGCGT CCAGCAACGC GGTGTACGGC GCACGCAACC CGTACAAGTC GGCCGGTCCG
GTCACCGCCG ACATGCCGAT GAAGCACTCC GATCTCTACA GCGCGCACAA GGCCGAGGCC
GAGGCGATCG TGCGCGCCTC GTCGCTGGAG TGGGTGGTGC TACGTCTGGG CGGGGTGCTC
AGCACGGATC CCAACGCCAT TCCGTTCAGC GCGGATGCGC TGTACTTCGA GAGCGTGCTT
CCCGCTGACG GCCGAATACA CACGGTCGAT GTGCGCGATG TGGCATGGGC TTTCGCCGCG
GCGACGACGG CCGATGTGGC TCGTGAGATC CTGTTGATCG CCGGCGACGA CTCGCATCGC
GTGCTTCAAG GTGACGTCGG CCGCGCGCTG GCCGAATCGC GCGGCCTCAA GGGTGGCCTG
GTGCCGGGCC GCAACGGCGA CCCCAACAGC GACGAGAACT GGTTCGTCAC CGACTGGATG
GACACCCGCC GCGCGCAGGA AGCCCTACAG TTCCAGCACT ATTCGTGGCA GAACATGCTC
GATGAGGCCC AGCGGCGTGC CGGCGCCTCG CGCTATGTGC TGCCGGTGTT CGCGCCGCTG
GTGCGGGCAG TTCTCAAGCG GCGCTCGGCC TACTGGAAGC AGCCCGGCCA GTACGCCGAT
CCGTGGGGCG CGATCAAGCG CGGGATCGGC GACCCGTCGC CCGATTCGTA G
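
A GC content figure such as the one in the Gene Information section could be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content is the fraction of G and C bases over all bases in the sequence. The record does not state the database's exact method or rounding, and the function name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace and letter case."""
    # The listing groups bases in blocks of ten, so strip all whitespace first.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: the first line of the gene sequence; paste the full listing
# to compute the value for the whole gene.
first_line = "ATGTCGGGTA CATCCGGTAC CGTGCTCGTC ACCGGAGGCT TCGGGCTCGT CGGCTCGGCA"
print(f"{gc_content(first_line):.0%}")
```

Because whitespace is removed before counting, the sequence can be pasted directly with its ten-base grouping and line breaks intact.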
 
Protein sequence
MSGTSGTVLV TGGFGLVGSA TVRRLVELGR SVVVADLDTP ANRASAAQLP AGVTVRWTDL 
TDAEQTSALV SEVAPAVIIH LAAIIPPAIY KNRALARRVN VEATATLVRI AEAQPTPPRF
VQASSNAVYG ARNPYKSAGP VTADMPMKHS DLYSAHKAEA EAIVRASSLE WVVLRLGGVL
STDPNAIPFS ADALYFESVL PADGRIHTVD VRDVAWAFAA ATTADVAREI LLIAGDDSHR
VLQGDVGRAL AESRGLKGGL VPGRNGDPNS DENWFVTDWM DTRRAQEALQ FQHYSWQNML
DEAQRRAGAS RYVLPVFAPL VRAVLKRRSA YWKQPGQYAD PWGAIKRGIG DPSPDS
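
The protein sequence can be checked against the gene sequence by translating it. This is a minimal sketch, assuming translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and a library such as Biopython could be used instead.

```python
# Codon-to-amino-acid assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    # Step through whole codons; a trailing partial codon is ignored.
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage: the first 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGTCGGGTA CATCCGGTAC CGTGCTCGTC"))  # prints: MSGTSGTVLV
```

Translating the complete gene sequence in the same way should yield the 356-residue protein listed above, with the final TAG codon read as the stop.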