Gene Mvan_1030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1030 
Symbol 
ID4644251 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1081405 
End bp1082346 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content68% 
IMG OID639804531 
Product5-dehydro-4-deoxyglucarate dehydratase 
Protein accessionYP_951874 
Protein GI120402045 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.113729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.255193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACCGA CCTCGCGTCC TGATTTCCGC AGCGTTCTGT TCTTCCCCGT CACCCCGATG 
ACCCCCTCCG GTGCCGTCGA CCTCGACGCG CTGGCCCGCC ACATCGCCAG GGGTGTGGAC
GCCGGCCCAG GTGGCGTGTT CATCGCCTGT GGCACAGGGG AATTTCATGC CCTGGAGGAC
AACGAGTTCG GTGACGTGGT GCGGACCGCT GTCGACGTGG TGGCCGGACG CGTTCCGGTG
TACGCCGGCG CGGGCGGCGC GGTGGCACAG GCCAAACGGT TCGCCCTGGC CGCCGAGGAG
TCCGGAGCTG ACGGCCTGCT GCTGATGCCG CCCTACCTCG TCGAGGTGCC GCAGGCCGGC
CTGGTCGACT ACACCCGCGC CGTCGCCGAT ACCACGGACC TGCCCGTCAT CGTCTACAAC
CGCAACAACG CGCGGTTCAC CGAAGAGTCC GCCGTTGCCG TCGCCGAGAT CCCCAATGTG
ATCGGATTCA AGGACGGCAC CGGCAATTTC GACCTGGTAG CCCGCATCGT GCAGGCCGTC
AAGACCAACG TCGACCCCGA CTTCCTGTTC TTCAACGGAC TGCCCACCGC AGAGACCACA
CAGCTGGCGT ACCGCGCCAT CGGGGTGCCG CTGTACTCGT CGGCCACGTT CGCCTTCGCC
CCCGACCTGG CGCTGGCGTT CTACAACGCG CTGGACTCCG GCAACGAACA GCTCGCCGAG
GCACTGCTGA ACGCGTTCTT CATCCCGCTG GTGCGACTGC GCGACACCGT GCCCGGCTAC
GCCGTGTCAT TGGTCAAGGC CGGCGTCACG ATGGAAGGCA TCCCGGCCGG ACCGGTGCGC
CCACCTCTGG TGATGCCCGG CACCGACGAT CTGACCGAGC TCGCCGCGAT CGTCAAGGCC
GGCCGCGCGG TGCTGTCCGG CGCGCTCGCC CAGGCAGTCT GA
 
Protein sequence
MRPTSRPDFR SVLFFPVTPM TPSGAVDLDA LARHIARGVD AGPGGVFIAC GTGEFHALED 
NEFGDVVRTA VDVVAGRVPV YAGAGGAVAQ AKRFALAAEE SGADGLLLMP PYLVEVPQAG
LVDYTRAVAD TTDLPVIVYN RNNARFTEES AVAVAEIPNV IGFKDGTGNF DLVARIVQAV
KTNVDPDFLF FNGLPTAETT QLAYRAIGVP LYSSATFAFA PDLALAFYNA LDSGNEQLAE
ALLNAFFIPL VRLRDTVPGY AVSLVKAGVT MEGIPAGPVR PPLVMPGTDD LTELAAIVKA
GRAVLSGALA QAV