Gene Mvan_1501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1501 
Symbol 
ID4645394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1588950 
End bp1590005 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content69% 
IMG OID639804999 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_952339 
Protein GI120402510 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.66157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.410286 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGG CGCAGACCGC CACCCCGCCC GCGACGTCGG ACCGCCGGAT CCGCAGTTTC 
GGCGAGATTC CCAGCCCACA CGCGGTGTCG ACCGAATTCC CGTTGGGTGC CCGCCGTGCC
GAGCGGGTGG CCCGCGACCG CGACGAGATC GCCGACATCC TCGCGGGGCG GGACGACCGT
CTGCTGGTGG TCGTCGGGCC GTGCTCGGTG CATGACCCTG CCGCCGCGCT GGAGTACGCC
GGCCGGCTGG TCAAGATCGC CGCCGAGCTC AAGGACAGCC TCAAGATCGT GATGCGGGTG
TACTTCGAGA AGCCGCGCAC CACGATCGGT TGGAAAGGCC TGATCAACGA TCCGGGGATG
GACGGCACCT TCGACGTCGC GCGGGGCCTG CGCATCGCCC GGCAACTGCT GCTGGACATC
ATCGACATCG GGCTCCCGGT GGGGTGTGAA TTCCTCGAGC CGACCAGCCC GCAGTACATC
GCCGACGCCG TGGCGTGGGG TGCGATCGGC GCCCGCACCA CCGAATCGCA GGTGCACCGT
CAACTTGCTT CGGGCCTGTC GATGCCGGTC GGCTTCAAAA ACGGAACCGA CGGCAACATT
CAGGTCGCCG TCGACGGCGC GAAATCCGCT GCCGCCCAAC ATGTGTTCTT CGGCATGGAC
GACATGGGCC GCGGCGCCGT GGTGAGCACC GAAGGTAACA GGGACTGCCA TGTCATCCTG
CGGGGAGGTA CCGGCGGACC GAACTGGGAC GCCGAGTCGG TGCGCTCGGC GGCCGACAAG
CTCGAGAGCG CGGGACTGCC CGGCCGGGTG GTGATCGACT GCAGCCACGC GAATTCCGGT
AAGGACCACG TGCGGCAGGC GAGCGTAGCC GCGGAGGTGG CGCAGCTGGT GCGGGACGGC
CTTCCGGTCA GCGGAGTCAT GCTGGAGAGC TTCCTGGTCG CCGGGGCACA GGCTCCCGAG
GCGCGTCCGC TGACCTACGG CCAGTCGGTG ACCGACAAGT GCATGGATTG GGGTGCAACG
GATCTGGTGT TGCGAGAGCT GGCCCGGCGC GGTTAG
 
Protein sequence
MTLAQTATPP ATSDRRIRSF GEIPSPHAVS TEFPLGARRA ERVARDRDEI ADILAGRDDR 
LLVVVGPCSV HDPAAALEYA GRLVKIAAEL KDSLKIVMRV YFEKPRTTIG WKGLINDPGM
DGTFDVARGL RIARQLLLDI IDIGLPVGCE FLEPTSPQYI ADAVAWGAIG ARTTESQVHR
QLASGLSMPV GFKNGTDGNI QVAVDGAKSA AAQHVFFGMD DMGRGAVVST EGNRDCHVIL
RGGTGGPNWD AESVRSAADK LESAGLPGRV VIDCSHANSG KDHVRQASVA AEVAQLVRDG
LPVSGVMLES FLVAGAQAPE ARPLTYGQSV TDKCMDWGAT DLVLRELARR G