Gene Mvan_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_0472 
Symbol 
ID4645578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp514011 
End bp515036 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID639803980 
Productdihydrodipicolinate synthetase 
Protein accessionYP_951325 
Protein GI120401496 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACAC GTAGCAGCGA ACTCGTTCCC AGCGACATGA AGGGGCTGTG GGGATTCGTG 
CCGGCCTGCT CGACGCCGGA CGCCGCCGAT GTCAACGCGG TCGACACGAT CGACACCGAT
GCGCTCGCGT CTCTGGTGGA TCGTCTGGTG CGTGATGGTG TCGACGGCAT CGTGACGACC
GGCAGCGCCG GCGAGTCGCA CACCCTTTCC GACGACGAAT ACCGCACGCT CATCACGACA
GTCGTGGAGA CGGTGAACGC CCGGGTTCCG GTGTTCGTTG GTGCCAGCAC GCTCAACACG
CGCGACTCGA TCCGACGCGC CCGCGTCATC GCCGACCTCG GGGCGGACGG CATTATGAGC
GGACCGCCGA TGTATTTGCC GCAGACTGCC GAGAACGCGG TCCAGTACTA TAAAGACCTC
GCCGAGGCCG TTCCGGAGCT GGCGATTATG ATTTACCAGA ACCCGCATGC GTTCCGCATC
ACATTGCCGC CCGGTGCATT TAGGGAGCTG GCCCAGATTC GCAATATCGT TGCGCTCAAG
CAGACCTCGA TGGACATCTT CAATGTGATC GGCGCCATCA AAGCGGTCAA GGAAAAGATG
TCGGTCCTCG TCTTGGACCA ATTGATGTAC CCCGCAATGA TGTTCGGTGC TGCCGGAGCG
TGGAGCATCG ACGTATGCAT GGGCCCCTGG CCCGCGCTTT CGCTGCGCGA TGCATGCCAG
CGTGGCGACT GGACAGAGGC CGCGGCCATC GCCGACCAGA TGCAGGCGCC ATTTCGAACG
CTGGGTCTGA CTATGGAGGA ATTCCAAGCC ATGCAGTCCG CCTGGTGGAA GATGGCAATC
GACACTGCGG GCTATGGGCG TGCTGGGGCT GCTCGGCCGC CCTTCGTTCA CATACCGCAG
ACCGTCGTCG ATTCCGCGCA CCGCTACGGT GAACGCTGGG CAGGACTTGC GGAGCGCTAT
CACCGGTCAA GGGAAGCCGC TGGGCTGCCG CCTGCCGCGG CGAACGTCGC CGCCGCCTCG
TCATAG
 
Protein sequence
MTTRSSELVP SDMKGLWGFV PACSTPDAAD VNAVDTIDTD ALASLVDRLV RDGVDGIVTT 
GSAGESHTLS DDEYRTLITT VVETVNARVP VFVGASTLNT RDSIRRARVI ADLGADGIMS
GPPMYLPQTA ENAVQYYKDL AEAVPELAIM IYQNPHAFRI TLPPGAFREL AQIRNIVALK
QTSMDIFNVI GAIKAVKEKM SVLVLDQLMY PAMMFGAAGA WSIDVCMGPW PALSLRDACQ
RGDWTEAAAI ADQMQAPFRT LGLTMEEFQA MQSAWWKMAI DTAGYGRAGA ARPPFVHIPQ
TVVDSAHRYG ERWAGLAERY HRSREAAGLP PAAANVAAAS S