Gene Mvan_0211 details

Gene Information

Locus tag: Mvan_0211
Symbol:
ID: 4647724
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 222656
End bp: 224527
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 67%
IMG OID: 639803721
Product: glycosyl transferase family protein
Protein accession: YP_951067
Protein GI: 120401238
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
        [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
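
The coordinate and length fields above agree with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. Neither assumption is stated in the record, although both fit the listed values. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 222656
END_BP = 224527


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1872, matching "Gene Length"
print(protein_length(length_bp))  # 623, matching "Protein Length"
```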


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGA CAGACGCCCC GGGAGCCATC GCCGCCGGTC CGGCCGTCAT GCAGACACCG 
AAGGTGTCGA TATGTATCCC CGCCCACCAG GCCGCTGCGT ACCTGCAACC CCTGCTCGAC
AGCGTGCTGT CCCAGGCCTA CGACGACTTC GAGGTGGTCG TCATCGACAA CCACAGCACC
GACGGCACTT CCGACATCCT GGCGCGCGTC GACGATCCGC GTGTTCGGGT CATGCGGAAT
CCGGCCACCC TGCCGTTCGT CGAGAACTGG AACCTCCTGG TGTCACAGTC CCGGGGCGAG
TTCGTCAAAC TTGTCTGCGC CGACGATCTG CTCAAGCCCG GCTGCCTTGC GGTGCAGGCC
TCGGTCCTCG ACAACAATCC CGATGTCGCC CTGGTGTCGG TGAAATGCGA CTTCATCGAC
GACAACGAGC GCTTGATCGT GCCCGCCCGG GGACTCGACG GCATCGAGGG ACAGGTCACC
GCCGAAGGCG TGGTCAGGCG GATCGTGCGC AATGGAGGCA ATCCGATCGG AGCACCGGTG
GCGGGCATGT TCCGGCGCGC CGACTTCGAC CGGGTCGGTG GGTTCACCGC CGACTTCCCC
TTCCTGAGTG ACATACATCT GTGGGTGCGG CTGTTGGGCT GTGGCGACTT CTACGGCATA
CCGGCGACAC ACGCCTCGTT CCGGATCCGC GGTGGCTCCA TGAGCGGCCT GACCTCGGCG
CGGACCCAAC TTGCCCAGTC GCTCGACTTC GAGAAGTCGC TTGCCCGCGA TCCACGCTGG
GACCTGTCCC AAATCGACCT TTTCCGCGGC TGGATGCGTT GCCACGAACA GACTTTGCGC
CGGATGGCAC TGTTCGGTCT GACCAAGTGG CGTGTTGCGC GACGTGACCG CGGGCCGGTC
CGGGCCGGCC CGCGAGCCGG CACCGATCTG CCGTCGACCG TCGTGGCCGA CACCCTGACC
GTGGTGATCT GCGCCTACAC CACGCAACGG TGGGATGAGC TCTGCCCTGC AGTGGAATCA
GTTCTGAATC AGGACTTCCC GGTACTCGGC GTCGTCGTGG TAATCGATCA CTGCCCGGAG
CTGTACCGGC TCGCCCGGGA CCGATTCGGT GCCCGAGGAC GAGTCACGGT GCTCGAAAGT
GACGGGGAGC GTGGACTTTC GGGTGCCAGG AACACGGGGG TGGGCGCGGC GCGCGGCGAC
GTCGTCGCGT TCCTCGACGA CGACGCAGTC GCCGAGCCCG GTTGGGCGCA TGCCCTGATG
CGCCACTATC GCGATCCGCG GGTCGCTGCC GTCGGCGGCT ATGCCGCCCC GGTGTGGCCC
ACGGGCGCCC GCCCGCACTG GATGCCTGCC GAGTTCGACT GGGTGGTCGG GTGCAGTTAC
ACCGGGCAGC CGACCGAGCT GGCGGAGGTG CGGAACCCCC TTGGCTGCAA TATGTCGATC
CGCCGCTCGG TGTTCGACGA CATCGGCGGG TTCAGGTCCG AGGTGGGCCG GGTCGGCAAC
CACCCGGTCG GCGGAGAGGA GACGGAGCTG TGCCTCCGCA TCCGTGGCCG CCAACCGGAT
GCACGGGTGC TGTACGACCC GGACGCCGTT GTCCGTCATC ATGTCTCGTG CGATCGGACG
ACGATCCGCT ATTTCCGGCG GCGGTGCTAC CACGAGGGGA TTTCGAAGGC TGTCGTCACC
GAGATCGCAG GCGTCGGCAA CCCGCTGCTC GCGGAGCGGG CCTACACGAC GCGGACCCTC
CCGCGCGGCG TCCTGCGGGA GCTCACAGCG CCGAGGCAGG GCGGGTTCCG GCGCGCGGGA
GTCATGGCTT TCGGGCTCGC AGCGACGACG GCCGGCTATC TGCGCGCCAA GACCCAGTAC
CGGCTCTCAT GA
 
Protein sequence
MTTTDAPGAI AAGPAVMQTP KVSICIPAHQ AAAYLQPLLD SVLSQAYDDF EVVVIDNHST 
DGTSDILARV DDPRVRVMRN PATLPFVENW NLLVSQSRGE FVKLVCADDL LKPGCLAVQA
SVLDNNPDVA LVSVKCDFID DNERLIVPAR GLDGIEGQVT AEGVVRRIVR NGGNPIGAPV
AGMFRRADFD RVGGFTADFP FLSDIHLWVR LLGCGDFYGI PATHASFRIR GGSMSGLTSA
RTQLAQSLDF EKSLARDPRW DLSQIDLFRG WMRCHEQTLR RMALFGLTKW RVARRDRGPV
RAGPRAGTDL PSTVVADTLT VVICAYTTQR WDELCPAVES VLNQDFPVLG VVVVIDHCPE
LYRLARDRFG ARGRVTVLES DGERGLSGAR NTGVGAARGD VVAFLDDDAV AEPGWAHALM
RHYRDPRVAA VGGYAAPVWP TGARPHWMPA EFDWVVGCSY TGQPTELAEV RNPLGCNMSI
RRSVFDDIGG FRSEVGRVGN HPVGGEETEL CLRIRGRQPD ARVLYDPDAV VRHHVSCDRT
TIRYFRRRCY HEGISKAVVT EIAGVGNPLL AERAYTTRTL PRGVLRELTA PRQGGFRRAG
VMAFGLAATT AGYLRAKTQY RLS