Gene Mvan_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1042 
Symbol 
ID4645353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1094953 
End bp1096089 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content69% 
IMG OID639804543 
Productglycosyl transferase, group 1 
Protein accessionYP_951886 
Protein GI120402057 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCGAC CGATCCCCCG GACGGTCCGA TTCGGCCTGC TCAGTACTTA CCCTCCAACA 
CCGTGCCGAC TCGCGAATTA CAGCTCAGCG CTGTTTGGCG CCTTGAGCGC GCGCGGGTCC
CAGGTGAGCG TGGTGCGGGT TGCCGACGGC TCACAGTCGA GTGACGCCAG GATCGTCGGG
GAGTTGGTCA ACGGCTCGGC GCGATCGGCG GCGCACTGCG TCGACTCGCT CAATCACAGC
GACGTCGCGG TGATTCAGCA CGACTACGGC GTTTACGGTG GCGCACACGG CGACGGCCTG
CTGGACGTCA TCGACGGGCT GCGCGTCCCG ACGGTGGCCG TCGCCCATAC GATCTTGAAA
AACCCTGCGC CACATCAACG TTGGGTGATG GAGCGGATGG CGGCGACGAT CGACCGGATG
GTGGTGATGT CCGAGGCGGC ACGGGAGCGG CTGTGCCGTG AGTACGGCGT GGACCGCCGC
AAGGTCGTCA CGATCCCGTA CGGTGCGGTG CTGCCCACCG GCCCACGTGC GAAGCGTGGC
AGCAGGCCCA CCATCCTGAC GTGCGGTCTG CTCGGCCCCG GTAAGGGCGT CGAGCGCGTC
ATCGACGTGA TGTCCTCGTT GCAGAGCGTG CCCGGCCATC CCCGCTATGT GGTGGCGGGC
CGCACGCATC CGAAGGTGCT GGCCCGCGAC GGCGAGGCCT ACCGCGAAGC CCGCATCGAG
CAGGCCCGCC GCCTCGGTGT CGCGGATTCG GTGACCTTCG AGGACCGCCA CCTGGACCGG
GCATCGCTGG CAGCGCTCTT CCAGGCGGCG GCAGTCATCG TCTTGCCCTA CGACTCCACC
GATCAAGTGA CCTCGGGAGC CCTGGTCGAC GCAGTCGCCA GCGGCAGACC CGTCGTGGCC
ACCGCGTTCC CGCATGCGGT GGAGGTCCTG CGGGACGGTG CCGGCATCCT CGTCCCCCAT
GACGATCCCG AGGCCCTGTC CTGCGCGCTA CGCCGTGTCC TGACACAGCC GCGGCTGGCC
GGGTCGCTGG CCGCCGAGGC GCGGCAACTG GCGCCGGCGA TGGCGTGGCC GGTCGTTGCC
GACACCTACC TGGAGCTGGC GGCTCGCCTG CTGACGGAGC GGCAGCTACG CGTGTGA
 
Protein sequence
MKRPIPRTVR FGLLSTYPPT PCRLANYSSA LFGALSARGS QVSVVRVADG SQSSDARIVG 
ELVNGSARSA AHCVDSLNHS DVAVIQHDYG VYGGAHGDGL LDVIDGLRVP TVAVAHTILK
NPAPHQRWVM ERMAATIDRM VVMSEAARER LCREYGVDRR KVVTIPYGAV LPTGPRAKRG
SRPTILTCGL LGPGKGVERV IDVMSSLQSV PGHPRYVVAG RTHPKVLARD GEAYREARIE
QARRLGVADS VTFEDRHLDR ASLAALFQAA AVIVLPYDST DQVTSGALVD AVASGRPVVA
TAFPHAVEVL RDGAGILVPH DDPEALSCAL RRVLTQPRLA GSLAAEARQL APAMAWPVVA
DTYLELAARL LTERQLRV