Gene Mvan_2401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_2401 
Symbol 
ID4644823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp2556415 
End bp2557587 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content74% 
IMG OID639805885 
ProductUDP-glucuronosyl/UDP-glucosyltransferase 
Protein accessionYP_953221 
Protein GI120403392 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0273557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTTG CCGTGGTCGC AGGACCTGAC CCGGGGCACG CTTTCCCGGC GATCGCGCTG 
TGCCTGAGGT TTCTGGCGGC CGGGGACCAC CCCACGTTGC TGACCGGAAC CAAATGGCTC
GACACGGCGC GTCGGGAAGG CGTCGATGCG GTCGAACTGC TCGGCCTGGA CCCCACCGCC
GACGACGACG ACACCGACGC CGGCGCCAAG ATCCACCACC GCGCCGCCCG GATGGCCGTC
CTCAACAAGG ACCCGATCGC CGCGCTGGCA CCGGACCTGG TGATCTCGGA TTCGATCACC
ACCTGTGGCG GGCTGGCCGC CGACCTGCTG GGACTGCCGT GGGCGGAACT GAACACCCAT
CCGCTCTACC ACCCGTCCAA GGGTCTGCCG CCGATCGGCA GCGGGCTGGC ACCGGGAACC
GGCCTGCGCG GCCGGCTGCG CGATGCGGTG CTGCGGGCAC TCAGCGCCCG CTCGTGGCGT
GCCGGCCTGC GCCAGCGGTC CGAGGCGCGG GTCGGGATCG GGCTGCCCGC CGCGGATCCG
GGCCCGGTGT GTCGCCTGAT CGCCACGCTG CCCGCACTGG AGGTGCCTCG CCCGGACTGG
CCGGCAGACG CGGTCGTCGT GGGTCCGCTG CACTTCGAGC CGACGACGCA CGTGCTGCGG
CTGCCCGCCG GGGACGGCCC CGTCGTCGTG GTGGCCCCGT CGACCGCGAC GACCGGTGCG
GTCGGGCTGG CCGAACTGGC GTTGGCGACG CTGGTGCCCG GCGAGGTGCT GCCCGAGGGG
GCGCGGGTGG TGGTGTCGCG CCTCGAGGGA CCCGACGCCG AGGTCCCGCC GTGGGCGGTG
GTCGGTCTCG GCCGGCAGGA CGAACTGCTG GCGAAGGCGG ATCTGCTGAT CTGTGGCGGC
GGGCACGGTA CGGTCGCGAA GTCGTTGCTG GCCGGCGTCC CGATGGTCGT GGTCCCCGGC
GGGGGAGACC AATGGGAGAT CGCCAATCGC CTTGTCCGCC AAGGCAGTGC GCAGCTCGTC
CGGCCGCTGA CCGGTTCGAC GCTGACCGCC GCCGTGCAGG AAGTGCTCGG CTCACCGGGC
TATCGGGAGG CTGCCCGCCG GGCCGGGGCC GGCGTCGCGG ACGTCGCCGA TCCGGTACGG
GTGTGCCACG CCGCCCTCGG GGCCCCGGCG TAA
 
Protein sequence
MRVAVVAGPD PGHAFPAIAL CLRFLAAGDH PTLLTGTKWL DTARREGVDA VELLGLDPTA 
DDDDTDAGAK IHHRAARMAV LNKDPIAALA PDLVISDSIT TCGGLAADLL GLPWAELNTH
PLYHPSKGLP PIGSGLAPGT GLRGRLRDAV LRALSARSWR AGLRQRSEAR VGIGLPAADP
GPVCRLIATL PALEVPRPDW PADAVVVGPL HFEPTTHVLR LPAGDGPVVV VAPSTATTGA
VGLAELALAT LVPGEVLPEG ARVVVSRLEG PDAEVPPWAV VGLGRQDELL AKADLLICGG
GHGTVAKSLL AGVPMVVVPG GGDQWEIANR LVRQGSAQLV RPLTGSTLTA AVQEVLGSPG
YREAARRAGA GVADVADPVR VCHAALGAPA