Gene Mvan_4886 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_4886 
Symbol 
ID4648819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5232328 
End bp5233563 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content71% 
IMG OID639808357 
ProductUDP-glucuronosyl/UDP-glucosyltransferase 
Protein accessionYP_955665 
Protein GI120405836 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGG TCAACGTCGG GATCGGCCTG CAGGCGCTCG GCCACGACAT CTGCGTGCTG 
ACGGGCGCCG AGTTCAGCGA CGCGGTGCAC GCCGCCGGGC TGCAGATGAT CCCGCTGCCC
GACAGTGTGC GCATCGACCC GCCCGCGGCT GCGCCCGCGC TGCTGCGGAA GTTGCCCGTC
GCGGTGCGGC GGTTCTGGCT GGGCCGTGCC GAACTGGCCT CGGTGTTCGC CGAACCCCTT
GCCGCAGAAG CACAGTCACT GCGGGCGGTG CTGCGGAACG GCTCGGTGGA CGCGATCGTC
GCCGATGTGG CGTTCACCGG TGTGCTTCCG GTCCTGCTGC AGGACTCACC ACGGCCACCG
GTCGTCGTGT GTGGGGTGGG CCCGCTGACG ATCTCGAGCC GGGACACGCC GCCGTTCGGC
GTGGCCTGGC AACCCGAGCG CGGCCTCGAT TACCGGCGGA TGACCACCGC TGCGCACCGG
GTGATCATGC GGTCAAGCCA GCGACGCTTC AATCATGCTC TGCGACAGGC GGGTTCGGAT
GCCTGCCCGC TCTTCGTCAG CGACTGGCCG CGGCTGGCGG ACGGGTTGCT GCAACTGTCG
GTGCAGGCGT TGGAGTATCC ACGCGGCGAC CTTCCGGCCA CGGTCGAGTT CGTCGGTCCG
GTGCTGCCTG CGGGACCGGA GGATTTCGAC CCGCCGCACT GGTGGGGCGA CGTGATGAAT
GCCGGCGCGG TCGTGCACGT CACGCAGGGC ACTTTCGACA ACGCCGACCT GGACCAGCTG
ATCGCCCCGA CCCTGGAGGC CCTCGGCGAC CGCGCCGACC TGCTCGTCGT CGCCACCACC
GGCGGCCGAC AGGGGCAGCG CATCCACGGC CGGATCCCGG CCAATGCGCG AATCGCCGAC
TGGATCCCCT ATTCGGCGCT GCTTCCGCAC GTCGACGTGA TGATCACCAA CGGAGGCTAC
GGCGGGGTGC AACACGCGCT GGCCCACGGC GTGCCGCTCG TCGTCGCGGG CGAGACCTCC
GACAAAGCCG AGGTCGCCGC CCGTGTCGAC TACAGCGGCG TCGGCATCGA CCTCAAGACG
GCGACGCCGA CACCCGAGGC GATCCGGGCC GCCGTGGACC ACGTCCGCCG CGACGGCCGC
TACCGGAGCG CCGCGGAGCG GCTGCGGTCG GCGATCGAAG CGTCGACCCC GGTGGACGCC
ATCGCGAACG CGATCAAGCG GCTGTGCAAC GCCTGA
 
Protein sequence
MPMVNVGIGL QALGHDICVL TGAEFSDAVH AAGLQMIPLP DSVRIDPPAA APALLRKLPV 
AVRRFWLGRA ELASVFAEPL AAEAQSLRAV LRNGSVDAIV ADVAFTGVLP VLLQDSPRPP
VVVCGVGPLT ISSRDTPPFG VAWQPERGLD YRRMTTAAHR VIMRSSQRRF NHALRQAGSD
ACPLFVSDWP RLADGLLQLS VQALEYPRGD LPATVEFVGP VLPAGPEDFD PPHWWGDVMN
AGAVVHVTQG TFDNADLDQL IAPTLEALGD RADLLVVATT GGRQGQRIHG RIPANARIAD
WIPYSALLPH VDVMITNGGY GGVQHALAHG VPLVVAGETS DKAEVAARVD YSGVGIDLKT
ATPTPEAIRA AVDHVRRDGR YRSAAERLRS AIEASTPVDA IANAIKRLCN A