Gene Mvan_3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3868 
Symbol 
ID4649185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp4136643 
End bp4138226 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content67% 
IMG OID639807334 
Productglycosyltransferase family 28 protein 
Protein accessionYP_954655 
Protein GI120404826 
COG category[R] General function prediction only 
COG ID[COG4671] Predicted glycosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.479264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.241184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATCC CGGAACCGCA CTCCACCAAG CACATCGAGG ACATGCTGGC CTGGTCGGCC 
GACGTCGATG CCGCCACGCT CGTGGACACC ACGGATGCCA GGCTGGACCC GGAATTCCTC
GCGGCGGAGC CATTCGAGGA TCTCGCGAAG GCCGTCGGCT GCCCGGTCCA TGTGGTGCAC
GGGACAGCGG ACCGGATCAG CAGCCCCGCG GTCGGCGAGC AGCTCGCCGA GCTCACCGGC
GGCTCGCTGA CGCTGATCGA GGGTGCCGGC CACGCGCCGC TGGCCCGAGA TCCGGTGCTG
ATCAACACGA TGATCCACGA TTTCGTCGCG ACGGTCGCGC CGTCGCCGCG CCTCAAGCAG
CGCGTCCGCG CGCCCCGCCG ACGGCGGAAG GCTCTGTACC TGTCCTCGCC GATCGGGCTC
GGCCACGCCC GCCGCGATGT CGCGATCGCC ACCGAGCTAC GTTCTGCAAC AGAAGATCTG
GAGATCGAGT GGCTGGCGCA GGACCCCGTC ACGCGGGTGC TCGCGTCAGC CGGTGAGCGG
ATTCATCCCG CATCGGCACA GCTGCTCAAC GAATCGACGC ACGTCGAACA CGAATCCGGT
GAGCACGACC TGCACGCGTT CGAGGCGCTG CGCCGGATGG ACGAGATCCT GGTCGCCAAC
TTCATGGTGT TCGCCGACCT GATCGCCGAG GAACCCTTCG ATCTGGTGAT CGCCGACGAA
GCGTGGGAGG TGGACTACTT CCTGCACGAG AATCCGGAAC TCAAGCGGTT CTCGTTCGCC
TGGCTGACCG ACTTCGTCGG CTGGTTGCCC ATGCCGGACG GTGGACCGCG GGAGGCAGCG
CTGACCGCCG ACTACAACGC GGAGATGATC GAGCAGCGCG CGAGGTTTCC GCGACTTCGG
GACCGGTCGA TATTCGTCGG CAATCCCGAA GACGTTGTGC GACAAGACTT CGGCCCTGGG
CTGCCCGACA TCAGGGAGTG GACCGGCCAG AACTTCGACT TCTCCGGATA TGTCACAGGC
TCGGTGCCGC CGGCGGGTCC GGAGCGGGCG GCACTGCGTC GGAAACTCGG GTTGCAGCCG
GATCAGCGAC TGTGCGTCGT CACCGTGGGC GGCACCTCGG TGGGGGAGTC GCTGCTGCAA
CGCATTCTGC ATGCGGTGCC CATCGTTCGC CGGGCAATGC CGGAGCTTCA CTTCCTGGTC
GTGACGGGTC CTCGCATCGA CCCCGCGACG CTGCCTCATC CGCGAGGCGT CCGGGTCCGT
GGCTTCGTCC CCGACCTCGC CGACTACCTC GCCGCCTGTG ACATCGCGCT GGTGCAGGGT
GGACTGACGA CGTGCATGGA GCTGACGGCG GCGGGAACGC CGTTCGTCTA TGTGCCACTG
GAGAATCACT TCGAACAGAA CTTCCATGTG CGTCACCGGT TGGAGCGCTA CGGCGGCGGC
CGTCCGATGC GCTACGCGGA GGCTGCCGAT CCGGACCTGC TGGCCAAGAT CATCTTCGAT
GAACTGTCCG CGACGCGACG GGTCCTTCCC GTCGAGACCG ACGGAGCCAG GCGTGCCGCG
GCGATGCTCG CCGATCTGCT GTAG
 
Protein sequence
MMIPEPHSTK HIEDMLAWSA DVDAATLVDT TDARLDPEFL AAEPFEDLAK AVGCPVHVVH 
GTADRISSPA VGEQLAELTG GSLTLIEGAG HAPLARDPVL INTMIHDFVA TVAPSPRLKQ
RVRAPRRRRK ALYLSSPIGL GHARRDVAIA TELRSATEDL EIEWLAQDPV TRVLASAGER
IHPASAQLLN ESTHVEHESG EHDLHAFEAL RRMDEILVAN FMVFADLIAE EPFDLVIADE
AWEVDYFLHE NPELKRFSFA WLTDFVGWLP MPDGGPREAA LTADYNAEMI EQRARFPRLR
DRSIFVGNPE DVVRQDFGPG LPDIREWTGQ NFDFSGYVTG SVPPAGPERA ALRRKLGLQP
DQRLCVVTVG GTSVGESLLQ RILHAVPIVR RAMPELHFLV VTGPRIDPAT LPHPRGVRVR
GFVPDLADYL AACDIALVQG GLTTCMELTA AGTPFVYVPL ENHFEQNFHV RHRLERYGGG
RPMRYAEAAD PDLLAKIIFD ELSATRRVLP VETDGARRAA AMLADLL