Gene Mvan_5649 details

Gene Information

Locus tag: Mvan_5649
Symbol:
ID: 4643334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6037556
End bp: 6039517
Gene Length: 1962 bp
Protein Length: 653 aa
Translation table: 11
GC content: 67%
IMG OID: 639809125
Product: glycosyltransferases-like protein
Protein accession: YP_956420
Protein GI: 120406591
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
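
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6037556
end_bp = 6039517
gene_length_bp = 1962
protein_length_aa = 653

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon, which is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # prints: 1962 0 653
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.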


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.137997
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGACA TCCCGTCCGG CGCGCTCGAC GCCGGGCAGT CACGGGCGGT GAGCCCCCTG 
GCCCGGATCA TCCTGCCGCG GCCGGGCGAG CCGCTCGACG TGCGCAAGCT CTATATAGAG
GAATCGGACA CCAACGCCAG GCGCGCGCAC GCCCCGACCC GTACCACCCT GGAGATCGGC
GCCGAATCCG AGGTGTCCTT CGCCACGTAC TTCAACGCGT TCCCGGCCAG CTACTGGCGG
CGCTGGTCGA TCCTGGAGGC GGTGGTACTG CGCGTCGAAC TCACCGGCAG CGCCCGCGTC
GACATCTACC GCTCCAAGGC CACCGGAGCC AGGATCACGG TCGGCGGCGC CCCGATCGCC
AGCAGAAATG TCGAACCGGC GGCTGGGTCC GATGTCGGCG CAAGCGCCTC CGTCGAGTTC
GAGATCGACC TCACCCCGTT CGAGGACGGC GGCTGGATCT GGTTCGACAT CACCACCGAC
GCCAAGTCCA CGCTGCACCA CGCCGGGTGG TACGCCCCGA TGCCCGCTCC GGGGCGCGCC
AATGTCGCGG TCGGCATCCC CACGTTCAAC CGGCCCTCGG ACTGCGTCAA CGCGCTCGCC
GCGCTGACCT CCGACCCGCT GGTCGACGAG GTGATCAGCG CGGTGATCGT GTCCGATCAG
GGCACGCAGA AAGCCAAGGA CCACCCGGGT TTCGACGCGG CCGCAGCGGC GCTGGGCAGC
CGGTTGTCCG TTCACAACCA GCCCAACCTG GGCGGCTCCG GCGGCTACAG CCGCGTCATG
TACGAGGCGC TGAAGAACAC CGACTGTGAG CAGATCCTGT TCATGGACGA CGACATCCGC
ATCGAGCCCG ACTCGATCCT GCGTGCGCTC GCGCTGAACC GGTTCGCCAA GGTCCCGACG
CTCGTCGGCG GGCAGATGCT CAACCTGCAG GAGCCCAGCC ACCTGCACGT GATGGGCGAG
ATGGTCGACT CGGCGAACTT CATGTGGACC GGGGCGGTCA ACACCGAGTA CGACCACAAC
TTCGCCAAGT ATCCGCTCAA CGACGAAGAG GAATACCGCA GCCGGCTCCT GCACCGCCGC
ATCGACGTCG ACTACAACGG CTGGTGGATG TGCATGATCC CGCGGCAGGT CGCCGAGGAG
CTCGGCCAGC CGCTGCCGCT GTTCATCAAG TGGGACGACG CGGACTACGG GCTGCGCGCC
GGTGAGCACG GCTATCCCAC CGTCACGCTG CCGGGCGCGG CCATCTGGCA CATGGCGTGG
AGCGACAAGG ACGACGCCAT CGACTGGCAG GCGTACTTCC ACCTGCGCAA CCGGCTGGTG
GTGGCGGCGC TGCACTGGGA CGGCAAGATC AGCGGGCTGT TGGCCAGCCA CCTCAAGGCG
ACACTCAAAC ACCTTCTGTG CCTTGAGTAT TCGACTGTGG CCATCCAGAA CAAGGCGATG
GACGACTTCC TGGCCGGCCC CGAGCACATC TTCTCGATTC TGGAATCCGC GCTGCCCGAC
GTGCGCAGAA TGCGCCAGGA GTACCCCGAC GCGGTGGTGC TGCCGAGCGC GACGGCGCTG
CCGGCACCAT CGGACAAGCG GTGGCGCAAG AAGGTGAGCA TCCCGACGAA CCCGGTGTCG
ATCTCGGCGC GCCTGGCACG CGGTGTGGTG CACCAGCTCA AGCCGCACGA CCCGGAGCAC
CATCGCCGTC CGCAGATCAA CGTGGCGACG CAGGACGCCC GGTGGTTCTC GCTGTGCAAC
GTCGATGGGG TGACGGTGAC CACCGCCGAC GGCCGCGGTG TCGTCTACCG GCAGCGGGAC
CGGGAGAAGA TGTTCACGCT GCTGCGCGAG TCGGTGAAGC GCCAGGTCCA GTTGGCCCGC
AAGTTCAACC GGATGCGCAA GGTGTACCGC GCGGCCCTGC CGACGATGAC CAGCACGCAG
AAGTGGGAGA CCGTGCTCCT CGACGAGTCC AGCCATGGCT GA
 
Protein sequence
MSDIPSGALD AGQSRAVSPL ARIILPRPGE PLDVRKLYIE ESDTNARRAH APTRTTLEIG 
AESEVSFATY FNAFPASYWR RWSILEAVVL RVELTGSARV DIYRSKATGA RITVGGAPIA
SRNVEPAAGS DVGASASVEF EIDLTPFEDG GWIWFDITTD AKSTLHHAGW YAPMPAPGRA
NVAVGIPTFN RPSDCVNALA ALTSDPLVDE VISAVIVSDQ GTQKAKDHPG FDAAAAALGS
RLSVHNQPNL GGSGGYSRVM YEALKNTDCE QILFMDDDIR IEPDSILRAL ALNRFAKVPT
LVGGQMLNLQ EPSHLHVMGE MVDSANFMWT GAVNTEYDHN FAKYPLNDEE EYRSRLLHRR
IDVDYNGWWM CMIPRQVAEE LGQPLPLFIK WDDADYGLRA GEHGYPTVTL PGAAIWHMAW
SDKDDAIDWQ AYFHLRNRLV VAALHWDGKI SGLLASHLKA TLKHLLCLEY STVAIQNKAM
DDFLAGPEHI FSILESALPD VRRMRQEYPD AVVLPSATAL PAPSDKRWRK KVSIPTNPVS
ISARLARGVV HQLKPHDPEH HRRPQINVAT QDARWFSLCN VDGVTVTTAD GRGVVYRQRD
REKMFTLLRE SVKRQVQLAR KFNRMRKVYR AALPTMTSTQ KWETVLLDES SHG
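
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a listing could be cleaned, checked for GC content, and translated is given below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The amino acid string is the standard NCBI assignment for translation table 11, which this record lists. Only the first and last lines of the gene sequence are used as input, to keep the example short.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first base varying slowest.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to block the listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; a trailing partial codon is ignored."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGAGTGACA TCCCGTCCGG CGCGCTCGAC GCCGGGCAGT CACGGGCGGT GAGCCCCCTG"
last_line = "AAGTGGGAGA CCGTGCTCCT CGACGAGTCC AGCCATGGCT GA"

print(translate(clean(first_line)))  # prints: MSDIPSGALDAGQSRAVSPL
print(translate(clean(last_line)))   # prints: KWETVLLDESSHG*
print(round(gc_fraction(clean(first_line)), 3))
```

The two translations agree with the start and end of the protein sequence listed above. Running `gc_fraction` on the full cleaned gene sequence, rather than on a single line, would be the way to compare against the GC content field.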