Gene Mvan_5750 details


Gene Information

Locus tag: Mvan_5750
Symbol:
ID: 4644205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 6138539
End bp: 6140923
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 69%
IMG OID: 639809226
Product: Kojibiose phosphorylase
Protein accession: YP_956521
Protein GI: 120406692
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1554] Trehalose and maltose hydrolases (possible phosphorylases)
TIGRFAM ID:
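
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6138539
END_BP = 6140923
GENE_LENGTH_BP = 2385
PROTEIN_LENGTH_AA = 794


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
coords_match = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_match = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_match, protein_match)
```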


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.06218
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0760433
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGCCG ACGAAGCCTT TCCGGTCGAG CCGTGGCACA TCCGGGAGAC CGAACTGCGG 
CTGGATCTGC TGCCGCAGAC CGAGTCGCTG TTCGCGCTGT CCAACGGGCA CATCGGGTTG
CGCGGCAACC TCGATGAAGG CGAACCGCAC GGGCTGCCGG GCACCTACCT CAACGGCTTC
TACGAAATCC GGCCACTGCC GTACGCGGAG GCGGGGTTCG GTTACCCCGA AGAGGGACAG
TCCATCGTCG ACGTCACCAA CGGCAAACTG ATCCGGCTGC TGGTCGACGA CGAACCGTTC
GACGTCCGGT ACGGTGAATT GAGTTCGCAC GAAAGGGTTC TGGATCTGCG GGCGGGCACG
CTGACGCGGG CCGCGGAGTG GACATCACCG GCCGGGAAGC GGGTCGAGGT GCACTCGACA
CGGCTGGTGT CGCTGACCCA GCGCGGTCTG GCGGCCATCG AATACGTGGT GGAGGCCGTC
GACGACGTGC TGCTGACGGT GCAGTCCGAA CTCGTGGCCA ACGAGGATCA GCCTCCGGCA
TCGGCCGACC CGAGGGTGGC CGCGGTGCTG AGTCACCCGC TGACCCCGGT GCAGCACGAG
GTCAGCGAAC GCGGGGCGAT GTTGATCCAC CGCACCCGTG CCAGCGAACT GACGCTGGCC
GCTGCGATGG ACCACGTCGT CGAGGCACCC GGCCGGGTGG ACCTCAGCGG TGACGCCGGA
CAGGACTGGG CGCGCACCAC GGTGGTGTGC GGGCTGAAAG CCGGTGAGCG TCTTCGGCTG
GTGAAGTACC TGGCCTACGG ATGGTCGAGC CTGCGGTCGC GCCCCGCGCT GCGGGACCAG
GTCGCCGCCG CCATCGCGGG CGCCCGCTAC ACCGGCTGGG ACGGGTTGCT CACTGCGCAG
CGTGAATACC TCGACGAATT CTGGGACTGC GCCGACGTGG AGGTGGACGG CGACGCCGAC
TGCCAACAGG CGGTGCGGTT CGGATTGTTC CACGTGCTGC AGGCCAGCGC CCGCGCCGAG
CGACGGGCCA TCGCGGGCAA GGGCCTGACC GGAACCGGTT ACGACGGCCA CGCCTTCTGG
GACACCGAGG GTTTCGTGCT TCCGGTGCTG ACCTACACCG CGCCGCGTGC GGCTGCGGAC
GCGCTGCGCT GGCGGGCCTC GACGTTAGAG ATGGCGCGGG ACCGCGCCGC CGAACTCGAC
CTGCGCGGCG CCGCCTTCCC GTGGCGCACC ATCCACGGCG AGGAGTGCTC GGCCTACTGG
CCCGCAGGCA CCGCGGGATT CCACGTCAAC GCCGACATCG CGATGGCGTT CGACCGTTAC
CGCGTGGTGA CGGGCGACGA ATCGCTGGAA AAGGATTGCG GTCTGGCGGT TCTCGTCGAC
ACCGCGCGGC TGTGGCTGTC GTTGGGACAC CACGACCGCT ACGGCAGGTG GCGCATCGAC
GGCGTGACCG GGCCCGACGA ATACACCGCC GTCGTGCGGG ACAACGTCTT CACCAACCTG
ATGGCGGCGG CGAACCTGCG TGTCGCCGCT GACGCCTGCA CCCGGCAGCC CGACGCGGCC
CGCGCGCTCG GGGTCGACAC CGAGGAGACC GCCGCCTGGC GCGACGCCGC CGACGCCGTG
CACATCCCGT ACGACGAGGA GTTGGGTGTG CATCCGCAGT GCGACGGGTT CACCACGCTG
CGGGAATGGG ATTTCGATCA GAACACGAAA TATCCACTGC TGCTTCATGA ACCGTACGTT
CGTCTCTATC CCGCCCAGGT GGTCAAGCAG GCCGACCTGG TGCTGGCGAT GCAGTGGCAG
AGCCACGCGT TCACCCCGGA TCAGAAGGCG CGCAACGTCG ACTACTACGA ATGCCGCACC
ACCAGGGATT CGTCGCTGTC GGCCTGTACC CAGGCCGTGA TGTGCGCCGA GGTGGGCCAT
CTGGAGTTGG CCCACGACTA CGCCTACGAG GCCGCGCTGA TCGATCTGCG CGATCTGCAC
CACAACACCG GGGACGGGCT GCACCTGGCG TCGCTGGCCG GCAGCTGGAC CGCGCTGGTC
GCCGGGTTCG GCGGACTCCG CGACGACGAG GGCGTCCTGG CGCTGGATCC GCAGCTGCCC
GGCGGTATCA GCAGGTTGCG GTTCCGGCTG CGCTGGCGCG GCTTTCGGGT CACCGTCGAT
GCCGACCACG ACGCCGTCAC CTACACGCTG CGGGACGGGC CGGAAGGCGT GCTGACCATC
CGCCACTCCG GTGACCCGCT CGAGATCAAC ACCCGCAAAC CCACGCGGGT CGCGGTGCGG
CCGAAGAAGC CGCTGCTCGC CCCGCCGAAG CAGCCGCCAG GCCGCGAGCC GCTGCGGAGA
TGGCGCTCAA CGGTGGACGC GTCGCGGAAT TCGCGGGAGA GCTGA
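
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be compared with the Gene Length and GC content fields of this record. The following is a minimal sketch of that step, not the method the database itself uses. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed (sample input only).
first_row = "ATGATCGCCG ACGAAGCCTT TCCGGTCGAG CCGTGGCACA TCCGGGAGAC CGAACTGCGG"

seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the whole sequence block through `clean_sequence` instead of one row gives the values to compare with the reported gene length and GC content.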
 
Protein sequence
MIADEAFPVE PWHIRETELR LDLLPQTESL FALSNGHIGL RGNLDEGEPH GLPGTYLNGF 
YEIRPLPYAE AGFGYPEEGQ SIVDVTNGKL IRLLVDDEPF DVRYGELSSH ERVLDLRAGT
LTRAAEWTSP AGKRVEVHST RLVSLTQRGL AAIEYVVEAV DDVLLTVQSE LVANEDQPPA
SADPRVAAVL SHPLTPVQHE VSERGAMLIH RTRASELTLA AAMDHVVEAP GRVDLSGDAG
QDWARTTVVC GLKAGERLRL VKYLAYGWSS LRSRPALRDQ VAAAIAGARY TGWDGLLTAQ
REYLDEFWDC ADVEVDGDAD CQQAVRFGLF HVLQASARAE RRAIAGKGLT GTGYDGHAFW
DTEGFVLPVL TYTAPRAAAD ALRWRASTLE MARDRAAELD LRGAAFPWRT IHGEECSAYW
PAGTAGFHVN ADIAMAFDRY RVVTGDESLE KDCGLAVLVD TARLWLSLGH HDRYGRWRID
GVTGPDEYTA VVRDNVFTNL MAAANLRVAA DACTRQPDAA RALGVDTEET AAWRDAADAV
HIPYDEELGV HPQCDGFTTL REWDFDQNTK YPLLLHEPYV RLYPAQVVKQ ADLVLAMQWQ
SHAFTPDQKA RNVDYYECRT TRDSSLSACT QAVMCAEVGH LELAHDYAYE AALIDLRDLH
HNTGDGLHLA SLAGSWTALV AGFGGLRDDE GVLALDPQLP GGISRLRFRL RWRGFRVTVD
ADHDAVTYTL RDGPEGVLTI RHSGDPLEIN TRKPTRVAVR PKKPLLAPPK QPPGREPLRR
WRSTVDASRN SRES