Gene Mvan_5178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5178 
Symbol 
ID4645695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5541150 
End bp5543408 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content67% 
IMG OID639808653 
Producttrehalose synthase 
Protein accessionYP_955955 
Protein GI120406126 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02455] trehalose synthase, Pseudomonas stutzeri type 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.365821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGTCGG AACCCGAAGA CCAGGGCACT GATTCTTCTG AGTCGCCCGA GCTCACGTTC 
GACGAGCACC TGCACCCGGC GCGACCGCGG TCGCTGCGGT TCCGGCCGCG GGTGCGGGCA
CCGTTCACCC GCCGCTCGCT GGCCCGGGAC GGCTCCCCCA CCGGCGACAA CCCCGCTTAC
GTGTCGTGGT TGATCAGCCA GTCGATGCTT GCCGATGCCA ACGAGATCAG CCAGCAGTTC
TCCGGCCAGG GCTCGATGTG GCAGAACCCT TACGCCACAC CGAGCCCGCG CGGCGCGGTC
GAAACGGCGT CGGTGTGGTT CACCGCGTAC CCGCTGTCGC TGATCACCCG GTCCAACGAG
TCTTTCCTCA AGGCGATGGC CGATGTGGAG ATGTGGAAGG CGTTCGCGGA GATCGGGATC
GAGGCGGTGC ACACCGGCCC GGTTAAACGC GCCGGCGGCA TCTCCGGCTG GGAGCTGACC
CCCAGCGTCG ACGGGCACTT CGACCGCATC AGCACCGAGA TCGACCCCGC GTTCGGGACC
GAGGACGAGT TCCGTCAGAT GTGCGGCACC GCGAACTGGT ACGGCGGCAC CATCATCGAC
GACATCGTGC CCGGCCACAC CGGTAAGGGC GCCGACTTCC GGCTCGCCGA GATGAAATAT
GCCGACTACC CCGGCATCTA CCACATGGTC GAGGTCGACC CCCGCGACTG GGAACACCTG
CCCGACGTGC CGCCCGGCGC GGACTCGGTC AACATCGACC CCGCGACCGA GCAGTGGCTC
GACAAGGCCG GCTACATCAT CGGCAAGCTT CAGCGGGTCA TCTTCTATGC CGAGGGCATC
AAGGAGACCA ACTGGAGCGT GACCCGCCCC GTCGTAGGCA TCGACGGTGT CGAACGCCGC
TGGGTGTACC TGCACTACTT CAAGGACGGC CAGCCCTCGA TCAACTGGCT GGACCCGTCC
TTCGCCGGGA TGCGGCTGGT CATCGGCGAC GCGCTGCACT CGCTGACGGA TCTGGGCACC
GGCGGACTCC GCTTGGACGC CAACGGTTTT CTCGGCGCGG AGAAAACCGC GGCCGAGGAC
AGCGCCGCCT GGTCGGAGGG CCACCCGCTC TCGGAAGCGG CCAACCACCT GATCGCCAGC
ATGGTGCGTA AGGTCGGCGG CTTCACCTTC CAGGAGCTCA ACCTCACCAT CGACGACATC
CGCGAGATCG GCGAGGCCGG CGCCGACCTG TCCTACGACT TCATCACCCG CCCGGCCGCC
CACCACGCGC TGGCCACCGC CGACACCGAG TTCCTTCGCC TGACGTTGCG CACCACCCTC
GAGCTCGGCG TGGACCCCGC CTCCCTGGTG CACGCGCTGC AGAATCATGA CGAACTCACC
TACGAACTCG TGCACTGGTC CACCGGCCAC CGCGACGACG TGTACACGTA CAAGGGCGAG
GAGATCACCG GGGAGGTCCT CGGCGAAACG ATCCGCAGCG ACCTGAGCGA GAAGCTGACC
GGCGAGAACG CGCCCTACAA CCTGGTGTTC ACCACCAACG GCATCGCCTG CACCACGGCC
ACCGTCATCG CCGCGACGTT GGGGATCGCC GCGCTGGACG ACATCGTCGA CGACGAGCAG
ATCGACCGGA TCCGCCGCGC CCATCTGCTG CTGGCGATGT TCAACGCATT GCAGCCGGGC
GTGTTTGCGC TGTCGGGATG GGACGTCTGC GGCATGCTGA CGCTGCCGCC GTCGCAGATC
ACCGAACTGC TGCACGGCGG CGACACCCGC TGGATTCACC GTGCCGCCCA CGATCTGATG
GGCGTCAACC CGACCGCTAC CCGGTCCCCA GCCGGAATGC CCAGGGGCCG AAGCCTTTAC
GGCTCCATAC CGGACCAGTT GGCCGAGGAC ACCAGCTTTC TGCGGCAGCT GCAGGCCATC
TTGAAGGTGC GGTCGCACTA CGGCATCGCC ACCAGCCGCC AGATCGACAT CCCCGAGGTG
TCGCACCGCG GCATGCTTGT GCTGGTGCAC CAGCTGCACG AGGAGGGCCG CTACCAGCTG
ACCGTGCTGA ACTTCGCCAA CGAGGACGTC GCGGGCACGG TGCGCTCGTT GAAGCTTCCT
CCGGGCAGTG AGGTGTCGGA CATGTTCTCG GGCAGGCCCC TCGCCACCGT CGACGACCTG
CACAGCTTCG GCGTCGAACT GACCGCGCAT CAAGGCTTGT CACTGCTCGT CGAGGCCCCG
GTGCCCGACG CCGGCACCGA GGGAGACCAC GTCCAGTAG
 
Protein sequence
MESEPEDQGT DSSESPELTF DEHLHPARPR SLRFRPRVRA PFTRRSLARD GSPTGDNPAY 
VSWLISQSML ADANEISQQF SGQGSMWQNP YATPSPRGAV ETASVWFTAY PLSLITRSNE
SFLKAMADVE MWKAFAEIGI EAVHTGPVKR AGGISGWELT PSVDGHFDRI STEIDPAFGT
EDEFRQMCGT ANWYGGTIID DIVPGHTGKG ADFRLAEMKY ADYPGIYHMV EVDPRDWEHL
PDVPPGADSV NIDPATEQWL DKAGYIIGKL QRVIFYAEGI KETNWSVTRP VVGIDGVERR
WVYLHYFKDG QPSINWLDPS FAGMRLVIGD ALHSLTDLGT GGLRLDANGF LGAEKTAAED
SAAWSEGHPL SEAANHLIAS MVRKVGGFTF QELNLTIDDI REIGEAGADL SYDFITRPAA
HHALATADTE FLRLTLRTTL ELGVDPASLV HALQNHDELT YELVHWSTGH RDDVYTYKGE
EITGEVLGET IRSDLSEKLT GENAPYNLVF TTNGIACTTA TVIAATLGIA ALDDIVDDEQ
IDRIRRAHLL LAMFNALQPG VFALSGWDVC GMLTLPPSQI TELLHGGDTR WIHRAAHDLM
GVNPTATRSP AGMPRGRSLY GSIPDQLAED TSFLRQLQAI LKVRSHYGIA TSRQIDIPEV
SHRGMLVLVH QLHEEGRYQL TVLNFANEDV AGTVRSLKLP PGSEVSDMFS GRPLATVDDL
HSFGVELTAH QGLSLLVEAP VPDAGTEGDH VQ