Gene Mjls_0335 details


Gene Information

Locus tag: Mjls_0335
Symbol:
ID: 4876081
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. JLS
Kingdom: Bacteria
Replicon accession: NC_009077
Strand:
Start bp: 366926
End bp: 369196
Gene Length: 2271 bp
Protein Length: 756 aa
Translation table: 11
GC content: 68%
IMG OID: 640137649
Product: trehalose synthase
Protein accession: YP_001068639
Protein GI: 126432948
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02455] trehalose synthase, Pseudomonas stutzeri type
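
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 366926
end_bp = 369196

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2271 0 756
```

Under those assumptions the computed values agree with the listed Gene Length (2271 bp) and Protein Length (756 aa).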


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.749024
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCTGAGG CGCCAGAGAC GGACTACGAC GACCAGGCCG GGTCGGGCAC CGACGATCAG 
ACGAACGAGC CCAGCGAGAT CACCTACGAC GAGTACCTGC ATCCGGCGCG TCCGCGGGCG
CTGCGGTTCC GTCCCCGCGT CAAATCGCCG TTCACCCGCC GCTCGGTGGC CCAGGACGGT
CCGGCGCGCG CGGACAACCC GGCGTACGTG TCGTGGCTGC TGTCGCAGTC GATGCTCGCC
GACGCCAACG AGATCAGCCA GCAGTTCTCC GGTCAGGGCT CGATGTGGCA GAACCCGTAC
GCGACACCGA GCCCGCGCGG CGCGGTGGAG ACGGCGTCGG TGTGGTTCAC CGCCTACCCG
TTGTCGTTGA TCACCCGGCC CAACGAGTCG TTCCTCAAGG CGCTGGCCGA CGACGAGATG
TGGAAGGCGT TCGCCGACAT CGGTATCGAG GCCATCCACA CCGGACCGGT CAAGCGCGCG
GGCGGGATCT CGGGGTGGCA GACCACCCCG AGCGTCGACG GCCACTTCGA CCGCATCAGC
ACCGAGATCG ACCCGGCCTT CGGCACCGAG GAGGAGTTCC GCAAGATGTG CGGGACGGCC
AACTGGTACG GCGGCACGAT CATCGACGAC ATCGTGCCCG GCCACACCGG TAAGGGCGCC
GACTTCCGCC TCGCGGAGAT GAAGTACGCC GACTATCCCG GGATCTACCA CATGGTGGAG
GTGGATCCCC GCGACTGGGA ACACCTGCCC GCCGTGCCGC CCGGTGTCGA CTCGGTCAAC
ATCGACCAGG CCACCGAGGA GTGGCTGGAC AAGGCCGGTT ACATCATCGG CCGCCTGCAA
CGGGTGATCT TCTACGCCGA GGGCATCAAG GAGACCAACT GGAGCGTCAC CCGGCCGGTG
CTCGGCGTCG ACGGCGTGGA GCGCCGCTGG GTGTACCTGC ACTACTTCAA GGACGGTCAG
CCGTCGATCA ACTGGCTCGA CCCGTCGTTC GCCGGGATGC GCCTCGTGAT CGGGGACGCG
CTGCATTCGC TCGCCGACCT GGGCACCGGC GGACTTCGCT TGGACGCCAA CGGTTTTCTC
GGCGCGGAGA AGACCGCCGC CGAGGACAGC ACCGCCTGGT CGGAGGGTCA TCCGCTGTCC
GAGGCGGCCA ACCATCTGAT CGCCAGCATG GTGCGCAAGG TGGGCGGGTT CACCTTCCAG
GAGCTCAACC TCACCATCGA CGACATCCGC CAGATCGGCG AAGCGGGCGC GGACCTGTCC
TACGACTTCA TCAACCGGCC CGCCTACCAG CACGCCCTGG CCACCGCGGA CACCGAGTTC
CTGCGGCTGA CGCTGCGCAC CACGCTGGAA CTCGGCGTCG ACCCGGCGTC GTTGATCCAC
GCCCTGCAGA ACCACGACGA GTTGACCTAC GAACTCGTGC ACTGGTCGAA CGGACACCGC
GACGACATCT ACACCTACAA GGGTGAGGAG ATCACCGGCG AGGCGCTCGG CGAGACGATC
CGCCGGGACC TCAGCGTACG GCTCACCGGC GAGAACGCCC CGTACAACCT GGTTTTCACC
ACCAACGGGA TCGCCTGCAC CACCGCGACG GTCATCGCGG CCACCCTCGG CATCACCGAT
CTGAACGAGA TCGACGACGA TCCGGTGCGC ATCGACCGCA TCCGGCGAGC CCATCTGCTG
CTGGCGATGT TCAACGCGCT GCAGCCGGGG GTGTTCGCGC TGTCGGGGTG GGACCTGTGC
GGCATGCTCA CGTTGCCGGC CGGGCAGGTC GGCGAACTCC TGCGCGGCGG CGACACCCGG
TGGATCCACC GGGCCGCCCA CGATCTGATG GGCGTGAATC CCGCTGCGAC ACAGTCGTTG
GCGGGGATGC CGCGGGGGCG GAGCCTCTAC GGGTCGATCC CCGATCAGCT CGACGACGAG
ACCAGCTTCC TACGCCAGTT GCAGGCGATC CTGCGGGTGC GTTCGCACTA CGGCATCGCG
ACGAGCCGCC AGGTCGACAT CCCCGAGGTC TCCCACCGCG GCATGCTCGT GCTGGTGCAC
CAACTGGCGG ACGAGGGCCG CTATCAGTTG ACCGTGCTGA ACTTCGCCAA CGAGGAGGTC
GCCGGAACGG TGCGCTCCGA GGCGCTGCCG CCCGGCGCCG AGGTGTCGGA CATGTTCACC
GGCCACGCCT TCGCCACGGT CGACGACCTG CACAGCTTCA CCGTCGAGAT GCCCGCGCAT
CACGGGATGT CCCTGCTGGT CGAGGTCACC GCCGACGACT CCGAGACGTG A
 
Protein sequence
MSEAPETDYD DQAGSGTDDQ TNEPSEITYD EYLHPARPRA LRFRPRVKSP FTRRSVAQDG 
PARADNPAYV SWLLSQSMLA DANEISQQFS GQGSMWQNPY ATPSPRGAVE TASVWFTAYP
LSLITRPNES FLKALADDEM WKAFADIGIE AIHTGPVKRA GGISGWQTTP SVDGHFDRIS
TEIDPAFGTE EEFRKMCGTA NWYGGTIIDD IVPGHTGKGA DFRLAEMKYA DYPGIYHMVE
VDPRDWEHLP AVPPGVDSVN IDQATEEWLD KAGYIIGRLQ RVIFYAEGIK ETNWSVTRPV
LGVDGVERRW VYLHYFKDGQ PSINWLDPSF AGMRLVIGDA LHSLADLGTG GLRLDANGFL
GAEKTAAEDS TAWSEGHPLS EAANHLIASM VRKVGGFTFQ ELNLTIDDIR QIGEAGADLS
YDFINRPAYQ HALATADTEF LRLTLRTTLE LGVDPASLIH ALQNHDELTY ELVHWSNGHR
DDIYTYKGEE ITGEALGETI RRDLSVRLTG ENAPYNLVFT TNGIACTTAT VIAATLGITD
LNEIDDDPVR IDRIRRAHLL LAMFNALQPG VFALSGWDLC GMLTLPAGQV GELLRGGDTR
WIHRAAHDLM GVNPAATQSL AGMPRGRSLY GSIPDQLDDE TSFLRQLQAI LRVRSHYGIA
TSRQVDIPEV SHRGMLVLVH QLADEGRYQL TVLNFANEEV AGTVRSEALP PGAEVSDMFT
GHAFATVDDL HSFTVEMPAH HGMSLLVEVT ADDSET
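
The sequences above are laid out in space-separated groups of 10 residues, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any tool referenced by this record; the sample string is the first line of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGTCTGAGG CGCCAGAGAC GGACTACGAC GACCAGGCCG GGTCGGGCAC CGACGATCAG"
seq = clean_sequence(first_line)
print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_percent` gives a value that can be compared with the GC content field in the record.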