Gene Mjls_5501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_5501 
Symbol 
ID4881198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp5758686 
End bp5760476 
Gene Length1791 bp 
Protein Length596 aa 
Translation table11 
GC content65% 
IMG OID640142818 
Producttrehalose synthase 
Protein accessionYP_001073755 
Protein GI126438064 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID[TIGR02456] trehalose synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0931844 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCACA GCAGCGGCTC ACCCGCCCAC CCCGATCACG ATCCGGCCGA GGGCAGCCAC 
ATCGAGGACG GGGTGGTCGA ACATCCGACC GCAGGGGACT TCGGCCACGC GCGGATGGTC
CCCGAGGACC GGACGTGGTT CAAGCGGGCC GTGTTCTACG AGGTGCTCGT GCGTGCGTTC
CACGATTCGG ACGCCGACGG TTCCGGTGAC CTGCGCGGGC TGACCGAACG CCTGGACTAC
CTGCAGTGGC TCGGCGTCGA CTGTCTGTGG CTGCCGCCGT TCTACGATTC ACCGCTGCGC
GACGGTGGAT ACGACATCCG CGACTTCTAC AAGGTGCTGC CCGAATTCGG CACCGTCGAG
GACTTCGTCA CGCTGCTCGA CGCCGCCCAC CGCCGCGGCA TCCGGGTCAT CACCGACCTG
GTGATGAACC ACACCTCGGA CTCCCACCCG TGGTTCCAGG AGTCGCGCCG CGACCCGGAC
GGACCCTACG GCGACTTCTA CGTCTGGAGC GACACCAGCG ACAGGTACGC CGACGCGCGG
ATCATCTTCG TCGACACCGA GGAGTCCAAC TGGACCTTCG ACCCGGTGCG GCGGCAGTTC
TATTGGCACC GCTTCTTCTC CCACCAGCCG GATCTGAACT ACGACAACCC GGCCGTGCAG
GAGGCGATGC TCGACGTGCT GCGCTTCTGG CTCGACCTCG GCATCGACGG GTTCCGGCTC
GACGCCGTGC CGTACCTGTT CGAACGCGAG GGCACCAACT GCGAGAACCT GCCGGAGACC
CATGCGTTCC TGCGGCACTG CCGCAAGGTG ATCGACGACG AATATCCGGG CCGGGTGCTG
CTGGCCGAGG CCAACCAGTG GCCGGCCGAC GTGGTCGCGT ACTTCGGTGA CCCGGACACC
GGCGGCGACG AGTGCCATAT GGCGTTCCAT TTCCCGCTGA TGCCAAGGAT TTTCATGGCC
GTCCGGCGCG AGTCGCGGTT CCCGATCTCC GAGATCCTCG CGCAGACACC GGAGATCCCG
GATATGGCGC AGTGGGGGAT CTTCCTGCGC AACCACGACG AGTTGACCCT CGAGATGGTC
ACCGACGAAG AACGTGACTA CATGTACTCC GAATACGCCA AAGACCCACG GATGAAAGCG
AATGTCGGCA TCCGGCGGCG TCTGGCACCA CTACTGGAGA ACGACCGCAA TCAGATCGAA
TTGTTCACCG CGCTGCTTCT CTCACTCCCC GGGTCACCGG TGCTGTATTA CGGCGACGAG
ATCGGCATGG GCGACATCAT CTGGCTCGGT GACCGCGACG GTGTCCGCAC CCCGATGCAG
TGGACGCCGG ACCGCAACGC GGGCTTCTCG AAGGCCACGC CCGGCCGCCT GTACCTGCCG
CCCAACCAGG ACGCCATCTA CGGTTACCAA GCGGTGAATG TCGAAGCGCA GCGGGACAGT
TCGAATTCGC TGCTGAACTG GACGAAGACC ATGCTCGGGG TGCGCAGACG CCACGACGCG
TTCGCGATCG GCATGTTCCG CGAACTCGGC GGGTCGAACC CGTCGGTGCT GGCGTTCGTG
CGTGAGACCG CCACCGACAC GGTGCTCTGC GTCAACAACC TGTCCCGCTT CCCGCAGCCC
ATCGAACTGA ATCTGCAGCA GTGGAACGGT TTCACGCCGG TCGAGATGAC CGGCTACGTC
GACTTCCCGA GTATCGGGGC GCTGCCCTAC TTGCTGACCC TGCCCGGCCA CGGGTTCTAC
TGGTTCCAGC TACGCGCCCC CGACCCCGAA CCCGAAGGAG TGCAGCCATG A
 
Protein sequence
MDHSSGSPAH PDHDPAEGSH IEDGVVEHPT AGDFGHARMV PEDRTWFKRA VFYEVLVRAF 
HDSDADGSGD LRGLTERLDY LQWLGVDCLW LPPFYDSPLR DGGYDIRDFY KVLPEFGTVE
DFVTLLDAAH RRGIRVITDL VMNHTSDSHP WFQESRRDPD GPYGDFYVWS DTSDRYADAR
IIFVDTEESN WTFDPVRRQF YWHRFFSHQP DLNYDNPAVQ EAMLDVLRFW LDLGIDGFRL
DAVPYLFERE GTNCENLPET HAFLRHCRKV IDDEYPGRVL LAEANQWPAD VVAYFGDPDT
GGDECHMAFH FPLMPRIFMA VRRESRFPIS EILAQTPEIP DMAQWGIFLR NHDELTLEMV
TDEERDYMYS EYAKDPRMKA NVGIRRRLAP LLENDRNQIE LFTALLLSLP GSPVLYYGDE
IGMGDIIWLG DRDGVRTPMQ WTPDRNAGFS KATPGRLYLP PNQDAIYGYQ AVNVEAQRDS
SNSLLNWTKT MLGVRRRHDA FAIGMFRELG GSNPSVLAFV RETATDTVLC VNNLSRFPQP
IELNLQQWNG FTPVEMTGYV DFPSIGALPY LLTLPGHGFY WFQLRAPDPE PEGVQP