Gene Mjls_3497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMjls_3497 
Symbol 
ID4879208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. JLS 
KingdomBacteria 
Replicon accessionNC_009077 
Strand
Start bp3688210 
End bp3690243 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content68% 
IMG OID640140801 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_001071765 
Protein GI126436074 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.83213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.993865 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCGG GCATGGTTCT GGAACACACC GAGCCCACCG ACGGAGCCGC GACGATCGGG 
CAGCCGGCCT ATCTGCCCGA CACTCCGTTG ACGGTGACGG CGCCGGTCCC CTACGCGCCG
ACCGGTGGGC TGCGGAACCC GTTCCCGCCC ATCGCCGACT ACGGCTTCCT GTCCGACTGC
GAGAACACGT GCCTGATCTC CTCGGCCGGC TCCGTCGAGT GGCTGTGCGT GCCCCGCCCG
GATTCGCCGA GCGTGTTCGG CGCGATCCTC GACCGCGGTG CAGGTCACTT CCGGCTCGGC
CCGTACGGCG TGACGGTGCC CGCGGCGCGG CGTTACCTGC CGGGCAGCCT GATCCTCGAG
ACCACGTGGC AGACCCACAC CGGCTGGCTG ATCGTGCGCG ACACCCTGGT GATGGGTCCC
TGGCACGACC TCGAGGCGCG GTCGCGCACC CACCGGCGCA CGCCGATGGA CTGGGATGCC
GAGCACATCC TGTTGCGCAC CGTGCGGTGT GTCAGTGGCA CCGTCGAGCT GGTGATGAAC
TGTGAGCCGT CGTTCGACTA CCACCGGGTG AGCGCCGAGT GGGAGTACTC CGGCCCGGCC
TACGGTGAGG CGATCGCGCG CGCCAACCGC AACGCCGACT CCCATCCGAC GCTGCGGCTC
ACCACGAACC TGCGGATCGG GTTGGAGGGC CGCGAGGCCA GGGCCCGCAC CCGGCTCAAG
GAGGGCGACA ACGTCTTCGT GGCGCTGTCC TGGTCGAAGC ATCCGGCGCC GCAGAACTAC
GAAGAGGCCG CCGACAAGAT GTGGCAGACC AGCGAGGCGT GGCGGCAGTG GATCAACGTC
GGCGACTTCC CCGACCACCC GTGGCGGGCG TACCTGCAGC GCAGCGCGCT CACACTCAAG
GGCCTGACCT ACTCCCCGAC CGGCGCGCTG TTGGCCGCGA GCACCACGTC GTTGCCGGAA
ACACCTCAGG GCGAACGCAA TTGGGACTAC CGCTACGCGT GGGTGCGGGA TTCGACGTTC
GCACTGTGGG GTCTCTACAC GCTGGGCCTG GACCGCGAGG ACGACGACTT CTTCGCGTTC
ATCGCCGACG TGTCCGGCGC CAACAACGGG GAGCGCCACC CGCTGCAGGT GATGTACGGC
GTCGGGGGTG AGCGCAGCCT GGTCGAGGAG GAACTGCACC ACCTGTCGGG GTACGACGGC
GCCCGCCCGG TGCGGATCGG CAACGGTGCC TACAACCAGA TGCAGCACGA CATCTGGGGC
ACCATGCTCG ATTCGGTCTA CCTGCACACC AAGTCGCGTG AGCAGATCCC CGAGGCGTTG
TGGCCGGTGC TCAAGCACCA GGTCGAGGAG GCCATCAAGC ACTGGAAGGA ACCCGACCGC
GGCATCTGGG AGGTCCGCGG CGAACCGCAG CACTTCACCA GTTCGAAGGT GATGTGCTGG
GTGGCGCTCG ACCGTGGCGC GAAGCTCGCC GAACTCGAGG GCGAGAAGAG CTACGCCCAG
GAGTGGCGCA CCATCGCCGA GCAGATCAAG GCCGACATCC TCGCCAACGG CGTCGACTCG
CGGGGCGTGT TCACCCAGCG TTACGGCGAC GACGCGCTGG ACGCCTCCCT GCTGCTGGTG
CCGCTGGTCC GGTTCCTGCC GCCGGACGAC CCGCGGGTGC GGGCCACGGT GCTGGCGATC
GCCGACGAGC TGACCGAGGA GGGTCTGGTC CTGCGCTACC GCGTCGAGGA GACCGACGAC
GGGTTGGCCG GCGAGGAGGG CACGTTCACG ATCTGCTCGT TCTGGCTGGT GTCGGCGCTC
GTGGAGATCG GTGAGATCAG CCGTGCCAAG CACCTGTGTG AACGGTTGTT GTCGTTCGCC
AGTCCGCTGC ACCTCTACGC CGAGGAAATC GAACCCCGCA CCGGCCGCCA TCTGGGCAAC
TTCCCGCAGG CGTTCACCCA CCTGGCCTTG ATCAACGCGG TCGTGCACGT CATCCGCGCC
GAGGAGGAAG CCGACAGCTC GGGGGTCTTC GTCCCGGCCA ACGCGCCGTC GTAA
 
Protein sequence
MMAGMVLEHT EPTDGAATIG QPAYLPDTPL TVTAPVPYAP TGGLRNPFPP IADYGFLSDC 
ENTCLISSAG SVEWLCVPRP DSPSVFGAIL DRGAGHFRLG PYGVTVPAAR RYLPGSLILE
TTWQTHTGWL IVRDTLVMGP WHDLEARSRT HRRTPMDWDA EHILLRTVRC VSGTVELVMN
CEPSFDYHRV SAEWEYSGPA YGEAIARANR NADSHPTLRL TTNLRIGLEG REARARTRLK
EGDNVFVALS WSKHPAPQNY EEAADKMWQT SEAWRQWINV GDFPDHPWRA YLQRSALTLK
GLTYSPTGAL LAASTTSLPE TPQGERNWDY RYAWVRDSTF ALWGLYTLGL DREDDDFFAF
IADVSGANNG ERHPLQVMYG VGGERSLVEE ELHHLSGYDG ARPVRIGNGA YNQMQHDIWG
TMLDSVYLHT KSREQIPEAL WPVLKHQVEE AIKHWKEPDR GIWEVRGEPQ HFTSSKVMCW
VALDRGAKLA ELEGEKSYAQ EWRTIAEQIK ADILANGVDS RGVFTQRYGD DALDASLLLV
PLVRFLPPDD PRVRATVLAI ADELTEEGLV LRYRVEETDD GLAGEEGTFT ICSFWLVSAL
VEIGEISRAK HLCERLLSFA SPLHLYAEEI EPRTGRHLGN FPQAFTHLAL INAVVHVIRA
EEEADSSGVF VPANAPS