Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_2913 |
Symbol | |
ID | 4610742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 3053747 |
End bp | 3055906 |
Gene Length | 2160 bp |
Protein Length | 719 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639792578 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_938897 |
Protein GI | 119868945 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC TGTCCCCGTC ACTGGTCGAA CTCGCCGCCC GCCACGGCGT CGCCACCCGC TACGAGGACT GGTCCGGCAC CCAGGTCGCC GTGCCCGAGT CGACGCTGAT CGCGGTGCTC GCCGCACTGG GCGTCGCCGC AGCGGACGAC GAGCAACGCG CCGCCGCGCT CATACAGCAC GACCGCAATT ACTGGCAGCG GTCACTGCCG CCCACGATCC TGGCCCGCAC CGGAAGCACG TCGTCGTTCT GGGTGCACGT CACCCACGGT GACCCGGTCG AGGTCTGGGT GGCCCTGGAA GACGGGTCGA TCCGCCAGAG CCTGCACCAA TTACGCAACG ACACACCGCC TTTCGACCTC GACGGCCGAC TGGTCGGGGA GGCCAGCTTC GAGCTGCCCG CCGATCTGCC GCTGGGTTAC CACCGGGTGC ACATGCGGAC CGGAACGGTC GAGGCCGATG CACCGCTGAT CGTCTCCCCC GCCACGCTGC CCGTGCCGCC GCGGCTCGGG GCCCGGCGCA GCTGGGGGTT GGCGACCCAG CTCTACAGCG TTCGCTCCCA CCGCTCCTGG GGTGTCGGCG ACCTGACGGA TCTGACGGAC CTGGCGGTGT GGTCGGCGAC GCGGCACGGC GCCGGGTTCA TCCTGGTCAA CCCCCTGCAC GCGGCCGCGC CTGCCGCGCC GATGGAGCCG TCGCCGTACC TGCCCACCTC GCGACGGTTC GTCAACCCGC TGTACCTACG GCCGGAGGCG ATCCCCGAGT ACGCCGAACT CCGCCATCGG GGTCGGTTGC GCAGGTCGCG GACCGAGGTT CAGGCCCGTG CCCGGCGCAG GGAGCTCATC GATCGCGATT CCGCCTGGCG GGCCAAACGG GCGGCCCTGG CGACCATCTA CCGTGTCGAA CGTTCGGCCG GCCGTGAGCT CGCCTACGCC GGGTACCGGG CGCGGGAGGG CCGCGCGCTC GACGATTTCG CCGTGTGGTG CGCGTTGGCC GAGAAGTACG GCAACGACTG GCACGGCTGG CCCGCCGAAC TGCAGCATCC CCGCAACGTG GCGGTGGCCG CGTTCGCCGC CGAGCACGCC GACGAGGTGG ACTTCCACCG GTGGCTGCAG TGGCAGCTCG ACGAGCAGCT CACCGCCGCG CACGCCACGG CGATCGGCGC GGGGATGGAC ATCGGGGTCA TGCACGACCT CGCGGTCGGG GTGAATCCCG ACGGGGCCGA CGCGTGGGCA CTGCAGGAGG TCCTGGCCCT CGGCGTCACC GCGGGCGCGC CGCCCGACGA GTTCAACCAG CTCGGCCAGG ACTGGTCGCA GCCGCCGTGG CGACCCGACC AGCTCGTCGA GCAGGCCTAC GAACCGTTCC GGGCGTTGGT CAACGGGGTG CTGCGGCACG CGGGCGGTGT CCGGATCGAC CACATCATCG GACTGTTCCG GCTGTGGTGG ATCCCCCGGG GCGCCGCACC GACCGAGGGC ACCTATGTCC GCTACGACCA CGAGGCGATG ATCGGCATCG TCGCGCTCGA GGCGCACCGT ACGGGGGCGG TCGTGGTCGG TGAGGATCTC GGCACCGTCG AACCCTGGGT GCGCGACTAC CTGCACGACC GCGGACTGTT CGGCACCTCG ATCCTGTGGT TCGAACAGGA CCGCGACGGC CAGGACGCCA CAGGTGGTCC GCTCCCCGCC GAGCGGTGGC GGGAGTACTG CCTCTCGGCG GTGACCACCC ATGATCTGCC GCCGACGGCC GGTTACCTCG CCGGTGAACA CGTCCGGTTG CGCGACGAAC TCGGACTGCT TACCCGCCCG GCGGCCGAGG AACTCGCCGA CGACCGGGCG GCGCAGGCCG CGTGGCTGGC CGAACTGCGC CGGGTCGGCC TGTTGGCCCA CCCGGACAAC GGCGAGGAAC CGGAGTCCGA TGCCGTGATC CTCGCCCTGC ACCGGTATCT GGGCCGGACC CCGTCGAAGC TGCTGTCGCT GTCCCTGGCC GACGCCGTCG GCGATCTGAA GACCCAGAAC CAGCCGGGAA CGAGTGACGA GTACCCCAAC TGGCGGGTGC CGCTGCGGGG TCCGGACGGG CGCCAGCGCC TGGTCGAGGA CGTGTTCACC GACCCGCGGG CCGCGGCGCT CGGCGCCGTG ATGGGTTCGT TGGTGCACCC GGTCCTGTGA
|
Protein sequence | MTELSPSLVE LAARHGVATR YEDWSGTQVA VPESTLIAVL AALGVAAADD EQRAAALIQH DRNYWQRSLP PTILARTGST SSFWVHVTHG DPVEVWVALE DGSIRQSLHQ LRNDTPPFDL DGRLVGEASF ELPADLPLGY HRVHMRTGTV EADAPLIVSP ATLPVPPRLG ARRSWGLATQ LYSVRSHRSW GVGDLTDLTD LAVWSATRHG AGFILVNPLH AAAPAAPMEP SPYLPTSRRF VNPLYLRPEA IPEYAELRHR GRLRRSRTEV QARARRRELI DRDSAWRAKR AALATIYRVE RSAGRELAYA GYRAREGRAL DDFAVWCALA EKYGNDWHGW PAELQHPRNV AVAAFAAEHA DEVDFHRWLQ WQLDEQLTAA HATAIGAGMD IGVMHDLAVG VNPDGADAWA LQEVLALGVT AGAPPDEFNQ LGQDWSQPPW RPDQLVEQAY EPFRALVNGV LRHAGGVRID HIIGLFRLWW IPRGAAPTEG TYVRYDHEAM IGIVALEAHR TGAVVVGEDL GTVEPWVRDY LHDRGLFGTS ILWFEQDRDG QDATGGPLPA ERWREYCLSA VTTHDLPPTA GYLAGEHVRL RDELGLLTRP AAEELADDRA AQAAWLAELR RVGLLAHPDN GEEPESDAVI LALHRYLGRT PSKLLSLSLA DAVGDLKTQN QPGTSDEYPN WRVPLRGPDG RQRLVEDVFT DPRAAALGAV MGSLVHPVL
|
| |