Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_0794 |
Symbol | |
ID | 4614814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 841620 |
End bp | 844619 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639790470 |
Product | PKD domain-containing protein |
Protein accession | YP_936800 |
Protein GI | 119866848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.135837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0870072 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCTA ACGACCGCAA GGTGACCTCC ACTTACGCCA GGTACATCGG GCGGGTCGGT GCGCTTGCGC TCGCGCTCGG TGTCGGGGGA GCCGTGGCAA CGACTCCGGG GGTTGCCCGG GCCGAGGACT CCCCGTCGTC GGTGACCGAT TCGACCGGCG TCTCGAGTCC GTCGGGCGGT GAGCAAGCGC GGACCACCAC GGGCGACGAC ACCACCCCGG ATGACGACCA GTCCGAGATC GTCGACGACG GTGACGCCGA AGACCCCGCC GAGAGCGAGG ACGCCGAAGA AAGCGAAGAC CTCGCCGAGG ACGAAGACCC CGACGAGGGC GAGGACCTCG ACGAGCCGGA CGGCATCGGT GACCCCCTCG AGCCGGCGCA GCCACCGGCA GAGCCCGTCG AGGAACCCGA CGTTCCCGCC GAACCCGCCG TCGAGACGCC GGTCGACACC GGTACGCCCG CGCAGGAAAC CGGAGGACCG GCGGGCGACG ACGCCGGCCA AGCCGCCGAC CCGGCCGACA CCACCCTCAC TGACGACCTA CCCGCGCCGG ACCCGGACGC CGCCGTGCCC GACGTGGACG CGGCGAGTGA CGAGTCGACG AATTCCCGCT CGTACCAAAC ACTGTCGTTG ACCGAAAGCG ACGATTCGGC ACTGGTCACC TCGGCGCGCT CGAAGCTCCC CGGTCTGCCG CCCCTGCGGC TGAACCTGCC GACGCCCGAA CAGTTCATCG CGAGCATTCC GGCGCCGGTC GTCACCTGCC TGTGCGGGGT CATCAACACG GTGACGAACT TCTTCGACAA CGTGCTCAAA CCGATGCTGG GCGGGGGCGC GGGAGCGGCG GGCCCCGGCC CCGCAGCACC AGGCTCCTCG CCCGCGCTGT GGGCGGTCGC GGCCTGGGTC CGCCGGCAGA CGACCCAGGC GATCGACGGC TTCCTCGCGT CGCCGCTGGC CACTCCGATC CGGATGTTCG AAAGAGCCGT CTTCGACTTC GGTGGCTCTC CGCAGGGCCG CGCGCTGAGT GCGGCCGTCG TCCAATTCAT CGGACAATGC GGGCCGAGTG CCGACCTGCC TGCGGAACTC GACCGCACGG TGGTGGTCTC CGGCCTCACC GAGCCGACCG ATTTCAAGAT CCTCAACAAG CACGACAAGG ACGAGGTCGA CCGGATCTTC ATCGCCGAGA AGGGCGGTTC GGTCAAGGTC TACAACCCGG AGACCCGAAC GGTCACCACG TTGACGGTCA TCTCCACGAC CACCGGCGGA GAACGTGGCC TCACCGGCAT CGAGGTCCAT CCTGATTTCT GGCACGAAGA CGAATTCGGA TATCGCTCCA TCTACGTCGC CTACACAGCG GGCGACACCA ATAGGGACAC CCTCTCGCGA CTGATCCTGT CCGACGACAT GACCAGGGTC GAGCGCTCCG AGATCCTCAT CGAATCGACC GAGAACGCCA ACACCTTCCA CCACGGCGGC GATCTGTCCT TCGACAACGA AGGTCAGCAC CTGTACTGGG TGGTCGGCGA CAACACCCAG GGTGTGGTGA ATTCACAGAG CCTGAGCAAC ATTCACGGCA AGGTGCTACG GCTCAACGCC GATGGTTCGG TGCCCGAGGA CAATCCGTTC GTCGACGACG ATCCCGACAC GGCGAGCCCG GCCGACTACA TCTACGCCTA CGGTTTCCGC AATCCCTTCC GGCTGACCTT CACACCGGAC GGGAAGCTGC TCGTCGCCGA CGTGGGCGAG TCCAAATGGG AAGAACTCAA CCTCGTCGTG AAGGGCGGCA ACTACGGCTG GCCGCAGGCC GAGGGCAACT GCACCGGATG TGCGTCGATC AATCCCATTT ACGTGTACGA ACATTCGGCA CCACCCGTCA GCGGCGGCGC GATCACGTCG GTGACGGTCT TCGACGGCGC CGGATTCCCC GAGGAGTACC GCAACCGGGT GTTCATCGCC GATTACAGCC TGGGCTGGAT CCGCGTGCTC GACTTCGACG ACCAGTACAC CAGCCTGATC AGTGCGAAGA CGTTCTGGGG CAACGCCGGT GCGACGGTCA ACCTGGCCCA GGGACCCGAC GACAACCTCT ACCAGCTGAC GATCTATCCC GGTGAGCTGT CGATGATCTC ACCGTCCGGC GGCAACCGGG CGCCCACCGC GGTCATCGAT GCCTCGCAGA CGACGACGGC GGACAAGACG CTGGTCGTCA ACTTCTCCGG CCTGCGCTCG TACGACCCCG ATGACGACGA CACCCTCACC TACCGCTGGG ATTTCGGCGA CGGCAACAGC TCCACGGAGG CGACGCCGGT CAACGAGTTC AAGACCACGG GGTCCTACAG CACCTATACG GTCACCCTGA CGGTCAGCGA CGGTGAGAAG ACGAACACGA CCACGCAGAA GATCACGGTG GGCAGTACGC CACCGGTCGC CGAGATCGTG AGCATCCCGT CGTCATACGA CGCCGGTGAC ACCATCACGT TCACGGGCAG GGGGACGGAC GCTCAGGACC GTCCGGACGG TACGCCGCTC CCCGGAACCG CCTACAAGTG GACGGTCGTG TTCCACCACA ACGAACACAC GCACCCGTTC GCCGACAACC TCGTGGGGGA GACCGCCACC ATCACCATCC CGCGGTCACG CGATCAGATC GACGGCACCT TCTACCGCGT GCACCTGACG GTCACCGACA GCAGCGGTCT GTCGACGACG ACGTACAAGG ACGTCAATCC CAACCTCGTC GAGCTCACGA TCGCCGCGAG CGATCCCGGC GCGCGGTTCA GCATCGACGG GTTGCCGTAC ACGGGCTCGT ACACCGAGCG TGCGGTGGTG GGCGTGGACT ACGTGATCAG CGCCCCGACC ACACAGACGG TCAACGGCAG GCAGCTGACC TTCGACGGGT GGTCCGACGG TGGCGCGGCG ACGCACACGA TCCGGGTTCC GGAACAGGCG ACGACGTACA CCGCGACCTA CACGGCCAGT TCCGGGGCGA ATCTGGCGCA GGTGGTGTGA
|
Protein sequence | MTANDRKVTS TYARYIGRVG ALALALGVGG AVATTPGVAR AEDSPSSVTD STGVSSPSGG EQARTTTGDD TTPDDDQSEI VDDGDAEDPA ESEDAEESED LAEDEDPDEG EDLDEPDGIG DPLEPAQPPA EPVEEPDVPA EPAVETPVDT GTPAQETGGP AGDDAGQAAD PADTTLTDDL PAPDPDAAVP DVDAASDEST NSRSYQTLSL TESDDSALVT SARSKLPGLP PLRLNLPTPE QFIASIPAPV VTCLCGVINT VTNFFDNVLK PMLGGGAGAA GPGPAAPGSS PALWAVAAWV RRQTTQAIDG FLASPLATPI RMFERAVFDF GGSPQGRALS AAVVQFIGQC GPSADLPAEL DRTVVVSGLT EPTDFKILNK HDKDEVDRIF IAEKGGSVKV YNPETRTVTT LTVISTTTGG ERGLTGIEVH PDFWHEDEFG YRSIYVAYTA GDTNRDTLSR LILSDDMTRV ERSEILIEST ENANTFHHGG DLSFDNEGQH LYWVVGDNTQ GVVNSQSLSN IHGKVLRLNA DGSVPEDNPF VDDDPDTASP ADYIYAYGFR NPFRLTFTPD GKLLVADVGE SKWEELNLVV KGGNYGWPQA EGNCTGCASI NPIYVYEHSA PPVSGGAITS VTVFDGAGFP EEYRNRVFIA DYSLGWIRVL DFDDQYTSLI SAKTFWGNAG ATVNLAQGPD DNLYQLTIYP GELSMISPSG GNRAPTAVID ASQTTTADKT LVVNFSGLRS YDPDDDDTLT YRWDFGDGNS STEATPVNEF KTTGSYSTYT VTLTVSDGEK TNTTTQKITV GSTPPVAEIV SIPSSYDAGD TITFTGRGTD AQDRPDGTPL PGTAYKWTVV FHHNEHTHPF ADNLVGETAT ITIPRSRDQI DGTFYRVHLT VTDSSGLSTT TYKDVNPNLV ELTIAASDPG ARFSIDGLPY TGSYTERAVV GVDYVISAPT TQTVNGRQLT FDGWSDGGAA THTIRVPEQA TTYTATYTAS SGANLAQVV
|
| |