Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mjls_0775 |
Symbol | |
ID | 4876518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. JLS |
Kingdom | Bacteria |
Replicon accession | NC_009077 |
Strand | + |
Start bp | 821197 |
End bp | 824196 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640138088 |
Product | PKD domain-containing protein |
Protein accession | YP_001069076 |
Protein GI | 126433385 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.482876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.807579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCTA ACGACCGCAA GGTGACCTCC ACTTACGCCA GGTACATCGG GCGGGTCGGT GCGCTTGCGC TCGCGCTCGG TGTCGGGGGA GCCGTGGCAA CGACTCCGGG GGTTGCCCGG GCCGAGGACT CCCCGTCGTC GGTGACCGAT TCGACCGGCG TCTCGAGTCC GTCGGGCGGT GAGCAAGCGC GGACCACCAC GGGCGACGAC ACCACCCCGG ATGACGACCA GTCCGAGATC GTCGACGACG GTGACGCCGA AGACCCCGCC GAGAGCGAGG ACGCCGAAGA AAGCGAAGAC CTCGCCGAGG ACGAAGACCC CGACGAGGGC GAGGACCTCG ACGAGCCGGA CGGCATCGGT GACCCCCTCG AGCCGGCGCA GCCACCGGCA GAGCCCGTCG AGGAACCCGA CGTTCCCGCC GAACCCGCCG TCGAGACGCC GGTCGACACC GGTACGCCCG CGCAGGAAAC CGGAGGACCG GCGGGCGACG ACGCCGGCCA AGCCGCCGAC CCGGCCGACG CCACCGTCAT CGATGACGCA CCGGCGCCGG ACCCGGACGC CGCCGTGCCC GACGTGGACG CGGCGAGTGA CGAGTCGACG AGTTCCCGCT CGTACCAAAC ACTGTCGTTG ACCGAAAGCG ACGATTCGGC ACTGGTCACC TCGGCGCGCA CGAAGCTCAC CGGTCTGCCG CCCCTGCGGC TGAATCTGCC GACGCCCGAA GAGTTCATCG CGAGCATTCC GGCGCCGGTG GTCACCTGCC TGTGCGGGGT CATCAACACG GTGACGAAGT TCTTCGACAA CGTGCTCAAA CCGATGCTGG GCGGGGGCGC GGGAGCGGCA GGTCCCGGTC CCGCAGCACC AGGCTCCTCG CCTGCGCTGT GGGCGGTCGC GGCATGGGTC CGGCGACAGG CGACCCAGGC GATCGACGGC TTCCTCGCGT CGCCGCTGGC CACTCCGATC CGGATGTCCG AAAGAGCCGT CTTCGACTTC GGTGGCTCTC CGCAGGGCCG CGCGCTGAGT GCGGCCGTCG TCCAATTCAT CGGACAATGC GGGCCGAGTG CCGACCTGCC TGCGGAACTC GACCGCACGG TGGTGGTCTC CGGCCTCACC GAGCCGACCG ATTTCAAGAT CCTCAACAAG CACGACAAGG ACGAGGTCGA CCGGATCTTC ATCGCCGAGA AGGGCGGTTC GGTCAAGGTC TACAACCCGG AGACCCGAAC GGTCACCACG TTGACGGTCA TCTCCACGAC CACCGGCGGA GAACGGGGCC TCACCGGCAT CGAGGTCCAT CCGGATTTCT GGCACGAAGA CGAATTCGGA TACCGCTCCA TCTACGTCGC CTACACAGCG GGCGACACCA ATCGGGACAC CCTCGCGCGA CTGATCCTGT CCGACGACAT GACCAGGGTC GAGCGCTCCG AGATCCTCAT CGAATCGACC GAGAACGCCA ACACCTTCCA CCACGGCGGC GATCTGTCCT TCGACAACGA AGGTCAGCAC CTGTACTGGG TGGTCGGCGA CAACACCCAG GGTGTGGTGA ATTCACAGAG CCTGAGCAAC ATTCACGGCA AGGTGCTACG GCTCAACGCC GATGGTTCGG TGCCCGAGGA CAATCCGTTC GTCGACGACG ATCCCGACAC GGCGAGCCCG GCCGACTACA TCTACGCCTA CGGTTTCCGC AATCCCTTCC GGCTGACCTT CACACCGGAC GGGAAGCTGC TCGTCGCCGA CGTGGGCGAG TCCAAATGGG AAGAACTCAA CCTCGTCGTA AAGGGCGGCA ACTACGGCTG GCCGCAGGCC GAGGGCAGCT GCACCGGATG TGCGTCGATC AATCCCATTT ACGTGTACGA ACATTCGGCA CCACCCGTCA GCGGCGGCGC GATCACGTCG GTGACGGTCT TCGACGGCGC CGGATTCCCC GAGGAGTACC GCAACCGGGT GTTCATCGCC GATTACAGCC TGGGCTGGAT CCGCGTGCTC GACTTCGACG AACAGTACAC CAGCCTGATC AGTGCGAAGA CGTTCTGGGG CAACGCCGGT GCGACGGTCA ACCTGGCCCA GGGACCCGAC GACAACCTCT ACCAGCTGAC GATCTATCCC GGTGAGCTGT CGATGATCTC GCCGTCCGGC GGCAACCGGG CGCCCACCGC GGTCATCGAT GCCTCGCAGA CGACGACGGC GGACAAGACG CTGGTCGTCA ACTTCTCCGG CCTGCGCTCG TACGACCCCG ATGACGACGA CACCCTCACC TACCGCTGGG ATTTCGGTGA CGGCAGCAGC TCCACGGAGG CGACGCCGGT CAACGAGTTC AAGGCCACGG GGTCCTACAG CACCTATACG GTCACCCTGA CGGTCAGCGA CGGTGAGAAG ACGAACACGA CCACGCAGAA GATCACGGTG GGCAGTACGC CACCGGTCGC CGAGATCGTG AGCATCCCGT CGTCGTACGA CGCCGGTGAC ACCATCACGT TCACGGGCAG GGGGACGGAC GCTCAGGACC GTCCGGACGG TACGCCGCTT CCCGGAACCG CCTACAAGTG GACGGTGGTG TTCCACCACA ACGAACACAC GCACCCGTTC GCCGACAACC TCGTCGGGGA GACCGCCACC ATCACCATCC CGCGGTCACG CGATCAGATC GACGGCACCT TCTACCGCGT GCACCTGACG GTCACCGACA GCAGCGGTCT GTCGAGGACG ACGTACAAGG ACGTCAATCC CAACCTCGTC GAGCTCACGA TCGCCGCGAG CGATCCCGGC GCGCGGTTCA GCATCGACGG GTTGCCGTAC ACGGGCTCGT ACACCGAGCG TGCGGTGGTG GGCGTGGACT ACGTGATCAG CGCCCCGACC ACACAGACGG TCAACGGCAG GCAGCTGACC TTCGACGGGT GGTCCGACGG TGGCGCGGCG ACGCACACGA TCCGGGTTCC GGAACAGGCG ACGACGTACA CCGCGACCTA CACGGCCAGT TCCGGGGCGA ATCTGGCGCA GGTGGTGTGA
|
Protein sequence | MTANDRKVTS TYARYIGRVG ALALALGVGG AVATTPGVAR AEDSPSSVTD STGVSSPSGG EQARTTTGDD TTPDDDQSEI VDDGDAEDPA ESEDAEESED LAEDEDPDEG EDLDEPDGIG DPLEPAQPPA EPVEEPDVPA EPAVETPVDT GTPAQETGGP AGDDAGQAAD PADATVIDDA PAPDPDAAVP DVDAASDEST SSRSYQTLSL TESDDSALVT SARTKLTGLP PLRLNLPTPE EFIASIPAPV VTCLCGVINT VTKFFDNVLK PMLGGGAGAA GPGPAAPGSS PALWAVAAWV RRQATQAIDG FLASPLATPI RMSERAVFDF GGSPQGRALS AAVVQFIGQC GPSADLPAEL DRTVVVSGLT EPTDFKILNK HDKDEVDRIF IAEKGGSVKV YNPETRTVTT LTVISTTTGG ERGLTGIEVH PDFWHEDEFG YRSIYVAYTA GDTNRDTLAR LILSDDMTRV ERSEILIEST ENANTFHHGG DLSFDNEGQH LYWVVGDNTQ GVVNSQSLSN IHGKVLRLNA DGSVPEDNPF VDDDPDTASP ADYIYAYGFR NPFRLTFTPD GKLLVADVGE SKWEELNLVV KGGNYGWPQA EGSCTGCASI NPIYVYEHSA PPVSGGAITS VTVFDGAGFP EEYRNRVFIA DYSLGWIRVL DFDEQYTSLI SAKTFWGNAG ATVNLAQGPD DNLYQLTIYP GELSMISPSG GNRAPTAVID ASQTTTADKT LVVNFSGLRS YDPDDDDTLT YRWDFGDGSS STEATPVNEF KATGSYSTYT VTLTVSDGEK TNTTTQKITV GSTPPVAEIV SIPSSYDAGD TITFTGRGTD AQDRPDGTPL PGTAYKWTVV FHHNEHTHPF ADNLVGETAT ITIPRSRDQI DGTFYRVHLT VTDSSGLSRT TYKDVNPNLV ELTIAASDPG ARFSIDGLPY TGSYTERAVV GVDYVISAPT TQTVNGRQLT FDGWSDGGAA THTIRVPEQA TTYTATYTAS SGANLAQVV
|
| |