Gene Information |
Locus tag | Mkms_5501 |
Symbol | |
ID | 4610322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008703 |
Strand | + |
Start bp | 3679 |
End bp | 6267 |
Gene Length | 2589 bp |
Protein Length | 862 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639789166 |
Product | hypothetical protein |
Protein accession | YP_935501 |
Protein GI | 119854896 |
COG category | |
COG ID | |
TIGRFAM ID | |
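The length fields in this table can be cross-checked against the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that Gene Length counts the terminating stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3679
END_BP = 6267
GENE_LENGTH_BP = 2589
PROTEIN_LENGTH_AA = 862


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))            # 2589
print(expected_protein_length(GENE_LENGTH_BP))  # 862
```

Under these assumptions both results agree with the Gene Length and Protein Length fields listed above.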
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.432777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0032369 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGTCTGTGC CCGCCCCGCT GCGCGCGATC GGCAACCTGC GGCTGACCCC GCACGGGGTC TACGCCGACT ATCTGCTCTC CGGACAGCCG TTCATCTTCC TCTCCGAAGA GTGGCAGGAC CGGGTGGCTG CCGAGCACGC CGAGCTGTGG CGTGCGCTGC CGTCCGGATC GTCGATCAGT GGCCTGACGG TGCCGGTGGC CCCGCGCGCC ACCGTCCGCA AAATGCTGTA CAGCCACCCC GATTTGCGCC CCGGCGCGGC CGTGCCCGAG GGCGTGTCCG AGGCGGCGCG GCCCTGGGTG CAGCACTGCC GCAGCTGGGA ACCCACCATC GCCGGGCACC GCGCTCGCCG CCGTATCTAC TGGCTGAGCC TGCCCCTGGA TTACGGGCTG GCCGGGCGCA CCCCCAGCGG CACGTGGCGG CGCATGGTCG ACGCCGCACT CGGCCGTGAC AAGGACACCG ACTCCTCCAT CGCCTACTAC CGTGACCTGG CCGCCCAGAT GGTCGCCGCG CTGCCTGCGG TGTTCTTTCC CAAACCGGCC ACGGTGGAAC AGATCTGGTG GCATTGGAAC TACATCGCCA GCCGCGGCGC CTGGGATGCG CCGCTGCCGA CACAACCGTT CAATCCCGAC GCCACCCTGC CGGGGTCGGC GTTCACCCCG GTCCATCTGG ACCCGGGTGC GGCTCAGCTG CGCGACCGGC GCTGGCGGGC AGCCCGCACC GACGCCGACG TGTTCGTACG TACCTTCCGG GATCGCACCG ACGGTGTTGC GGACTCCTAT CAGGCGCTGC TTCCCTTAGA CAGCTTCCCG GACAACGGCA TTGCCTGGCC GCGATCGACA CTGTTCAAGG TTCTCGATGA CCTCACGACA CCGACCACGG TCTTGGACTG GACGATCAAC ATCACTTTCA CCAGCGCGGA CGTGGCCGTG TCGACCGCAG AGAACGTCAT CGTCAACATT CGCGACCAGT ACCGCCAGCG CGGCCGCCAC GCATCGAGTT CCGACGAGCT GCTGCGCAAG CTGGCCTCCG GGCGGGAGCT GGCCTCAGAA CTCAAACGCG GTAGCGCCGA GCGTGGTGTG AACGCCGCTA TCGTCATCGC CGCGGCCGCC GGTGACCCGG ATACGGTGAA CCGGGCCGTG GCCGACGTCG CCCGCACCTA CCGCGGCCAG AACATCGGCT CGAAACGGTG GCGCGGCAGC CAGCCCACAT TGTGGCGGGC GTTCGCCCCG GGCGGGGAGC GCCGCGCCGC CCTCGACGAG TTCCGCAACC CGACCACCAC CAAGCGGTTT GCGCCGTTCG TCCCGCTGCT GGCGAGCAAG CTCGGAAACA ACACCGGCGT CCCGTTGGGG ATGAACCTGA CCAGCCCGGG GCTGCGCGAC GTCGTTCTCC TCGATGTCCT CAACGCGCCG GCCCGCGAGA ATCCGGCGAA TCTGGTGATC TGCGGCTCCC CGGGCCGCGG CAAATCGCAC GCGACGAAGA ATTTGAGCCG CTCGTGGCTC AAACTCGGCG CCGGTCTGCA TTTATTCGAT CCCACCGACG CCCGCGAACA CGAAACAGCG TTGGCCGATT TCGACGATAA AGTCGTCATT GATGTCAGTC GCATGAATTT CAGTCTCGAT GGATTGCGGG TTTTTCCTTA TAAAGAAGCC GCAGAACGAA CCATCGACCA TTTGCTACCG CAATTGGGAT TGTCGCCATT GAGCCGGGGC GCTCAGCGGC TGTGGGGGCT GCTGGCCCCG GAGTCACGCG AGGCCAACGG CATCGGCAGC ACCGCGCAGC TGATCAGATA TCTCCGCGAC 
ATGCCCACAG CGCGGCGCAC CGACGCCGAC GAAGATCTGC TCATCGGGTT GGAGGGCCTG GCCGCCCAAC GCCTGCTGCG GCCACTGTTT GATGAGTCTC TGCCCGTTCC CGACATCGCC ACCACCCAAT GCGTGATCTG GAATTTCGCC GGACTCAAGC TGCCCACGGT CACCGAGGAA TACCAGGCCC ACCTGCATCA GCAGACCACC CCGGGCCAGC GCGCCGCCCA AGCGCTCTAC GGGCTGGGCG CCGAAGTGGC GCAGTCGATC TTTTTCGGCC GCCCCGATCA GCCCGACATG CTGGTCGTCG AGGAGTGCGC AGCGTGGACC AACTCTCCGG GCGGGCAGAA GTGCGCGAAC ACGATCATCC GCCAGGGCCG TAAGGCCTGG ACGGGGTTCT GCGGTATCAG CCAGCAGCCG ATCAAAGACT TCGCCGTGCT GGAGGACGAG TTCATCGATC AGCGACTGTG CTTGGGGTTC AAGCGATCTG ACATCGCCAA AGCAACCTTG CAGTGGTGTG ACCGCGACCT GGACCGCCAC CCGGAGCTGC TGGCCAACTA CGTCAACAAC ACCAGCCCCG TGCAGCTGGT CGACCACGGC GACGATGCGA TCGATGACCG CTACGGAAAG GTGATCCCCG GCCGCGAGGG CGAAGCGTGG TTCCTCGACG AGTTCGGTGG CTTCGGCAAG GTGGCGCTGT TTGCAGCCCC GACCGCAGCA CTGGCCGCCC GCTTCGACAC CAACCCCCAC CGAGCTCGGC AGCGCAGCCA GGCCACGCAG CGATCATGA
|
Protein sequence | MSVPAPLRAI GNLRLTPHGV YADYLLSGQP FIFLSEEWQD RVAAEHAELW RALPSGSSIS GLTVPVAPRA TVRKMLYSHP DLRPGAAVPE GVSEAARPWV QHCRSWEPTI AGHRARRRIY WLSLPLDYGL AGRTPSGTWR RMVDAALGRD KDTDSSIAYY RDLAAQMVAA LPAVFFPKPA TVEQIWWHWN YIASRGAWDA PLPTQPFNPD ATLPGSAFTP VHLDPGAAQL RDRRWRAART DADVFVRTFR DRTDGVADSY QALLPLDSFP DNGIAWPRST LFKVLDDLTT PTTVLDWTIN ITFTSADVAV STAENVIVNI RDQYRQRGRH ASSSDELLRK LASGRELASE LKRGSAERGV NAAIVIAAAA GDPDTVNRAV ADVARTYRGQ NIGSKRWRGS QPTLWRAFAP GGERRAALDE FRNPTTTKRF APFVPLLASK LGNNTGVPLG MNLTSPGLRD VVLLDVLNAP ARENPANLVI CGSPGRGKSH ATKNLSRSWL KLGAGLHLFD PTDAREHETA LADFDDKVVI DVSRMNFSLD GLRVFPYKEA AERTIDHLLP QLGLSPLSRG AQRLWGLLAP ESREANGIGS TAQLIRYLRD MPTARRTDAD EDLLIGLEGL AAQRLLRPLF DESLPVPDIA TTQCVIWNFA GLKLPTVTEE YQAHLHQQTT PGQRAAQALY GLGAEVAQSI FFGRPDQPDM LVVEECAAWT NSPGGQKCAN TIIRQGRKAW TGFCGISQQP IKDFAVLEDE FIDQRLCLGF KRSDIAKATL QWCDRDLDRH PELLANYVNN TSPVQLVDHG DDAIDDRYGK VIPGREGEAW FLDEFGGFGK VALFAAPTAA LAARFDTNPH RARQRSQATQ RS
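The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch only: it assumes the whitespace in the sequences is display grouping, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for internal codons (table 11 differs in the set of permitted start codons; this gene starts with ATG). The helper names are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as displayed in the record.
prefix = "ATGTCTGTGC CCGCCCCGCT GCGCGCGATC"
print(translate(prefix))  # MSVPAPLRAI
```

The translated prefix matches the first ten residues of the protein sequence. Passing the complete gene sequence to `translate` and `gc_fraction` would allow the full protein sequence and the GC content field to be compared in the same way.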
|