Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_1434 |
Symbol | |
ID | 4614265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 1540135 |
End bp | 1543560 |
Gene Length | 3426 bp |
Protein Length | 1141 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639791109 |
Product | SARP family transcriptional regulator |
Protein accession | YP_937436 |
Protein GI | 119867484 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTGGGAC CGCTGCAGGT GACGCATGCG GACTGCCCGG TCGACATCGG GTCGCCGAAG CAGCGCGCGG TGCTCGCCGT GCTCCTGCTC GCCGCGGGCC GGGTCGTGTC GGTGGACCGG CTGATCGACG CGGTCTGGGG TGACGATGCC CCGGGCAGCG CGACGGCCAG CCTGCAGGCC TACATCTCCA ACCTGCGCCG GGCCCTGCGC GACGCGGGTC AGTCGCAGGT CGCCTCGCCG ATCGTGCGGC AACCGCCGGG GTATTTCCTC AACGTCGAAC CCGGCCAGGT CGATCTGGCG GTGTTCGCCT CCGGCTGCGC CAGGGCGGTG GCCGCCGTGG ACAGCGGGGA CTGGGACGAG GCGCTGGCCG CCGCCGACGA GGCCCTGGCG TGGTGGCGTG GGCCGCTGCT GGCCGACCTG TCCGACGAAT CGTGGGTGGC CGACGAGGCG GCCCGCGCCG AGCACCTGCG CGCCGATTGT CTGGACGCGC GGATCACCGC GCTGCTGGGG CTGGGCCGGG TGCCGCAAGC GCTGGCGGCG GCGGCCGAGT TGCGCTCCGC CACACCGCTG GCCGACCGCG GCTGCTGGCT GCACATGCTC GCGCTGTACC GGGCGGGTCG GGTGACCGAC GCGCTGGACG TCTACACCCG CCACGCTCGG CTGCTCGACG ACGAACTCGG CGTGCAGCCG GGACGCGAGG TCCGCGAGCT GCAGACGGCG ATGCTGCGGC AGGCACCCGA GTTGTCGGCG TGGCCGCGGT CCCCGGAGTG GACGGGCGCC GGTGCGGTGG CCACCCCGGC CGCGCCCACG GTCGAGCCGT CGGTGGTCCC GGCGGGCCCG AGCCGGGGCG CGCTGATCGG CCGCAGCCGC GAATTGTCCA CCGCCGCGGG TGTGCTCTCG GATGTCGCGG CGGGTGCGGC GCGGTGGCTG GTGCTGTCGG GTCCGGCGGG AATCGGCAAG ACCCGGCTGG CGGAGGAGGT CGCGGCCCGG GTGGTCGCCG ACGGCGGGGA CATGGTGTGG GTGAGCTGTC CCGACGAGCG GGCGACCCCG CCGTGGTGGC CGATGCGGCA GCTGGTGCGG GCGCTGGGCG CCGATCCCGA CGACGTGCTG GAGGTGCCGC CGGACGCCGA TCCGGACACC GCGAGATTCC GTGTCTACGA ACGCATCCAG ACCCTGCTGG AGTCGGCGCC GCGCACGCTT GCGGTGGTCA TCGACGACGT GCAGTGGGCG GACACCACGT CGGCGGCGTG CCTGGCCTAC ATCGCCGGGG CGCTACGCGA CCACCCGGTG GCGCTGATCC TGACCGTGCG CGACGGCGAA CACAGCGCCG AGGTTTCCAG GCTGGTGACG ACCGTGGCCC GTGGCGACCG CAACCGCCAC GTGGCGGTTC CGGCGTTGTC CACCGAAGAT GTTGCGGCGC TGGCGAATCA GGTCGCCGAC GATCCGGTGA CCGAGGCGGA GGCGGCGCTG CTGGCCGATC GCACGGGCGG CAACCCGTTC TTCGTCTCCG AGTACGCGCG GCTGCCCCGC GCCGACCGGG TCGGCAGCGA GATCCCGGTC GCGGTGAAAT CGGTGCTGGA CCGCCGCCTG GCCGGGCTCG ACCCGGCCGC CGTGCAGGTG CTGCGGACGG CCGCGATCAT CGGCGACGCG CTCGATTCGG ACGCTGTGCC GGTGTTGGCC CAGGCCACCG GAATGGACGT CGACACGCTG GCCGACCATC TCGACGATGC GGCCGACGAG CGGATCGTGA TCGCCGCGCA CACCGGTGAC GGGTACGCGT TCGCGCACGG ACTCCTCCGT GACCATCTCA TCGCGGGGAT CCCCCCGCTG CGCCGCCAAC GCCTGCACGC CAAGATCGCT GACGTGCTCG ACGGCAGCAC CGCCGAGGGT GCGCTGACGC GCCGCGCCCA GCACCTCATC GCCGCGCAGC CACTGGTGGA CGCCGGCGCG GTGGTGCAGG CGTGCCGACT GGCCGCCGAG GATGCCACGG CGCGGTGGAG TTCGGACATC GCGGCGGTGT GGTGGCAGGC CGCGCTGGAC GCCTACGACC GCCTTCCGGC GGCGTCGCGT TCGGAAGAGG AGCGCGACGG GCTGACCGTG GCGATGCTCG AGGCGCATTC GCGCGCGGGG CGCGGCCGGC TGGTCCTCGA CACCGTCGCC GCACAACTCG GTGATGCCGT GCGCACCGGT CGGGCCGCGA CGGCCGGTCG GCTGGCCAGC GCGCTACTGC GGGCCAGCGG CGGGTGGCCG TGGCTGGCCC CCGGCCATGA TCCCGGTGGG GTGCTCGCGC TGCTGGAGGG GGCCGCGGTG CTGGCCGAGA GTGACCCCGC CGCGGGGGCG CGGGTGCTGA CCGCCCTCGC GGTCGGGCAC TGCTATCACC CCGACGCCGC GGTGTCGGCC GGGCATCTCG AACGGGCCGC GCGGTTGGCC GAGGCCACCG GGGACCGCGA TGTCATTGCC GACGTGTTGA TGGGCCGGTT GATCACCTAC TCCGGGGTGG CCGCCTACAG CCATCAGACT TTGGAGTGGG TCGCGGAACT GAACGCGCTC GGGCACAGCA GGTCCCGGGA GGACTCCGTG ATCGCGCACT CGGTGGCCAC GATGGCCGCG GTGAACCTGA CCGAGATCGA CCTGGCGAAA CTGCATCTGC AGGAGGGCAT CTCGGGCAGT GAGGAACTGC GGCTGCCGGT GCTGCGGGCA CAGCTGCGCT GGATGGAGGC GGTGCTGGCG GTGTGGCGGG GGGACTTCGC CGAAGCCGAA CGCCACCACC GGATCGCGGC GGAGGTTCAT GAGCAGACCG AACTGTACGA AGCCGGAAGC GGTTTGATCG CGACGGTGAT TCTGATCCGG GAGAGGGGCG GCCCCGTCGA GCCGGGTTGG CCGGGCTCGC GTGCCGACAC CGAGAGCGGG GGACAGGGCA TGGTCGGCCT GGTGCACACC GCTCTGCTCA CCGTGGACAG CGGCGACGAG GCACGGGCGC AGGCGCTGAT GCGACTGCGG GAGTGGGACG CCCAACCACA CCGGGCACAT GTGTGGACGA CGCTCGGGCA CGCGACGCTG CTGGCCCATC TGGCGTGCGA CCACGGATAC GCCGAGTTCG CCCCCGCGCT GCTGGAGAGG CTGCTGCCGT TCGTCGACCG CATCGCGGAG ATCGGTCAGG TCGGCGTGGT GGGGCCGGTC GCCCTGGCGA CCGCGCGTCT GCGGGCGCTG ATGGGCGACA CCGACCGCGC GCGGGCCGAC CTGGCCGACG CCGAGGACAT CGCCGCGCGC ACCGGGGGTG TTCCCGTCCT GCTGCGGTGC CGGCTGCTGC GTGCGGAGCT GACCCCGCCG GGTGAGCAGC GGAAGGCGGC GGCGCGGGCG CTCGCCACTG ATGCCGATGC GCTGGGCATG CGCGGCGTGG CCGATCTGGC ACGTCGGCTC GCGTGA
|
Protein sequence | MLGPLQVTHA DCPVDIGSPK QRAVLAVLLL AAGRVVSVDR LIDAVWGDDA PGSATASLQA YISNLRRALR DAGQSQVASP IVRQPPGYFL NVEPGQVDLA VFASGCARAV AAVDSGDWDE ALAAADEALA WWRGPLLADL SDESWVADEA ARAEHLRADC LDARITALLG LGRVPQALAA AAELRSATPL ADRGCWLHML ALYRAGRVTD ALDVYTRHAR LLDDELGVQP GREVRELQTA MLRQAPELSA WPRSPEWTGA GAVATPAAPT VEPSVVPAGP SRGALIGRSR ELSTAAGVLS DVAAGAARWL VLSGPAGIGK TRLAEEVAAR VVADGGDMVW VSCPDERATP PWWPMRQLVR ALGADPDDVL EVPPDADPDT ARFRVYERIQ TLLESAPRTL AVVIDDVQWA DTTSAACLAY IAGALRDHPV ALILTVRDGE HSAEVSRLVT TVARGDRNRH VAVPALSTED VAALANQVAD DPVTEAEAAL LADRTGGNPF FVSEYARLPR ADRVGSEIPV AVKSVLDRRL AGLDPAAVQV LRTAAIIGDA LDSDAVPVLA QATGMDVDTL ADHLDDAADE RIVIAAHTGD GYAFAHGLLR DHLIAGIPPL RRQRLHAKIA DVLDGSTAEG ALTRRAQHLI AAQPLVDAGA VVQACRLAAE DATARWSSDI AAVWWQAALD AYDRLPAASR SEEERDGLTV AMLEAHSRAG RGRLVLDTVA AQLGDAVRTG RAATAGRLAS ALLRASGGWP WLAPGHDPGG VLALLEGAAV LAESDPAAGA RVLTALAVGH CYHPDAAVSA GHLERAARLA EATGDRDVIA DVLMGRLITY SGVAAYSHQT LEWVAELNAL GHSRSREDSV IAHSVATMAA VNLTEIDLAK LHLQEGISGS EELRLPVLRA QLRWMEAVLA VWRGDFAEAE RHHRIAAEVH EQTELYEAGS GLIATVILIR ERGGPVEPGW PGSRADTESG GQGMVGLVHT ALLTVDSGDE ARAQALMRLR EWDAQPHRAH VWTTLGHATL LAHLACDHGY AEFAPALLER LLPFVDRIAE IGQVGVVGPV ALATARLRAL MGDTDRARAD LADAEDIAAR TGGVPVLLRC RLLRAELTPP GEQRKAAARA LATDADALGM RGVADLARRL A
|
| |