Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_3654 |
Symbol | |
ID | 4611586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 3849038 |
End bp | 3851413 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639793332 |
Product | hypothetical protein |
Protein accession | YP_939638 |
Protein GI | 119869686 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTCAGTC AGTCATGTCG CGCGGGAGGT GCGGCGCTGG CGGTGGGTAT CGGCATGCTC CTCGCGCCAG GGATCGCGGC AGCAGACCCC TCCGCCGACG CGGCAGGCAC CGATGTCTCC GCGCATGCAC CGGCCGACAC CCGGCAAGAC GACCACACCG AGAAAGCCGA CGAGGAAACG GACGCCCCCG AAGACAACGC GGAGGACATC CCCGAGGACA GCGCGGAGGA CGAAGCGGAG GACGAAGCGG CGCCGGTGGA GGACGAAGAT ACCGACGCGA AAACCGGCCA CCGCGACGCG GACGACGAAG AACCCACCGA GGATCCCGTC GACGAACCCG TTGCGGACGA CCCCGTTGAA GACAACGAAG TCGATCCAGA AGAACCCGCC GAATCGCCCG CGCCCGTCGC GACGCTCACC GACACCGTCG GCGCCGGCGG CACGCCTGCC CCGGTCGAGT CACCCGCGAC GTGGGCCGTA CTCGCCTGGG CCCGCCGCCA ACCGTTCAGC ACCACCACGG CCGCGAACAC ATCCGCCAGG CACACGACCG CGTCATCCTC GACGGCGACG CCGGCCACGA CGGTCGACGT CAAGGACTAC GGGGCGGTCG GTGACGGCGT CACCGACGAC TCGGCGGCGA TCAAGGCCGC CGAGGCCGCG CTGGCCTCGG GTCAGCGCCT CTACTTCCCC GAGGGCAGTT ACCGGTTCGC CCAGCAGAAC CCCGCCGGCA ACGCCGCGGT CCTGCTCAAG GGTCTCTCCG ACGTCACGGT GGAGTTCGCA CCGCATGCCC GGCTGCTGAT GGACAACCTC GACGCCGCCG AGCACGGCAC CAGCCACGGC ATCCGCGTCG AGGGCGCGGC GTCGAACGTG ACGATCCTCA ACGCCACGAT CGAGTGGAAG ACCCGACCAT CCGCGCGCAG CTTCGGCGAC GGGTTCTCGA TCCTCGGGTG GGCGTCGAAC ACCGCGCCCC CGCCGGGCTG GACCGGATCG ACCGGAACGG TCTCCAACGT GTCGCTCGTC AACGCCACGG TGATCAACGC GCCGCAGACC GGCGCGATCT TCATGGGCGC CTCCGACGTG ACCGTCACGA ACTTCACCGC GATCGGCACG CTGGCCGACG GGTTGCACTT CAACGCGAAC CGCCGGGTGA CCGTGCACGG GCTCCTCGCG CAGAACACCG GCGACGACGG CCTGGCGTTC GTCACCTACT ACGACCCGAC CCTGCCGTGG ACCTACGGGC CCGGCGACGG CCCGTTCAAC CAGCCCGGCC TCGGCGAGTG GAACAACGGC GGTTCGGTGG CGACGAACAT CACCGTGACG GGTGGGGCGG CCAGCGGGGT GCGCGTCCAG GGTGGTTATG ACATCACGAT CACCGATGTC ACCGTGACCG GTAAGGAGTT CGGCCTCCAG GTCAACTCCG CCAAGGCCAC CGGTCCGGGC GACTGGACGA GTCTGGCGTC GCGCGACATC TCCATCTCCG ACGTGACCAT CAGCGCTACC GTGACAGGAA TCGTCCTGGC CACCAACAAC ATCGACGGCA CCGAGGCCTC CATGTGGTGG GACTTCTCGG GCCTGACGAT CAGCGACGTC ACCATCCACA ACTCCCGCAA CTGGTCGCTC GCCGTCGAGA CGCCGGCGAG CACCACGAGC AGATTCGCCG GCGTCACCCT GCGCAACATT CATGCCGAAG TCGACGCGGA CGTCGGCCCA CTCGGCGGCG GCAACGGCGG CATCCTGCTT GCGTCGCTTC GGGATTCCGT GATCGACGGT GTGCGCCTGG TGTCGGTCCA CGGTAGCGAC ATCAACGTCG TCGGCGCGGC TCAGATCCGC AGTCAGTACA GCGTCGCCGA TCTGCCGTCG TCGAACCTGA CGATCGACGA TCTGGTCCTC GAGGGCCCGG GTCGGATCCT GATCCAGGAC ATCGCCGGCC TGGACGTCGG GACTGTGGCG TCCCACGGCG CCAACAGCGC CGCCATCGAA CTCTTCCGCG TCAAGTCCGC CTCGTTCGAC ACCATCGGGG CGTACCTGCC CGGCCGCGGC AACGGGGCGG GCTGGGGCGT ACGGCTGCTG CAGGTCCACG ACCTCGACGT GGCGAACATC GAGGTGATCA CCGACGACCA CATCGGAACA TCCTGGTGGG CAGTCGAACT CGGCGGCGGC AATCCTGCAC AGGACATCGC CGGCGCCGGT GTGCGCATCG ACGACGTCAC CTACGTCAGC GGTCGTGACG CCACGGACTC CGACATCGTG GTCCAGGGTG GACCGTACGG ACCGGTCGAC TGGTACATCA ACGCAACCTG GCGCCACGAG GGTGAGGCGT CGCCGCTGTG GCGCGCCGGT CTGTGGGGCG ACGCGATCCC CTCGCTCACA TCCTGA
|
Protein sequence | MVSQSCRAGG AALAVGIGML LAPGIAAADP SADAAGTDVS AHAPADTRQD DHTEKADEET DAPEDNAEDI PEDSAEDEAE DEAAPVEDED TDAKTGHRDA DDEEPTEDPV DEPVADDPVE DNEVDPEEPA ESPAPVATLT DTVGAGGTPA PVESPATWAV LAWARRQPFS TTTAANTSAR HTTASSSTAT PATTVDVKDY GAVGDGVTDD SAAIKAAEAA LASGQRLYFP EGSYRFAQQN PAGNAAVLLK GLSDVTVEFA PHARLLMDNL DAAEHGTSHG IRVEGAASNV TILNATIEWK TRPSARSFGD GFSILGWASN TAPPPGWTGS TGTVSNVSLV NATVINAPQT GAIFMGASDV TVTNFTAIGT LADGLHFNAN RRVTVHGLLA QNTGDDGLAF VTYYDPTLPW TYGPGDGPFN QPGLGEWNNG GSVATNITVT GGAASGVRVQ GGYDITITDV TVTGKEFGLQ VNSAKATGPG DWTSLASRDI SISDVTISAT VTGIVLATNN IDGTEASMWW DFSGLTISDV TIHNSRNWSL AVETPASTTS RFAGVTLRNI HAEVDADVGP LGGGNGGILL ASLRDSVIDG VRLVSVHGSD INVVGAAQIR SQYSVADLPS SNLTIDDLVL EGPGRILIQD IAGLDVGTVA SHGANSAAIE LFRVKSASFD TIGAYLPGRG NGAGWGVRLL QVHDLDVANI EVITDDHIGT SWWAVELGGG NPAQDIAGAG VRIDDVTYVS GRDATDSDIV VQGGPYGPVD WYINATWRHE GEASPLWRAG LWGDAIPSLT S
|
| |