Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_4791 |
Symbol | |
ID | 4113620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | + |
Start bp | 5072067 |
End bp | 5074352 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638033942 |
Product | hypothetical protein |
Protein accession | YP_641951 |
Protein GI | 108801754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.141175 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTGA GCGCCCTCGA CGGTTTCTAC ACAACCTGGG ACAACGCGCG GCAAACGTTC GGCCAGGGCA CACCGCAGCC GGGCGCCGAC TTCGACAAGA GCCCGCAGCT GACCGACCTC GGGTCCGGGG TGACGGCGGC GGCGCCGGGT TCGAAGTGGT CGGGGGCCGC GGCGACGAAC TACGACAAGG CCAACACCGA CCATCAGGCC GTGTTCACGC GGTTGGCGGA GCTGGACCGC AGGATCGCCC AACAGGTCGA CCAGTCGGCG CAGGTGGTGG CGACCGGCAG GCAGAACCTC GATCAGGTGC GGCAGTGGGT CACCGACGCC GCCAACAGCG TGCCGCCCGG CAAGCAGCGC GACATGTTCC TCATGCAGAT CGCCAACCGT GGGCTCGGGC AGCTTTCCGA GGTCGTGCAG AAGACGAACG CCGAGTCGAA CACGGTCGCG CAGAACCTCG CGAAGCTCGG GCCGGAGTTC GACGCGATCA AGAACGAACA GAAGTTCGGC AACGGCGAGA AGGACGAGAA GAAGGACGAC GCCGAGGCGC TGGGGAACGA GGAGATGCAT GCGGTCCCAG AGAAGGAGCG TGCAGAGCAG GACGTGCAGG CTGCGCTGGC CGGAGACCAA GGTGCAGCCG CGCGGGTCGA CGAAGTACTA GGCAAGATCC AGCCGGGCCA ACAGTTGACC TCTGAGCAGG CTTCGTATCT GAGTCAGATG CAGGCGCAGC AGCATGGCAT GAGCATCGAT GATCTCCACA AGGCCGAACA ACGGCTCGGC GACCACAAGG ACATCATCGG TGACTCATGG CAGCTCATGA GCAACGACGA CGTGTGGATG CCACACACCG AGCTCGAACA AGGCGCTCTG GACGACCCCA GCTGGAGAGT GAGAGGCGGC TTCGAACAAC TCCCGACGAG CGTGCAAGAT GCCCTGAGGG ACGCTGATAA GTCAGCCCCG CTCGGGGAGG GGATTGCGGG CCTCACTCAT GACCGGGATC TTCAGAAAAT CTCGGAGATC GTCACCGATG GCAACGATAA ATTCCAGACC GGCACTGCAC TCGACCGGGA AATGATGACG GCGGCCGACA AGATCATGGA CACAAATCTT GGAACGTTGC AGCCGATGGG TGATCCCGAG ATTGTTCAGG GACTGTTCGA GGCCGCCGGG AGTGACCACC AAGTGGTGCA CGATCATCTG CTAGGCGATG GCGGCGACGA CTTCCTCCAC GATGTCAACG CGGTGGAGTG GACCGACGAT GGAAAGTCCG CCGCTTCGCT ATTCAGCTGG ACCCACGACG CAGCAGCGGG ACACGAAGCC GGCATCGCAG GAAATACAGC GGAAAAATAC GCGTCTTACA TCGGCTCGCA TAAAGATGAG CTAATGAATA TCGACGGGCA GACACTCGGC GAATTAAATC CGGAATTGGT AAAAGGGTAT GCACACGGCT TGACACCCTA CATGTCCGAT TTGTCGGGGC TCGCAAGCGC GGATCCGAAC AACACGTTTG ACTTCATCGA TAAGGAAAAC GCGGTCGAGA AGCCAGTGGC AAAAGGCATC TTTGCTGTCA TGAGCACCGA AGAAAGCGCC TTCAAAGAAT TCCACAGCGT CGCGGACGCA CACATCCTCA AGAATAGCTA CGAGTGGGCG AACGACGTCA AGAATGGCGT TCAGGTGCCT GCCAATGATG AGCGACTGTT GGACTGCGCC AGCCTCCAAG CACTCGAAAA GGTCGGCATC ACCGAAGCCG CCAATGCTCT CGGCCTTAAT AGAGAGCAGG TGGAAGCGCA GCGCGCAGAA GCCTACGACA TGGGACTCAA GATGCTGAGT GGCACAGGAA GCGTCGTACC CGGGATCGGA ACCGCAGTCG GCCCAGCCAT AGACAATTTC GGCGGTGCGA TGAAGGATTC AATTTTCGGC GATCCGCGAG ACTTCCAAGA AATGACCGTA CGCAATATGG ACGCGGGAGA GTCGGCACGC TTCGCATTGA ATGCTCTGAT TGCACAGGAC GTTCCGTTGA ACTATGGGCG CTACTCACTC GACGATTCCT GGTACGAGTA CGCGTCTGTC GATCCGGCAC GTCCGGAATT AGGAGAGGCC CGATTCATCA AGAACATCGA GGGACTGCAG AATGCCAGCA TCTCCGACTC CTCCATGGAG CAGAATCTGA CCACGATACT CAAGCAAACA CTCGGAGAAT CGTCGCCGAC TGACGCGGTG AAAGATCAGT ACGATGACGT AGTCGCCAAT CCCGATCCCA GAGAGGCCAT TCGGCCAGGA CAATGA
|
Protein sequence | MPVSALDGFY TTWDNARQTF GQGTPQPGAD FDKSPQLTDL GSGVTAAAPG SKWSGAAATN YDKANTDHQA VFTRLAELDR RIAQQVDQSA QVVATGRQNL DQVRQWVTDA ANSVPPGKQR DMFLMQIANR GLGQLSEVVQ KTNAESNTVA QNLAKLGPEF DAIKNEQKFG NGEKDEKKDD AEALGNEEMH AVPEKERAEQ DVQAALAGDQ GAAARVDEVL GKIQPGQQLT SEQASYLSQM QAQQHGMSID DLHKAEQRLG DHKDIIGDSW QLMSNDDVWM PHTELEQGAL DDPSWRVRGG FEQLPTSVQD ALRDADKSAP LGEGIAGLTH DRDLQKISEI VTDGNDKFQT GTALDREMMT AADKIMDTNL GTLQPMGDPE IVQGLFEAAG SDHQVVHDHL LGDGGDDFLH DVNAVEWTDD GKSAASLFSW THDAAAGHEA GIAGNTAEKY ASYIGSHKDE LMNIDGQTLG ELNPELVKGY AHGLTPYMSD LSGLASADPN NTFDFIDKEN AVEKPVAKGI FAVMSTEESA FKEFHSVADA HILKNSYEWA NDVKNGVQVP ANDERLLDCA SLQALEKVGI TEAANALGLN REQVEAQRAE AYDMGLKMLS GTGSVVPGIG TAVGPAIDNF GGAMKDSIFG DPRDFQEMTV RNMDAGESAR FALNALIAQD VPLNYGRYSL DDSWYEYASV DPARPELGEA RFIKNIEGLQ NASISDSSME QNLTTILKQT LGESSPTDAV KDQYDDVVAN PDPREAIRPG Q
|
| |