Gene Information |
Locus tag | Mmcs_5491 |
Symbol | |
ID | 4114359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008147 |
Strand | - |
Start bp | 63789 |
End bp | 66797 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 638034646 |
Product | AAA family ATPase |
Protein accession | YP_642647 |
Protein GI | 108802451 |
COG category | [R] General function prediction only |
COG ID | [COG1483] Predicted ATPase (AAA+ superfamily) |
TIGRFAM ID | |
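The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes one terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 63789
END_BP = 66797
GENE_LENGTH_BP = 3009
PROTEIN_LENGTH_AA = 1002


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare computed values against the fields in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the numbers agree: 66797 - 63789 + 1 = 3009 bp, and 3009 / 3 = 1003 codons, of which one is the stop codon, leaving 1002 aa.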
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.188779 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACCC CGGTGACGCC GTGGTGGAAA GCGCTGAAAC TTCGGCAGGA GATCCTGTCG GCGTCCGGGC AGATCGACGA CGTACAGATG TCGTTGTTCG CCGCCGTCCA TGGCGCCGGC GCGACTCGAC CGCCCTACGC TGACGCCGGC TACTACGGGG ACATCACCCA TCCCACCGAA CGTCTGGTCG ACCTGCTCAC CGAAATCGCA ATCCGGATCG GCGGCGGCGA GGACTACATG AAGGCCCGCG CGGTCACCCG CCTCGACCAG GGCATGGGCG GCGGCAAATC GCATGCGTGC ATCGGGGCGT TTCACCTGGC CGCTAATCCC GAAGCCCTTC TGGGTACCGA GTTGGGTAAG CAGGTGGCTG CCCGGGCGAA GGCCAAGATC GGCCAACCAC TGGCGGAGAA TCTGCGCGGC CCGCACGTCG TTGTTCTGCC GTGCGACAAC ATGACGCCGG GGGCGTCGGT GCAGGAGTAC GACGGCCCGG CGGTCAACCT CTACGAACGG TTCCTGTGGC GGCTGTTCTC GAAGGATTAC TCGCTTTTTG AGCGTTACCA ACCGTTCTGG AGCGACAAGC ACAAGATCGC CGAGGCGATT CGGGCGGTGA ACCGGCCGGT GCTCATCATC GTCGACGAAG TGCTCGACTA CATCGGCAAC GGCCTCGACG GGGCGAACAA GCCCGACCTC GCCGCGCAGG ACATGGCGTT CTTGAGGGCG CTGCTTGATG TCGTCAACGA TGTCCCCAAC GTGGCGATGC TCATGGTGAT GATCGCTTCG GACGCCGACC AGACGGCGCT GTCCAAGGTC GCTCAGGAAC GTCGCGATGA CCTCAACCGT CTGCTCGAAC GCAACGGGTT CCCAGCCGTG GTCACTGAGG TCGGCGACTT CGCCGACATT CTGCGGCGCC GACTGTTTGA CTCCGAACCC GCCGCCGAGG TGGTCAGCGC CACCGCAGCC CAGTACGACT CGGTTTTCGC CGACAAAGGG TGGAGCAAGA ACGTCTGGGA CACCATCGGC GCCACCTGGC GTGATCGCTG GTCCGACGAG GTAGCCGCCT GCTACCCGTT CCATCCGATG CTGATGGCGT TGGCCAAAGA CGAGTGGAGC AAGGTCACCG GGTTTCAGCG GGTCCGCTCC ACGATTCGCA TTTTCGCGGC CACGGTCTAC GCCCAACAGG AGCGCGGCAA GGACGGTCAG TGGGTTCCCA CACTCATCGG GCCCGGTGAC TTTCCGCTGT CGGATTCCGC AGTGCGCGAG GCGATCCTGG GTAGCGGTCT GGTCGAAGAC GAACGCACAA TCGCGAACTA TCGCAGTCTC GCCGAGATCG AGGTGGTCAA TCACACCGGC AGCAGCGGCA CTGCTCGCCG TCAGGACCTC AGCCGTGAGC CGCTGCTGTG GAGTGAGGCC AACCCGCGGG CCGCTGAGCG TGCAGCGACA TTCATCTTCA TGGCCAGCAT CGTCGGCACA CTGCGTCCGG GCCGCGGCCG CGGCGCCAGC GCACCGGAGG TTAAAGCCGC CACGAGCGTT CCTGATGTCG GATACACCAT CACCGACGCC GACGCCGTGG TCGCCGAGCT GGTCGACGTC GACCGCGGAC TGAGCGCTTT AGACATCATT CCGGGCCAAG GCAATAACAA GCCGGCACGC TACTTCCTTT CCACGCGGCT CACCCACCGA ATGCTGGTCA ACAACATTCG CCGCACCATC ACCGAGGCCG AGCGCGACGC CGTGCTCGTT GAGTTCGCCC GGCGGCTCGC TAGCACCGGT CCCTTCCGGG AACAGCGGTT CGTCGCGGCC 
GACGCCACCC GCACCCCCGT CGAGGTGCTG TCCACCGCCG GTCTGGACAC CGCGTTCACC ACGCGGCTGA TCGTGCTGGA TCCCGCGCAG TTCAGCCTGC GCAACGGCGC TGAGGCGGCC ACCCTCGAAG CGTTGCAGGC TGCGATGGGC CTGGGGCAGG GGCCGCAGCA GCTTCCCGTG CAGTGGGCTA GTAGCGCGGT GTATGCGGTG GTGAACACCC AGCGCCGCAG CCTCGCGCGC AGCGTGGCCG CCGAGTACCT GGCCCGCAGC AAGGCCCTCG CCGCGCCGGA GATTCAGGCC GACGAAGAAT TGAAGGCCAC TGGTACGAAA GAGCTTTTGG CTGCCAAAGA ACAACTCGAG AAGGCCTTGA AACGGGCGTA TCAACACGTC GCCTACGTCG CACAGCCAGA CCCTGACGGG GAGCGGTACC TCGACCAGCT CACCTTCGAC GACGACACCC TCACCGCGCT CAACGGCACG ATCGTGTGGA AGGGGTTGGC CGACCGCGAC AAGGTGTTCG ACGCCGGTCA GTTCGGCGCG CAGGCCCTGT TGCACAACCT GCGCGAACAG GACTACGGCA AGACACTGGC CGACATCCGC GCCGCGTTCT ACAGCGCTCC ACGGCTACCG CTGCTCTACG AGGCGGACCG TGACCTGCAG CAGGCCATCT ATGACGCGGT GAGCCAGGGC TCGTTGCGCA TCGTCGACGC GGCCGGTACT GCGGTCGAGG TCACCACTCC CGGGCAGGTG AACCTCACCA GCACTGCCCT GAGAATCGCG GCGCCGGCTC CTGCTGCCGG AGAAGGCAAT GGCGCGCCTG CTAGTGGGCA AGGAGGAGAG CCGGCGGGCG GCTCCGCCGG CTCGGGCGCG ACGGGCTCAG CCGGTGGGGC TGGCGGAGGG TCGGCACCGG CCGCCGCGAC GGGTGGTTCC GGTGCCGCCG CGTCAGCTGC TGGCGGCGGG CAGCCCGGTC CGGCTGCCGA AGCTGCTGAC CGGCAAGTGG CGTTCTCGTT CACTAGCAAC CTGCTGGCCG GCGCTGAAAC GGCGGACGGT TTCGCCGCCC TGTTCCGGGC GTTCTACATG GCGCTCGATG AGCGCCAGAT CAGCTACCTG CAGGGAACCC TGCAGCTCGT CGTTGATTCC ACGGTTGTCG ACCAAATCGC CCCGCTGCTG GCCGATTTGG GGATCACCGC GACGATCAAG AACATCTGA
|
Protein sequence | MTTPVTPWWK ALKLRQEILS ASGQIDDVQM SLFAAVHGAG ATRPPYADAG YYGDITHPTE RLVDLLTEIA IRIGGGEDYM KARAVTRLDQ GMGGGKSHAC IGAFHLAANP EALLGTELGK QVAARAKAKI GQPLAENLRG PHVVVLPCDN MTPGASVQEY DGPAVNLYER FLWRLFSKDY SLFERYQPFW SDKHKIAEAI RAVNRPVLII VDEVLDYIGN GLDGANKPDL AAQDMAFLRA LLDVVNDVPN VAMLMVMIAS DADQTALSKV AQERRDDLNR LLERNGFPAV VTEVGDFADI LRRRLFDSEP AAEVVSATAA QYDSVFADKG WSKNVWDTIG ATWRDRWSDE VAACYPFHPM LMALAKDEWS KVTGFQRVRS TIRIFAATVY AQQERGKDGQ WVPTLIGPGD FPLSDSAVRE AILGSGLVED ERTIANYRSL AEIEVVNHTG SSGTARRQDL SREPLLWSEA NPRAAERAAT FIFMASIVGT LRPGRGRGAS APEVKAATSV PDVGYTITDA DAVVAELVDV DRGLSALDII PGQGNNKPAR YFLSTRLTHR MLVNNIRRTI TEAERDAVLV EFARRLASTG PFREQRFVAA DATRTPVEVL STAGLDTAFT TRLIVLDPAQ FSLRNGAEAA TLEALQAAMG LGQGPQQLPV QWASSAVYAV VNTQRRSLAR SVAAEYLARS KALAAPEIQA DEELKATGTK ELLAAKEQLE KALKRAYQHV AYVAQPDPDG ERYLDQLTFD DDTLTALNGT IVWKGLADRD KVFDAGQFGA QALLHNLREQ DYGKTLADIR AAFYSAPRLP LLYEADRDLQ QAIYDAVSQG SLRIVDAAGT AVEVTTPGQV NLTSTALRIA APAPAAGEGN GAPASGQGGE PAGGSAGSGA TGSAGGAGGG SAPAAATGGS GAAASAAGGG QPGPAAEAAD RQVAFSFTSN LLAGAETADG FAALFRAFYM ALDERQISYL QGTLQLVVDS TVVDQIAPLL ADLGITATIK NI
|
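The GC content field in the record (66%) is a property of the gene sequence listed above. As a rough sketch of how such a percentage could be computed from a sequence written in space-separated blocks, the snippet below may help; `gc_percent` is a hypothetical helper, and the database's own method and rounding are not stated in the record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence.

    Whitespace (such as the block separators used in the record) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100 * gc_count / len(bases)


if __name__ == "__main__":
    # First 10-base block of the gene sequence in the record.
    print(gc_percent("ATGACGACCC"))  # 60.0
```

Passing the full Gene sequence field to `gc_percent` would give the value to compare with the reported GC content.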