Gene Mmcs_5491 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5491 
Symbol 
ID4114359 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp63789 
End bp66797 
Gene Length3009 bp 
Protein Length1002 aa 
Translation table11 
GC content66% 
IMG OID638034646 
ProductAAA family ATPase 
Protein accessionYP_642647 
Protein GI108802451 
COG category[R] General function prediction only 
COG ID[COG1483] Predicted ATPase (AAA+ superfamily) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.188779 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCC CGGTGACGCC GTGGTGGAAA GCGCTGAAAC TTCGGCAGGA GATCCTGTCG 
GCGTCCGGGC AGATCGACGA CGTACAGATG TCGTTGTTCG CCGCCGTCCA TGGCGCCGGC
GCGACTCGAC CGCCCTACGC TGACGCCGGC TACTACGGGG ACATCACCCA TCCCACCGAA
CGTCTGGTCG ACCTGCTCAC CGAAATCGCA ATCCGGATCG GCGGCGGCGA GGACTACATG
AAGGCCCGCG CGGTCACCCG CCTCGACCAG GGCATGGGCG GCGGCAAATC GCATGCGTGC
ATCGGGGCGT TTCACCTGGC CGCTAATCCC GAAGCCCTTC TGGGTACCGA GTTGGGTAAG
CAGGTGGCTG CCCGGGCGAA GGCCAAGATC GGCCAACCAC TGGCGGAGAA TCTGCGCGGC
CCGCACGTCG TTGTTCTGCC GTGCGACAAC ATGACGCCGG GGGCGTCGGT GCAGGAGTAC
GACGGCCCGG CGGTCAACCT CTACGAACGG TTCCTGTGGC GGCTGTTCTC GAAGGATTAC
TCGCTTTTTG AGCGTTACCA ACCGTTCTGG AGCGACAAGC ACAAGATCGC CGAGGCGATT
CGGGCGGTGA ACCGGCCGGT GCTCATCATC GTCGACGAAG TGCTCGACTA CATCGGCAAC
GGCCTCGACG GGGCGAACAA GCCCGACCTC GCCGCGCAGG ACATGGCGTT CTTGAGGGCG
CTGCTTGATG TCGTCAACGA TGTCCCCAAC GTGGCGATGC TCATGGTGAT GATCGCTTCG
GACGCCGACC AGACGGCGCT GTCCAAGGTC GCTCAGGAAC GTCGCGATGA CCTCAACCGT
CTGCTCGAAC GCAACGGGTT CCCAGCCGTG GTCACTGAGG TCGGCGACTT CGCCGACATT
CTGCGGCGCC GACTGTTTGA CTCCGAACCC GCCGCCGAGG TGGTCAGCGC CACCGCAGCC
CAGTACGACT CGGTTTTCGC CGACAAAGGG TGGAGCAAGA ACGTCTGGGA CACCATCGGC
GCCACCTGGC GTGATCGCTG GTCCGACGAG GTAGCCGCCT GCTACCCGTT CCATCCGATG
CTGATGGCGT TGGCCAAAGA CGAGTGGAGC AAGGTCACCG GGTTTCAGCG GGTCCGCTCC
ACGATTCGCA TTTTCGCGGC CACGGTCTAC GCCCAACAGG AGCGCGGCAA GGACGGTCAG
TGGGTTCCCA CACTCATCGG GCCCGGTGAC TTTCCGCTGT CGGATTCCGC AGTGCGCGAG
GCGATCCTGG GTAGCGGTCT GGTCGAAGAC GAACGCACAA TCGCGAACTA TCGCAGTCTC
GCCGAGATCG AGGTGGTCAA TCACACCGGC AGCAGCGGCA CTGCTCGCCG TCAGGACCTC
AGCCGTGAGC CGCTGCTGTG GAGTGAGGCC AACCCGCGGG CCGCTGAGCG TGCAGCGACA
TTCATCTTCA TGGCCAGCAT CGTCGGCACA CTGCGTCCGG GCCGCGGCCG CGGCGCCAGC
GCACCGGAGG TTAAAGCCGC CACGAGCGTT CCTGATGTCG GATACACCAT CACCGACGCC
GACGCCGTGG TCGCCGAGCT GGTCGACGTC GACCGCGGAC TGAGCGCTTT AGACATCATT
CCGGGCCAAG GCAATAACAA GCCGGCACGC TACTTCCTTT CCACGCGGCT CACCCACCGA
ATGCTGGTCA ACAACATTCG CCGCACCATC ACCGAGGCCG AGCGCGACGC CGTGCTCGTT
GAGTTCGCCC GGCGGCTCGC TAGCACCGGT CCCTTCCGGG AACAGCGGTT CGTCGCGGCC
GACGCCACCC GCACCCCCGT CGAGGTGCTG TCCACCGCCG GTCTGGACAC CGCGTTCACC
ACGCGGCTGA TCGTGCTGGA TCCCGCGCAG TTCAGCCTGC GCAACGGCGC TGAGGCGGCC
ACCCTCGAAG CGTTGCAGGC TGCGATGGGC CTGGGGCAGG GGCCGCAGCA GCTTCCCGTG
CAGTGGGCTA GTAGCGCGGT GTATGCGGTG GTGAACACCC AGCGCCGCAG CCTCGCGCGC
AGCGTGGCCG CCGAGTACCT GGCCCGCAGC AAGGCCCTCG CCGCGCCGGA GATTCAGGCC
GACGAAGAAT TGAAGGCCAC TGGTACGAAA GAGCTTTTGG CTGCCAAAGA ACAACTCGAG
AAGGCCTTGA AACGGGCGTA TCAACACGTC GCCTACGTCG CACAGCCAGA CCCTGACGGG
GAGCGGTACC TCGACCAGCT CACCTTCGAC GACGACACCC TCACCGCGCT CAACGGCACG
ATCGTGTGGA AGGGGTTGGC CGACCGCGAC AAGGTGTTCG ACGCCGGTCA GTTCGGCGCG
CAGGCCCTGT TGCACAACCT GCGCGAACAG GACTACGGCA AGACACTGGC CGACATCCGC
GCCGCGTTCT ACAGCGCTCC ACGGCTACCG CTGCTCTACG AGGCGGACCG TGACCTGCAG
CAGGCCATCT ATGACGCGGT GAGCCAGGGC TCGTTGCGCA TCGTCGACGC GGCCGGTACT
GCGGTCGAGG TCACCACTCC CGGGCAGGTG AACCTCACCA GCACTGCCCT GAGAATCGCG
GCGCCGGCTC CTGCTGCCGG AGAAGGCAAT GGCGCGCCTG CTAGTGGGCA AGGAGGAGAG
CCGGCGGGCG GCTCCGCCGG CTCGGGCGCG ACGGGCTCAG CCGGTGGGGC TGGCGGAGGG
TCGGCACCGG CCGCCGCGAC GGGTGGTTCC GGTGCCGCCG CGTCAGCTGC TGGCGGCGGG
CAGCCCGGTC CGGCTGCCGA AGCTGCTGAC CGGCAAGTGG CGTTCTCGTT CACTAGCAAC
CTGCTGGCCG GCGCTGAAAC GGCGGACGGT TTCGCCGCCC TGTTCCGGGC GTTCTACATG
GCGCTCGATG AGCGCCAGAT CAGCTACCTG CAGGGAACCC TGCAGCTCGT CGTTGATTCC
ACGGTTGTCG ACCAAATCGC CCCGCTGCTG GCCGATTTGG GGATCACCGC GACGATCAAG
AACATCTGA
 
Protein sequence
MTTPVTPWWK ALKLRQEILS ASGQIDDVQM SLFAAVHGAG ATRPPYADAG YYGDITHPTE 
RLVDLLTEIA IRIGGGEDYM KARAVTRLDQ GMGGGKSHAC IGAFHLAANP EALLGTELGK
QVAARAKAKI GQPLAENLRG PHVVVLPCDN MTPGASVQEY DGPAVNLYER FLWRLFSKDY
SLFERYQPFW SDKHKIAEAI RAVNRPVLII VDEVLDYIGN GLDGANKPDL AAQDMAFLRA
LLDVVNDVPN VAMLMVMIAS DADQTALSKV AQERRDDLNR LLERNGFPAV VTEVGDFADI
LRRRLFDSEP AAEVVSATAA QYDSVFADKG WSKNVWDTIG ATWRDRWSDE VAACYPFHPM
LMALAKDEWS KVTGFQRVRS TIRIFAATVY AQQERGKDGQ WVPTLIGPGD FPLSDSAVRE
AILGSGLVED ERTIANYRSL AEIEVVNHTG SSGTARRQDL SREPLLWSEA NPRAAERAAT
FIFMASIVGT LRPGRGRGAS APEVKAATSV PDVGYTITDA DAVVAELVDV DRGLSALDII
PGQGNNKPAR YFLSTRLTHR MLVNNIRRTI TEAERDAVLV EFARRLASTG PFREQRFVAA
DATRTPVEVL STAGLDTAFT TRLIVLDPAQ FSLRNGAEAA TLEALQAAMG LGQGPQQLPV
QWASSAVYAV VNTQRRSLAR SVAAEYLARS KALAAPEIQA DEELKATGTK ELLAAKEQLE
KALKRAYQHV AYVAQPDPDG ERYLDQLTFD DDTLTALNGT IVWKGLADRD KVFDAGQFGA
QALLHNLREQ DYGKTLADIR AAFYSAPRLP LLYEADRDLQ QAIYDAVSQG SLRIVDAAGT
AVEVTTPGQV NLTSTALRIA APAPAAGEGN GAPASGQGGE PAGGSAGSGA TGSAGGAGGG
SAPAAATGGS GAAASAAGGG QPGPAAEAAD RQVAFSFTSN LLAGAETADG FAALFRAFYM
ALDERQISYL QGTLQLVVDS TVVDQIAPLL ADLGITATIK NI