Gene Mmcs_5493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5493 
Symbol 
ID4114361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008147 
Strand
Start bp69570 
End bp72779 
Gene Length3210 bp 
Protein Length1069 aa 
Translation table11 
GC content66% 
IMG OID638034648 
Producthelicase-like protein 
Protein accessionYP_642649 
Protein GI108802453 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0553] Superfamily II DNA/RNA helicases, SNF2 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0695829 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTTG ACCGCGGTCA ACGGGTCCGG GTTCAGGCGC CGGGCGTACC GGAGTTCGCG 
ACCGTTCGCT TCGCCATTCC TGGCGATTCT GATGGTTCGT GGGACCTGAT CCTGGTCGAT
GACGAGGACC GCCGCCACGA GATTAACCTT GCCGCTGGCG ACACCGAGAC GGTGCGCAAG
CTCGTCAGCG ACGGGCACGG CGATTCCGCG CGGGTGCTGG CCGCGATGTG GACTCAGTGG
ATGGACGCTG CGGCGACCAA CGCGGAGTCC AGCGCGATGG CGTCGATGCC GCTCAAGCCG
TACGCCCATC AGACCACCGC GGTGTATGGG GCGATGCTGC CGCAACCTCA GCTGCGGTTT
CTGCTGGCTG ATGAACCCGG CACCGGCAAG ACGATCATGG CCGGCCTGTA CCTGCGTGAA
ATGCAGCGGC TCGGATTGGT CAAGCGTGCG GTGATCGTGT GCCCCGCCAA CTTGGCCTCG
AAGTGGGTTG ATGACTTCTC ACGGCTACTC GGTGGGGGGC TGCGGCAGAT CACCTCGGCG
ACGGTGCGCG AGGATGCGCT CAACTCGAAC GACCTGTGGG TGGTGTCGCT GGAATTGGCC
GCGGTCAATC CTGCTGTTCA GGACGCGCTG CGCCCCGACA AAGCGGGCTG GGATCTGGTG
GTGTTCGACG AGGCGCACCG GCTGACCCCG ACCGCGGCCG GTTTCCATCA GGTGGGTCGG
CTCCTGGCGA AGAACACGCC GCGGGCGTTG TTGATGACGG CGACCCCGCA CCGCGGTAAG
GAATGGCTGT TTCGGCATCT ACTGCACCTC GTCGACCCCG CCATCTATCC GGATCCCGGC
AGCGACCCGA ATGTCGCGCT GTCGGCGCTG CGGCCGGGCC CGATCCACTT CCTGCGCAGG
ATGAAGGAGG ATCTGGTCGA TTACGACGGC AAGACCCGGT TGTTCAAGGG TCGGTCCGCC
CACAACCACA GCGTTGCCCT CTCCCAGGTC GAGTACGCCT ACTACCAGGC TGCGCTGGAC
ATGGTGGAGC AGTTCTTTCC GCAGTCGGCG CAGCCTCTTG CGCGGATGGT GTACGGCAAG
CGGGCCGCGT CGAGCTTGTA TGCGCTAGCC GAGACGCTGC GGCGCCGTTC CACCCACATG
GGGGAGATGA GCGAAGCCGA GGCGGCGCTG ATCGCCGAAA GCAGTAACGA CGGCGATGAG
AGCGAAGTCG ACGAGGCGAA GGTCATCCAC ACCGGATCGA CATCGACACG TGCGGAGCGA
ACGGCGATCA AGGGGCTGGT CGATCGGATC GACGCTACCC TCGCCGATGC GGCGTGGCTT
CCCTCCAAGT GGCGGCGCCT GACTGAGGAC TGTCTGGCCA AGCACTCGAT CTTGCCGGGC
AACGGCGAGC AGGCGGTGGT GTTCACCGAG TACGCCGATT CGGCGCAGTG GATCGCTGAC
CGGCTCAAGG CCGAGGGTTA CACCGCCCAG ATGTACTCGG GTCGGCAGAG CAAACCGGAT
CGCGATGAAG TCCGAAAAGC GTTCATGCGC GGCGATTTTC AGATAATCGT GACAACCGAT
GCGGGCAACG AAGGCATCGA TCTGCAGGCC GCGCACGTGC TGGTGAACTA CGACATCCCA
TGGTCACTGG TCCGGCTGGA GCAGCGGATG GGCCGCATCC ACCGGGTCGG TCAGCAGCGC
GAGGTCCATC TGTACAACCT GGTCGCCACC GACACTCGTG AAGGCGAGAC CCTGTTGCGA
CTGCTCGACA ATTTCGTTAC CGCCGCCAAC GAGCTGCACG GACAAATGTT CGACAGCCTC
TCGGCGGTCG CCGAGATCAC CGGCGTGGAC TACGACCGCT GGCTGACCGA TCTGTACGGC
AATGACGAGG CCAAGAAGCA GACCGCGATC GACGCCGCTC GGGCGGTGCA GGCCCAGGAA
CTGACTCGCG TCGCTCGCCA AGTGCGGGAC AACGAACGCC GGCTGGCCAG TCAGGTCGAC
GCCGTGGCCG CGCTGACTCT GCTGCAGCGG GATCTGTTCG CCCGAATCAA CCCCGCGATC
GTTGAGGCCT ACCTGGATCG GCTGTCGGCG GCGGGTGTGC TGCAGGCAGC GCCCACCGCT
GCGGGACCTG GCTTCCGGCG ACTGTCACTG GCTGGGCAGA GGCTGCCCGA CTCCCTTGGC
GGGGGCACTG ATGCCCATAT CGCCACCAGC GGCGAGGCCG TGCGCGCCGC CGAGGAAAGC
ATTGACGTCG GTGACACCGT GACCCTGGGG CCAGGGGAGC CGGCGTTCAC CGATTTGATC
GCGCTGGCCG ACCGCGCTCT GGCTGAAGAT CTCTACCGGT CGGGCGCGGC CGCTGACCCC
AGCAGTCTGA CGCCCTATGA CCTGTATGCC TACGAGGCCA CCATGACCGA GAGCGACGGT
AAGCGCGCCA GTGTGTGGGC GACGCTGGTG AAGGTCGATG ACAGCGGCAA CGCGCGCGCG
GTGCGGTGGG AGACGCTGGC CAACCTGGTG CCCACGGACA TCGCGGGCAC CGCAGCTCAT
CCGGCACGGG AGGGCCGCGC CCAGGAGGTG GCGCGGGAGG TTGCTGATAC CACGGTGAGC
GAGCACCGCA AGGTGCGTAC CGATTGGTTC GGCCAGGCGC GGCGGGACTT GAACAACCTT
CCGTTGAATC TGACCGAGGG CATTGAGGAC CGCGACACCC GGGTGGCGCT GCGACGCCAG
TTTCAGGTCC AGACCTCCGC CCGGATCGCG GAGTTGGAGC GCCTGACCAA TGTGCAGCTG
ACCGAGCCGA AGCTGGTGGG TCGCATTCGG GTCCTGGCGG CCGCGGACGC GGGTACGCAG
GCCGAGATCG ACGCCGAGAT GGTGTCGATG AGTCACGTCC GTCAGCTGTT GGTCGACGAT
GGCTGGGTCG TCGAGGACGT GCACACCGAA GGTCGCGGGT ACGACTTGGA GGCGCGCCGC
AACAGCCAGA TCCGCCACAT CGAGGTCAAA GGTGTGCTCG ACAGCGCGGC CAGCAACGGG
ATCCGGATGA CCGGCAATGA GGTTCTCATC GCCACCCAGC ACCGCCGCAG CTACTGGCTG
TATGTGATCG ACCAGTGCGC CGATGGGGTC GGCCGGTTTT TCGGCGCCTA CGAAGATCCC
GCCACCCTGT TTTCCACCGA CATGACCGGG GATGCGATCT TCCGCGTTCC CGGCAGCAGC
CTGAAGAACG CGCCGGGAAG CAACCTATGA
 
Protein sequence
MTFDRGQRVR VQAPGVPEFA TVRFAIPGDS DGSWDLILVD DEDRRHEINL AAGDTETVRK 
LVSDGHGDSA RVLAAMWTQW MDAAATNAES SAMASMPLKP YAHQTTAVYG AMLPQPQLRF
LLADEPGTGK TIMAGLYLRE MQRLGLVKRA VIVCPANLAS KWVDDFSRLL GGGLRQITSA
TVREDALNSN DLWVVSLELA AVNPAVQDAL RPDKAGWDLV VFDEAHRLTP TAAGFHQVGR
LLAKNTPRAL LMTATPHRGK EWLFRHLLHL VDPAIYPDPG SDPNVALSAL RPGPIHFLRR
MKEDLVDYDG KTRLFKGRSA HNHSVALSQV EYAYYQAALD MVEQFFPQSA QPLARMVYGK
RAASSLYALA ETLRRRSTHM GEMSEAEAAL IAESSNDGDE SEVDEAKVIH TGSTSTRAER
TAIKGLVDRI DATLADAAWL PSKWRRLTED CLAKHSILPG NGEQAVVFTE YADSAQWIAD
RLKAEGYTAQ MYSGRQSKPD RDEVRKAFMR GDFQIIVTTD AGNEGIDLQA AHVLVNYDIP
WSLVRLEQRM GRIHRVGQQR EVHLYNLVAT DTREGETLLR LLDNFVTAAN ELHGQMFDSL
SAVAEITGVD YDRWLTDLYG NDEAKKQTAI DAARAVQAQE LTRVARQVRD NERRLASQVD
AVAALTLLQR DLFARINPAI VEAYLDRLSA AGVLQAAPTA AGPGFRRLSL AGQRLPDSLG
GGTDAHIATS GEAVRAAEES IDVGDTVTLG PGEPAFTDLI ALADRALAED LYRSGAAADP
SSLTPYDLYA YEATMTESDG KRASVWATLV KVDDSGNARA VRWETLANLV PTDIAGTAAH
PAREGRAQEV AREVADTTVS EHRKVRTDWF GQARRDLNNL PLNLTEGIED RDTRVALRRQ
FQVQTSARIA ELERLTNVQL TEPKLVGRIR VLAAADAGTQ AEIDAEMVSM SHVRQLLVDD
GWVVEDVHTE GRGYDLEARR NSQIRHIEVK GVLDSAASNG IRMTGNEVLI ATQHRRSYWL
YVIDQCADGV GRFFGAYEDP ATLFSTDMTG DAIFRVPGSS LKNAPGSNL