Gene Mkms_4331 details


Gene Information

Locus tag: Mkms_4331
Symbol:
ID: 4612272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 4546973
End bp: 4550608
Gene Length: 3636 bp
Protein Length: 1211 aa
Translation table: 11
GC content: 70%
IMG OID: 639794015
Product: transcription-repair coupling factor
Protein accession: YP_940312
Protein GI: 119870360
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1197] Transcription-repair coupling factor (superfamily II helicase)
TIGRFAM ID: [TIGR00580] transcription-repair coupling factor (mfd)
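
The coordinate and length fields in this record agree with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4546973, 4550608)
print(length)                  # 3636
print(protein_length(length))  # 1211
```

Both values match the Gene Length (3636 bp) and Protein Length (1211 aa) fields listed in the record.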


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.376638
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGTAT CGGGGCTTCA TCATGTCCAG ACCCCGATTG CGGGGCTCAT CGAGTTGGCG 
TTGCGCGATC CGTGTCTGGC CGATCTGTCC GCACGCGCCG CCGACAAACC CGACGATCTC
GCGATGGTGG GTCCGGCCAG TGCCCGGCTC CTCGTCACCG CCGCGCTCGC GCAGGCCGGC
CCGCTGCTCG TCGTCACCGC CACCGGCCGG GAGGCCGACG ATCTGACCGC GGAGTTGCGC
GGGGTGATCG GCGACTCCGC GGCGCTGTTC CCGTCCTGGG AGACGCTTCC GCACGAGCGG
TTGTCACCCG GTGTGGACAC CGTCGGCGCG CGGATGATGC TGTTGCGCCG GCTCGCGCAT
CCCGATGACG CCCGCCTCGG CCCGCCGCTG CGGGTGGTGG TCACCACGGC CCGCTCACTG
GTGCAGCCGA TGGCGCCGGG TCTGGCAGAG GTCGAGCCGG TGACGCTGAC CGTGGGCGCC
GAGATGGATT TCGACGGCGT CATCAAGCGG CTGGTCGATC TGGCCTACAC GCGCTGCGAC
ATGGTGGCCA AGCGTGGCGA GTTCGCGGTC CGCGGCGGCA TCCTCGACGT GTTCCCGCCG
ACCGCCGAAC ATCCGGTGCG CGTCGAGTTC TGGGGTGACG AGATCTCCGA GATGCGGATG
TTCGCCGTCG CCGATCAGCG GTCCATCCCC GAGATCGAGG TCGACACCGT CATCGCGGTG
CCGTGCCGCG AGCTGCTGAT GTCCGACGAG GTGAGAAGTC GTGCTGCCGT GCTCGCTGCG
GAGCACCCGA CCCACGAGAA CTCCGTGCCG GGCAGCGTGC CCGACATGCT GGCCAAACTC
GCCGAGGGAA TCCCGGTCGA CGGGATGGAG GCGCTGCTGC CGCTGTTGCG GCCGACCGAT
TTGGCGATGC TGTCCGACCA CCTGCCGGAC GGCGCGCCGA TCCTGGTGTG CGATCCGGAG
AAGGTGCGCA CCCGCGCCGG GGATCTGATC AAGACCGGTC GGGAGTTCCT CGAGGCGTCG
TGGTCGACGG CCGCCGTCGG TGGTGCCGCG CCGATCGACC TGGAGGCGAT GGGCGCGTCG
GGCTTCCTCG GTTTCGAGGA GGTGCGCACC GGCGCGCGGG CCGGCGGCCA CCCGTGGTGG
ACGCTGAGCC AGCTGTCGGA CGAGAAGGCG ATCGAACTCG ACATCCGGTC GGCGCCGTCG
GCGCGCGGGC AGCAGTCGTC GGTCGAGGAG ATCTTCGCGA TGCTCCGCGC GCACGTGGCG
ACCGGCGGCT ACGGCGCGGT CGTCACCCCC GGTGCGGGCA CCGCGCACCG CGTCGTCGAG
CAGCTCGGCG AAAACGACAC GCCCGCAACG ATGCTCGAGC CCGGCGAGGA GCCGAAGGCC
GGTGTCGTCG GGGTGCTGAA GGGGCCGCTG CACGACGGGG TGGTGCTGCC GGGCGCGAAC
CTGGTGATCG TCACCGAGGC CGATCTGACC GGCAGCCGCG TCACCGCGAC CGAGGGCAAA
CGCCTTGCCG CCAAACGGCG CAACGTCGTC GACCCGCTGG CCCTGACGGC GGGGGACCTG
GTGGTCCACG ATCAGCACGG CATCGGCCGG TTCGTCGAGA TGACCGAACG CGTCGTCGGC
GGTGCGCGCC GCGAGTACCT GGTGCTGGAG TACGCGTCGA GTAAACGCGG GGGCGGGTCC
GACCGGCTCT ACGTGCCGAT GGACTCCCTC GACCAGCTGT CGCGCTACGT CGGCGGTGAG
GCGCCGTCGC TGTCCAAACT CGGCGGCAGC GACTGGGCCA ACACCAAGAC CAAGGCGCGC
CGGGCCGTGC GGGAGATCGC CAGCGAGCTG GTGGCGCTCT ACGCCAAACG GCAGGCCGCG
CCCGGCCACG CCTTCAGCCC GGACACCCCG TGGCAGAACG AGATGGAGGA CGCGTTCGGG
TTCACCGAGA CCGTCGACCA GCTCACCGCC ATCGAGGAGG TCAAGGCCGA TATGGAGAAG
CCGGTCCCGA TGGACCGGGT GATCTGCGGC GACGTCGGTT ACGGCAAGAC CGAGATCGCG
GTGCGCGCGG CGTTCAAGGC GGTGCAGGAC GGTAAGCAGG TGGCGGTGCT CGTGCCGACC
ACGCTGCTGG CCGACCAGCA CCTGCAGACG TTCACCGCGC GGATGGCCGG CTTCCCTGTG
ACCGTGAAAG GCCTGTCGCG CTTCACCGAC CCCGCCGAGT CCCGCGCCGC GCTCGAGGGC
ATGAAGGACG GCTCGGTCGA CATCGTCATC GGCACCCACC GCCTGCTGCA GACCGGGGTG
GTGTGGAAGG ACCTCGGGCT GATCATCGTC GACGAGGAAC AGCGGTTCGG CGTCGAGCAC
AAGGAGCACA TCAAATCGAT GCGCACCCAC GTCGACGTGC TGACCATGAG CGCCACGCCG
ATCCCGCGCA CGCTGGAGAT GAGCCTGGCC GGTATCCGCG AGATGTCGAC GATCCTCACG
CCGCCCGAGG AGCGCTACCC GGTGCTGACC TACGTCGGGC CGCAGGACGA CAAACAGGTC
GCCGCGGCGC TGCGGCGCGA ACTGCTGCGC GACGGGCAGG CGTTCTACAT CCACAACCGG
GTGCGCACCA TCGATCAGGC GGCGTCGAAG ATCGCCGCGC TGGTGCCCGA GGCCCGGGTC
GTCGTCGCCC ACGGCCAGAT GCCCGAGGAA CTGCTGGAGC GCACGGTCGA GGGGTTCTGG
AACCGCGAGT ACGACATCCT CGTGTGCACG ACGATCGTCG AGACCGGCCT CGACATCTCG
AACGCCAACA CGCTGATCGT CGAGCGCGCC GACACCTTCG GCCTGTCGCA GCTGCACCAG
CTGCGGGGCC GGGTGGGCCG CAGCCGCGAA CGCGGATACG CCTACTTCCT GTATCCGCCC
GAGCAGCCGT TGACCGAGAC CGCGTACGAC CGGCTGGCCA CCATCGCGCA GAACAACGAG
CTCGGCGCGG GTATGGCCGT GGCCATGAAG GACCTCGAGA TCCGCGGGGC GGGCAACGTG
CTCGGCGCCG AACAGTCCGG TCACGTGGCC GGGGTCGGGT TCGACCTCTA CGTGCGCCTG
GTCGGTGAGG CCGTCGAGGC CTATCGAGCG GCCGCCGACG GGAAAACCGT TGCGACACCG
CAGGAAACGA AGGATGTTCG GATCGACCTG CCGGTCGATG CGAACCTGCC GCCGGACTAC
ATCGGCAGTG ACCGGCTGCG GCTCGAGGCC TACCGGCGGC TGGCCGCCGC CCAGGACGAC
GCCGGCGTCG ACGCGGTCAT CGACGAACTC GTCGACCGCT ACGGACCGCT GCCCGAACCG
GCGCAGCGCC TGGTCGGCGT CGCGCGGCTG CGGCTGGTGT GCCGCGAGTA CGGCATCACC
GACGTCAGCT CGGTGTCGGC GTCGACGGTC AAGCTGTCGC CGATGGAGCT GCCGGACTCG
GCGCTGTTGC GGCTCAAGCG GATGTATCCC GGGGCCACCT ACCGGGCGAC GACCGGGACG
GTGTCGGTGC CCATCCCGCG CGCCACCGAC AGCGTCGGCG CACCGCGCAT CCGGGACGCC
GAACTGGTCG CGATGGTGGC CGGTTTGGTT CTCGCGCTCA ATGGAAAACC GCAGGCGCAG
ATCGATACGG CGAAGTTCGG GGGGTCACGA GCATGA
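
The gene sequence is printed in blocks of ten bases separated by spaces. A short sketch of how such a listing might be cleaned up and its GC fraction computed is given below. The helper names `clean_sequence` and `gc_fraction` are illustrative assumptions, and the example runs only on the first line of the listing rather than the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGACCGTAT CGGGGCTTCA TCATGTCCAG ACCCCGATTG CGGGGCTCAT CGAGTTGGCG"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}% GC")
```

Applying the same two functions to the whole listing should give a value comparable to the GC content field in the record, which is reported rounded to a whole percent.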
 
Protein sequence
MTVSGLHHVQ TPIAGLIELA LRDPCLADLS ARAADKPDDL AMVGPASARL LVTAALAQAG 
PLLVVTATGR EADDLTAELR GVIGDSAALF PSWETLPHER LSPGVDTVGA RMMLLRRLAH
PDDARLGPPL RVVVTTARSL VQPMAPGLAE VEPVTLTVGA EMDFDGVIKR LVDLAYTRCD
MVAKRGEFAV RGGILDVFPP TAEHPVRVEF WGDEISEMRM FAVADQRSIP EIEVDTVIAV
PCRELLMSDE VRSRAAVLAA EHPTHENSVP GSVPDMLAKL AEGIPVDGME ALLPLLRPTD
LAMLSDHLPD GAPILVCDPE KVRTRAGDLI KTGREFLEAS WSTAAVGGAA PIDLEAMGAS
GFLGFEEVRT GARAGGHPWW TLSQLSDEKA IELDIRSAPS ARGQQSSVEE IFAMLRAHVA
TGGYGAVVTP GAGTAHRVVE QLGENDTPAT MLEPGEEPKA GVVGVLKGPL HDGVVLPGAN
LVIVTEADLT GSRVTATEGK RLAAKRRNVV DPLALTAGDL VVHDQHGIGR FVEMTERVVG
GARREYLVLE YASSKRGGGS DRLYVPMDSL DQLSRYVGGE APSLSKLGGS DWANTKTKAR
RAVREIASEL VALYAKRQAA PGHAFSPDTP WQNEMEDAFG FTETVDQLTA IEEVKADMEK
PVPMDRVICG DVGYGKTEIA VRAAFKAVQD GKQVAVLVPT TLLADQHLQT FTARMAGFPV
TVKGLSRFTD PAESRAALEG MKDGSVDIVI GTHRLLQTGV VWKDLGLIIV DEEQRFGVEH
KEHIKSMRTH VDVLTMSATP IPRTLEMSLA GIREMSTILT PPEERYPVLT YVGPQDDKQV
AAALRRELLR DGQAFYIHNR VRTIDQAASK IAALVPEARV VVAHGQMPEE LLERTVEGFW
NREYDILVCT TIVETGLDIS NANTLIVERA DTFGLSQLHQ LRGRVGRSRE RGYAYFLYPP
EQPLTETAYD RLATIAQNNE LGAGMAVAMK DLEIRGAGNV LGAEQSGHVA GVGFDLYVRL
VGEAVEAYRA AADGKTVATP QETKDVRIDL PVDANLPPDY IGSDRLRLEA YRRLAAAQDD
AGVDAVIDEL VDRYGPLPEP AQRLVGVARL RLVCREYGIT DVSSVSASTV KLSPMELPDS
ALLRLKRMYP GATYRATTGT VSVPIPRATD SVGAPRIRDA ELVAMVAGLV LALNGKPQAQ
IDTAKFGGSR A
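
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the codon-to-residue mapping by hand from the standard compact layout (bases ordered T, C, A, G). As far as the codon assignments go, translation table 11 uses the same mapping as the standard code and differs mainly in the set of permitted start codons. This gene begins with ATG, so no special handling of alternative starts is attempted here. The `translate` function is a hypothetical helper, not a library call.

```python
from itertools import product

BASES = "TCAG"
# Residues for all 64 codons in TTT, TTC, TTA, TTG, TCT, ... order; '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACCGTAT CGGGGCTTCA TCATGTCCAG"))  # MTVSGLHHVQ
```

The output matches the first ten residues of the protein sequence above. The final codon of the gene, TGA, is a stop codon, which is why the 3636 bp gene yields 1211 residues rather than 1212.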