Gene Information |
Locus tag | Mkms_4331 |
Symbol | |
ID | 4612272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4546973 |
End bp | 4550608 |
Gene Length | 3636 bp |
Protein Length | 1211 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639794015 |
Product | transcription-repair coupling factor |
Protein accession | YP_940312 |
Protein GI | 119870360 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
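The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4546973
END_BP = 4550608
GENE_LENGTH_BP = 3636
PROTEIN_LENGTH_AA = 1211


def span_length(start: int, end: int) -> int:
    """Length of an inclusive coordinate range (assumed 1-based)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare derived values with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the listed gene length matches the coordinate span, and the listed protein length matches the codon count with the stop codon excluded.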
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.376638 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTAT CGGGGCTTCA TCATGTCCAG ACCCCGATTG CGGGGCTCAT CGAGTTGGCG TTGCGCGATC CGTGTCTGGC CGATCTGTCC GCACGCGCCG CCGACAAACC CGACGATCTC GCGATGGTGG GTCCGGCCAG TGCCCGGCTC CTCGTCACCG CCGCGCTCGC GCAGGCCGGC CCGCTGCTCG TCGTCACCGC CACCGGCCGG GAGGCCGACG ATCTGACCGC GGAGTTGCGC GGGGTGATCG GCGACTCCGC GGCGCTGTTC CCGTCCTGGG AGACGCTTCC GCACGAGCGG TTGTCACCCG GTGTGGACAC CGTCGGCGCG CGGATGATGC TGTTGCGCCG GCTCGCGCAT CCCGATGACG CCCGCCTCGG CCCGCCGCTG CGGGTGGTGG TCACCACGGC CCGCTCACTG GTGCAGCCGA TGGCGCCGGG TCTGGCAGAG GTCGAGCCGG TGACGCTGAC CGTGGGCGCC GAGATGGATT TCGACGGCGT CATCAAGCGG CTGGTCGATC TGGCCTACAC GCGCTGCGAC ATGGTGGCCA AGCGTGGCGA GTTCGCGGTC CGCGGCGGCA TCCTCGACGT GTTCCCGCCG ACCGCCGAAC ATCCGGTGCG CGTCGAGTTC TGGGGTGACG AGATCTCCGA GATGCGGATG TTCGCCGTCG CCGATCAGCG GTCCATCCCC GAGATCGAGG TCGACACCGT CATCGCGGTG CCGTGCCGCG AGCTGCTGAT GTCCGACGAG GTGAGAAGTC GTGCTGCCGT GCTCGCTGCG GAGCACCCGA CCCACGAGAA CTCCGTGCCG GGCAGCGTGC CCGACATGCT GGCCAAACTC GCCGAGGGAA TCCCGGTCGA CGGGATGGAG GCGCTGCTGC CGCTGTTGCG GCCGACCGAT TTGGCGATGC TGTCCGACCA CCTGCCGGAC GGCGCGCCGA TCCTGGTGTG CGATCCGGAG AAGGTGCGCA CCCGCGCCGG GGATCTGATC AAGACCGGTC GGGAGTTCCT CGAGGCGTCG TGGTCGACGG CCGCCGTCGG TGGTGCCGCG CCGATCGACC TGGAGGCGAT GGGCGCGTCG GGCTTCCTCG GTTTCGAGGA GGTGCGCACC GGCGCGCGGG CCGGCGGCCA CCCGTGGTGG ACGCTGAGCC AGCTGTCGGA CGAGAAGGCG ATCGAACTCG ACATCCGGTC GGCGCCGTCG GCGCGCGGGC AGCAGTCGTC GGTCGAGGAG ATCTTCGCGA TGCTCCGCGC GCACGTGGCG ACCGGCGGCT ACGGCGCGGT CGTCACCCCC GGTGCGGGCA CCGCGCACCG CGTCGTCGAG CAGCTCGGCG AAAACGACAC GCCCGCAACG ATGCTCGAGC CCGGCGAGGA GCCGAAGGCC GGTGTCGTCG GGGTGCTGAA GGGGCCGCTG CACGACGGGG TGGTGCTGCC GGGCGCGAAC CTGGTGATCG TCACCGAGGC CGATCTGACC GGCAGCCGCG TCACCGCGAC CGAGGGCAAA CGCCTTGCCG CCAAACGGCG CAACGTCGTC GACCCGCTGG CCCTGACGGC GGGGGACCTG GTGGTCCACG ATCAGCACGG CATCGGCCGG TTCGTCGAGA TGACCGAACG CGTCGTCGGC GGTGCGCGCC GCGAGTACCT GGTGCTGGAG TACGCGTCGA GTAAACGCGG GGGCGGGTCC GACCGGCTCT ACGTGCCGAT GGACTCCCTC GACCAGCTGT CGCGCTACGT CGGCGGTGAG GCGCCGTCGC TGTCCAAACT CGGCGGCAGC GACTGGGCCA ACACCAAGAC CAAGGCGCGC 
CGGGCCGTGC GGGAGATCGC CAGCGAGCTG GTGGCGCTCT ACGCCAAACG GCAGGCCGCG CCCGGCCACG CCTTCAGCCC GGACACCCCG TGGCAGAACG AGATGGAGGA CGCGTTCGGG TTCACCGAGA CCGTCGACCA GCTCACCGCC ATCGAGGAGG TCAAGGCCGA TATGGAGAAG CCGGTCCCGA TGGACCGGGT GATCTGCGGC GACGTCGGTT ACGGCAAGAC CGAGATCGCG GTGCGCGCGG CGTTCAAGGC GGTGCAGGAC GGTAAGCAGG TGGCGGTGCT CGTGCCGACC ACGCTGCTGG CCGACCAGCA CCTGCAGACG TTCACCGCGC GGATGGCCGG CTTCCCTGTG ACCGTGAAAG GCCTGTCGCG CTTCACCGAC CCCGCCGAGT CCCGCGCCGC GCTCGAGGGC ATGAAGGACG GCTCGGTCGA CATCGTCATC GGCACCCACC GCCTGCTGCA GACCGGGGTG GTGTGGAAGG ACCTCGGGCT GATCATCGTC GACGAGGAAC AGCGGTTCGG CGTCGAGCAC AAGGAGCACA TCAAATCGAT GCGCACCCAC GTCGACGTGC TGACCATGAG CGCCACGCCG ATCCCGCGCA CGCTGGAGAT GAGCCTGGCC GGTATCCGCG AGATGTCGAC GATCCTCACG CCGCCCGAGG AGCGCTACCC GGTGCTGACC TACGTCGGGC CGCAGGACGA CAAACAGGTC GCCGCGGCGC TGCGGCGCGA ACTGCTGCGC GACGGGCAGG CGTTCTACAT CCACAACCGG GTGCGCACCA TCGATCAGGC GGCGTCGAAG ATCGCCGCGC TGGTGCCCGA GGCCCGGGTC GTCGTCGCCC ACGGCCAGAT GCCCGAGGAA CTGCTGGAGC GCACGGTCGA GGGGTTCTGG AACCGCGAGT ACGACATCCT CGTGTGCACG ACGATCGTCG AGACCGGCCT CGACATCTCG AACGCCAACA CGCTGATCGT CGAGCGCGCC GACACCTTCG GCCTGTCGCA GCTGCACCAG CTGCGGGGCC GGGTGGGCCG CAGCCGCGAA CGCGGATACG CCTACTTCCT GTATCCGCCC GAGCAGCCGT TGACCGAGAC CGCGTACGAC CGGCTGGCCA CCATCGCGCA GAACAACGAG CTCGGCGCGG GTATGGCCGT GGCCATGAAG GACCTCGAGA TCCGCGGGGC GGGCAACGTG CTCGGCGCCG AACAGTCCGG TCACGTGGCC GGGGTCGGGT TCGACCTCTA CGTGCGCCTG GTCGGTGAGG CCGTCGAGGC CTATCGAGCG GCCGCCGACG GGAAAACCGT TGCGACACCG CAGGAAACGA AGGATGTTCG GATCGACCTG CCGGTCGATG CGAACCTGCC GCCGGACTAC ATCGGCAGTG ACCGGCTGCG GCTCGAGGCC TACCGGCGGC TGGCCGCCGC CCAGGACGAC GCCGGCGTCG ACGCGGTCAT CGACGAACTC GTCGACCGCT ACGGACCGCT GCCCGAACCG GCGCAGCGCC TGGTCGGCGT CGCGCGGCTG CGGCTGGTGT GCCGCGAGTA CGGCATCACC GACGTCAGCT CGGTGTCGGC GTCGACGGTC AAGCTGTCGC CGATGGAGCT GCCGGACTCG GCGCTGTTGC GGCTCAAGCG GATGTATCCC GGGGCCACCT ACCGGGCGAC GACCGGGACG GTGTCGGTGC CCATCCCGCG CGCCACCGAC AGCGTCGGCG CACCGCGCAT CCGGGACGCC GAACTGGTCG CGATGGTGGC CGGTTTGGTT CTCGCGCTCA ATGGAAAACC GCAGGCGCAG ATCGATACGG 
CGAAGTTCGG GGGGTCACGA GCATGA
|
Protein sequence | MTVSGLHHVQ TPIAGLIELA LRDPCLADLS ARAADKPDDL AMVGPASARL LVTAALAQAG PLLVVTATGR EADDLTAELR GVIGDSAALF PSWETLPHER LSPGVDTVGA RMMLLRRLAH PDDARLGPPL RVVVTTARSL VQPMAPGLAE VEPVTLTVGA EMDFDGVIKR LVDLAYTRCD MVAKRGEFAV RGGILDVFPP TAEHPVRVEF WGDEISEMRM FAVADQRSIP EIEVDTVIAV PCRELLMSDE VRSRAAVLAA EHPTHENSVP GSVPDMLAKL AEGIPVDGME ALLPLLRPTD LAMLSDHLPD GAPILVCDPE KVRTRAGDLI KTGREFLEAS WSTAAVGGAA PIDLEAMGAS GFLGFEEVRT GARAGGHPWW TLSQLSDEKA IELDIRSAPS ARGQQSSVEE IFAMLRAHVA TGGYGAVVTP GAGTAHRVVE QLGENDTPAT MLEPGEEPKA GVVGVLKGPL HDGVVLPGAN LVIVTEADLT GSRVTATEGK RLAAKRRNVV DPLALTAGDL VVHDQHGIGR FVEMTERVVG GARREYLVLE YASSKRGGGS DRLYVPMDSL DQLSRYVGGE APSLSKLGGS DWANTKTKAR RAVREIASEL VALYAKRQAA PGHAFSPDTP WQNEMEDAFG FTETVDQLTA IEEVKADMEK PVPMDRVICG DVGYGKTEIA VRAAFKAVQD GKQVAVLVPT TLLADQHLQT FTARMAGFPV TVKGLSRFTD PAESRAALEG MKDGSVDIVI GTHRLLQTGV VWKDLGLIIV DEEQRFGVEH KEHIKSMRTH VDVLTMSATP IPRTLEMSLA GIREMSTILT PPEERYPVLT YVGPQDDKQV AAALRRELLR DGQAFYIHNR VRTIDQAASK IAALVPEARV VVAHGQMPEE LLERTVEGFW NREYDILVCT TIVETGLDIS NANTLIVERA DTFGLSQLHQ LRGRVGRSRE RGYAYFLYPP EQPLTETAYD RLATIAQNNE LGAGMAVAMK DLEIRGAGNV LGAEQSGHVA GVGFDLYVRL VGEAVEAYRA AADGKTVATP QETKDVRIDL PVDANLPPDY IGSDRLRLEA YRRLAAAQDD AGVDAVIDEL VDRYGPLPEP AQRLVGVARL RLVCREYGIT DVSSVSASTV KLSPMELPDS ALLRLKRMYP GATYRATTGT VSVPIPRATD SVGAPRIRDA ELVAMVAGLV LALNGKPQAQ IDTAKFGGSR A
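The sequences above are printed in space-separated blocks of ten residues. The following is a minimal sketch, assuming that layout, of how such text could be joined into a contiguous string and how a GC fraction could be computed from it. The helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace so the blocks form one contiguous sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence given in blocks."""
    seq = join_blocks(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, used purely as sample input.
    sample = "ATGACCGTAT CGGGGCTTCA"
    print(len(join_blocks(sample)))
    print(gc_fraction(sample))
```

Applying `gc_fraction` to the full gene sequence would give a value that can be compared with the GC content field in the record.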
|
| |