Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1561 |
Symbol | mfd |
ID | 5712705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1622403 |
End bp | 1625885 |
Gene Length | 3483 bp |
Protein Length | 1160 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641267476 |
Product | transcription repair coupling factor |
Protein accession | YP_001532904 |
Protein GI | 159044110 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.224407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.305942 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCCC AGACCGTTCC AACACATCTG CGCGTCACCG GCGTACCCGA AGGCTTCGAC GCCCAGATCG TGCTGCGCGA GGCCTCCCGC CCGGACGGGC AGGCCGTGTT CGTCGCCCGC GACGACAAGC GCATGGCCGC CATGGCCGCC GCCCTTGCGG TGACCGCGCC GCAGATCCCC GTGCTGCGCT TTCCCGGCTG GGACTGCCTG CCCTATGACC GCTCCTCCCC GAACCCGGAG ATCTCGGCCA CGCGCATGGC GACCCTCGCG GCGCTGGCCC ACGGGGTGCC GGGGCCCTTC GTGCTGCTCA CCACCCTGTC GGCGGTCACC CAGCGTGTGC CCGCCCGCGC CACCCTGGCC GAAGCCAGTT TTTCCGCCCA GGTCGGCGGG CGCATCGACG AGGCCGCGTT GCGGCAGTTC CTGACCCGCA TGGGATTTGT CCAGGCCCCC ACCGTGACCG AGCCCGGCGA CTACGCGATC CGCGGCGGCA TCATCGACAT CTTTCCGCCG GGCCAGTCCG GGCCCGTGCG GCTCGATCTG TTCGGCGACG TGCTGGACGG CGCGCGCCGT TTCGATGCCG CCACCCAGCG CACGACCGAG AAACTGGACG CGATCGAGCT GGCCCCGGTG TCCGAGATCA TCCTCGACCC CGCCGCCATC ACCCGGTTCC GCCAATCCTA CCGGATCGAA TTCGGTGCCG CGGGCACCGA CGATCCGCTC TACGAGGCGG TCTCGGCGGG CCGCAAACAT GCAGGGATGG AGCATTGGTT GCCCTTCTTT CACGACCGGC TGGAGACATT GCTGGACTAC GTGCCCGAGG CCAGCCTGAT CCTGGACGAC CAGTTCGAAG CGATGCATCT CAGCCGCTGG GAAGGGATCA AGGACCAGTA CGAAACGCGC CGCCATGCGC TTGCCCAGAA GGGGCAGATG GGCACGGTCT ACAAACCTGC CCCGCCCGAA ACCCTCTATA TCCCGCCGGC CGACGAGACA GCGCTACTGG CCACCAAGCG CACCCTGCAG CTGTCGGTGC TGCCCTCGGC TTCGGGCCCC GGGGTGACCG ACGCGGGCGG GCGGATCGGA CGCAACTTCG CGCCCGAGCG GCAGTCCCAG GCAACCGGGC TTTTCGAGGC GCTCGCCACC CATATCACCG AAAAACGCAA GACCTCCCAG GTGGTGATCG CCAGCTGGTC CGAGGGCGCG CGCGAACGGC TTCGCGGGCT GCTGGAGGAC CAGGACCTGT CCGGGCTGAC CGAGATCGCG CGCCTGTCGG ACATCCCGGA GGGCACCGGC GGCGTCCACC TTCTGGTCTG GGCGCTGGAC GAAGGTTTCG AGGGGCCCGA CCACCGGAGC ACACGCCTGA CGGTGATCTC CGAACAGGAC GTGCTCGGCG ACCGGCTGAT CCGCACCACC AAGCGCAAGC GCCGGGCCGA GAATTTTTTG CAGGAGGCCA CGAGCCTCAG CGCCGGCGAC CTGGTGGTCC ATGTCGATCA CGGGGTCGGG GCGTTCAAGG GGTTGGAGAC GGTCACCGCC ATGGGTGCGC CCCATGAATG CCTGCTGCTG GAATACGCGG GCGGCGACCG GCTCTACCTG CCGGTGGAAA ACATCGAGCT TCTGAGCCGC TTCGGCCAGG AAATCGGCAT GCTCGACAAG CTGGGCGGCG GCGCCTGGCA GGCCAAGAAG GCCAAGCTCA AGGAGCGCAT CCGCGAGATG GCCGACAAGC TCATCCGCAT CGCCGCCGAA CGCGCGCTCC GCCGCGCCCC CATGCTGGAG CCGCCCCCGG ACATGTGGGA GGCATTCTCC GCGCGCTTCC CCTACACCGA AACCGACGAC CAGCTCTCCG CGATCGAGGA CGTGGTCCAT GACCTCGCCG CGGGCACGCC GATGGACCGG CTGATCTGCG GCGATGTGGG GTTCGGCAAG ACCGAGGTCG CCATGCGCGC GGCCTTCATC GCGGCCCTCT CGGGCGTGCA GGTCGCGGTG ATCGCACCCA CGACGCTGCT GGCGCGCCAA CACTACAAGA GCTTCGCCGA CCGCTTCCGC GGCTTCCCGC TGGAGGTCCG GCCGCTCTCG CGCTTCGTGC CCGCGAAAGC CGCGGCCGAC ACGCGCAAGG GCCTCGCCGC CGGGTCCGTG GACATCGTCG TGGGCACCCA TGCGCTCCTG GCCAAGGGCG TCCGGTTTCA CAATCTCGGC CTGCTGATCA TCGACGAGGA ACAGCGCTTC GGCGTCGGCC ATAAGGAACG CCTGAAAGAG CTGCGCTCGG ACGTCCATGT CCTGACCCTC ACCGCGACCC CGATCCCGCG CACCCTGCAA CTCAGCCTCT CGGGGGTGCG GGACCTGTCG ATCATCGGCA CGCCCCCCGT CGACCGCCTG TCGATCCGCA CCTACGTGTC CGAATTCGAC CCCGTCACAT TGCGCGAAGC CCTCCTGCGC GAACACTACC GCGGCGGACA AAGCTTCTTC GTCGTCCCAC GCATCAAGGA CATCCCCGAG ATCGAGGCGT TCCTGCGCGA CCAGGTGCCC GAGGTCAGCT TCGTCGTCGC CCATGGCCAG ATGGCGGCGG GTGAGCTCGA CGACCGGATG AACGCCTTTT ACGATGGCAA ATACGACGTC CTGCTGGCCA CGACCATCGT CGAGTCGGGC CTCGACATCC CGACCGCCAA CACGATGATC ATCCACCGCG CCGACATGTT CGGGCTCAGC CAGCTTTACC AGATCCGGGG CAGGGTGGGG CGCGCCAAGA CCCGCGCCTA TGCGTACCTG ACCACCAAGC CCCGCATGAA ACTCACACCC GCGGCCGAAA AACGCCTCCG CGTGCTGGGC AGCCTCGATA GCCTCGGCGC GGGCTTCACC TTGGCCTCCC AAGACCTCGA TATCCGCGGC GCGGGCAACC TTCTGGGCGA GGCGCAGTCG GGCCAGTTCC GCGAGGTCGG CTTCGAGCTC TACCAATCCA TGCTCGAAGA GGCGATCGGC AAGATCAAAT CCGGCAGCCT CGAAGGGCTC ACCGACGATG ACGGCCAATG GGCGCCGCAG ATCAACCTCG GCGTGCCCGT GCTGATCCCC GAGGCCTACG TGCCCGACCT CGACGTCCGG CTCGGCCTCT ACCGCCGCCT GTCGCAGTTG ACCACCAAGG TGGAGCTCGA AGGCTTCGCC GCCGAGCTGA TCGACCGCTT CGGCAAGCTG CCCAAGGAGG TCAACACGCT CCTGCTGATC GTGCGGATCA AGGCGATGTG CAAGAAGGCC GGGATCGCCA AGCTCGATGG CGGCCCCAAG GGCGCGACCG TGCAGTTCCA CAACGACAAG TTCGCCAATC CCGCGGGGCT CGTGAAATTC ATCAACGATC AGAAGGGGCT GGCCAAGGTA CGCGACAACA AGATCGTCGT CCGCCGCGAC TGGGCCAAGG AAAGCGACCG GATCAAGGGT GCGTTTTCCA TCGCTCGCGA CCTGGCGGTC GAGGCGAAAG CCGCCAAGGC CCAGGCAGGC TGA
|
Protein sequence | MTSQTVPTHL RVTGVPEGFD AQIVLREASR PDGQAVFVAR DDKRMAAMAA ALAVTAPQIP VLRFPGWDCL PYDRSSPNPE ISATRMATLA ALAHGVPGPF VLLTTLSAVT QRVPARATLA EASFSAQVGG RIDEAALRQF LTRMGFVQAP TVTEPGDYAI RGGIIDIFPP GQSGPVRLDL FGDVLDGARR FDAATQRTTE KLDAIELAPV SEIILDPAAI TRFRQSYRIE FGAAGTDDPL YEAVSAGRKH AGMEHWLPFF HDRLETLLDY VPEASLILDD QFEAMHLSRW EGIKDQYETR RHALAQKGQM GTVYKPAPPE TLYIPPADET ALLATKRTLQ LSVLPSASGP GVTDAGGRIG RNFAPERQSQ ATGLFEALAT HITEKRKTSQ VVIASWSEGA RERLRGLLED QDLSGLTEIA RLSDIPEGTG GVHLLVWALD EGFEGPDHRS TRLTVISEQD VLGDRLIRTT KRKRRAENFL QEATSLSAGD LVVHVDHGVG AFKGLETVTA MGAPHECLLL EYAGGDRLYL PVENIELLSR FGQEIGMLDK LGGGAWQAKK AKLKERIREM ADKLIRIAAE RALRRAPMLE PPPDMWEAFS ARFPYTETDD QLSAIEDVVH DLAAGTPMDR LICGDVGFGK TEVAMRAAFI AALSGVQVAV IAPTTLLARQ HYKSFADRFR GFPLEVRPLS RFVPAKAAAD TRKGLAAGSV DIVVGTHALL AKGVRFHNLG LLIIDEEQRF GVGHKERLKE LRSDVHVLTL TATPIPRTLQ LSLSGVRDLS IIGTPPVDRL SIRTYVSEFD PVTLREALLR EHYRGGQSFF VVPRIKDIPE IEAFLRDQVP EVSFVVAHGQ MAAGELDDRM NAFYDGKYDV LLATTIVESG LDIPTANTMI IHRADMFGLS QLYQIRGRVG RAKTRAYAYL TTKPRMKLTP AAEKRLRVLG SLDSLGAGFT LASQDLDIRG AGNLLGEAQS GQFREVGFEL YQSMLEEAIG KIKSGSLEGL TDDDGQWAPQ INLGVPVLIP EAYVPDLDVR LGLYRRLSQL TTKVELEGFA AELIDRFGKL PKEVNTLLLI VRIKAMCKKA GIAKLDGGPK GATVQFHNDK FANPAGLVKF INDQKGLAKV RDNKIVVRRD WAKESDRIKG AFSIARDLAV EAKAAKAQAG
|
| |