Gene Information |
Locus tag | EcolC_2487 |
Symbol | |
ID | 6065208 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2735869 |
End bp | 2739363 |
Gene Length | 3495 bp |
Protein Length | 1164 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641601893 |
Product | transcription-repair coupling factor |
Protein accession | YP_001725445 |
Protein GI | 170020491 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
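The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for EcolC_2487.
length = gene_length_bp(2735869, 2739363)
print(length)                     # 3495
print(protein_length_aa(length))  # 1164
```

Both results match the Gene Length (3495 bp) and Protein Length (1164 aa) fields reported in the record.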
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0203456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00300257 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCCAT ATGTTGAGGC ATATCCTAAC GAGAATCTGA CAACCGTTAT GCCTGAACAA TATCGTTATA CGCTGCCCGT CAAAGCGGGT GAGCAGCGTC TGCTGGGCGA GTTAACCGGC GCAGCCTGTG CAACGCTGGT AGCGGAAATT GCCGAACGTC ACGCCGGTCC GGTGGTACTC ATTGCACCAG ATATGCAAAA TGCTCTGCGT TTGCATGATG AAATCAGCCA GTTCACCGAT CAGATGGTGA TGAATCTGGC GGACTGGGAA ACTCTTCCCT ACGACAGTTT TTCGCCTCAT CAGGACATTA TCTCCTCGCG CCTTTCCACC CTTTACCAGC TACCGACGAT GCAGCGTGGC GTACTGATTG TTCCGGTGAA TACGCTTATG CAGCGCGTTT GCCCACACAG TTTTCTCCAC GGTCATGCGC TGGTGATGAA AAAAGGTCAG CGCCTGTCAC GAGATGCATT ACGAACCCAA CTGGACAGCG CCGGTTATCG CCATGTTGAC CAGGTGATGG AGCACGGCGA ATACGCCACG CGCGGCGCGT TGCTGGATCT CTTCCCGATG GGGAGTGAGC TGCCTTATCG TCTTGATTTC TTTGATGATG AAATCGACAG CCTGCGGGTG TTTGACGTCG ACAGCCAGCG CACGCTGGAG GAAGTAGAAG CGATCAATCT GCTGCCCGCG CACGAATTTC CGACCGATAA AGCGGCAATT GAACTGTTCC GCAGCCAGTG GCGCGATACC TTCGAAGTGA AGCGCGATCC GGAACATATT TACCAGCAAG TGAGTAAAGG CACATTACCT GCCGGGATCG AGTACTGGCA GCCATTGTTC TTCAGCGAAC CACTGCCGCC GCTGTTCAGT TATTTCCCTG CCAATACCTT GCTGGTGAAT ACTGGCGATC TGGAAACCAG TGCCGAACGT TTCCAGGCTG ACACGCTGGC GCGTTTTGAG AATCGCGGCG TCGATCCGAT GCGCCCGCTG TTGCCACCAC AATCGCTCTG GCTGCGGGTG GACGAGCTCT TCTCAGAGCT GAAAAACTGG CCCCGGGTGC AGCTAAAAAC TGAACATTTA CCGACAAAAG CCGCGAATGC CAATTTAGGT TTCCAGAAAC TGCCAGACCT GGCCGTTCAG GCGCAACAAA AAGCGCCGCT GGATGCGCTG CGTAAGTTCC TCGAGACTTT CGACGGTCCG GTGGTGTTCT CGGTAGAAAG TGAAGGTCGC CGTGAAGCGC TGGGTGAACT GCTCGCACGA ATTAAAATTG CTCCGCAACG CATTATGCGT CTTGATGAAG CCAGCGACCG TGGGCGTTAT CTGATGATTG GCGCTGCCGA ACATGGTTTT GTCGATACGG TGCGTAATCT GGCGCTGATC TGCGAAAGCG ATCTGCTCGG TGAACGTGTT GCCCGCCGTC GTCAGGATTC TCGCCGCACC ATCAACCCCG ATACACTGAT CCGTAACCTT GCGGAGCTGC ATATTGGTCA GCCGGTGGTC CATCTGGAGC ACGGTGTCGG GCGCTATGCC GGAATGACCA CGCTCGAAGC GGGCGGCATT ACTGGCGAGT ATTTGATGCT CACCTATGCC AACGACGCCA AACTGTATGT TCCGGTGTCG TCACTGCATC TGATTAGCCG TTACGCAGGT GGCGCGGAAG AAAACGCCCC GCTGCATAAA CTTGGCGGCG ATGCGTGGTC ACGCGCGCGG CAGAAAGCGG CGGAAAAAGT GCGTGATGTG GCGGCGGAAT TGCTGGATAT CTACGCGCAA CGCGCCGCCA AAGAGGGCTT CGCGTTTAAA 
CACGATCGTG AGCAGTATCA GTTGTTCTGC GACAGCTTCC CGTTTGAAAC CACGCCGGAT CAGGCGCAGG CCATTAATGC GGTACTTAGC GACATGTGTC AGCCGCTGGC AATGGATCGT CTGGTGTGCG GCGATGTTGG CTTTGGTAAA ACAGAAGTGG CGATGCGCGC CGCTTTCCTG GCAGTAGATA ACCACAAGCA GGTAGCGGTG TTGGTGCCTA CCACCCTTCT TGCGCAGCAG CATTACGACA ACTTCCGCGA CCGTTTCGCC AACTGGCCGG TACGTATCGA AATGATCTCC CGTTTCCGCA GCGCCAAAGA GCAGACGCAA ATCCTTGCGG AAGTGGCGGA AGGGAAAATC GATATTCTGA TCGGTACGCA CAAACTGCTG CAAAGTGACG TCAAGTTTAA AGATTTAGGC CTGCTGATTG TCGATGAAGA ACACCGCTTC GGGGTGCGTC ATAAAGAGCG CATTAAAGCG ATGCGCGCGA ACGTGGATAT TCTGACGCTT ACTGCAACGC CGATCCCACG CACGCTGAAT ATGGCAATGA GCGGAATGCG TGACCTGTCG ATTATCGCCA CGCCGCCCGC CCGTCGTCTG GCAGTTAAAA CCTTTGTCCG TGAGTATGAC AGCCTGGTGG TCCGGGAGGC GATCCTGCGT GAAATTTTGC GCGGGGGGCA GGTTTATTAT CTCTACAATG ATGTGGAAAA CATCCAGAAA GCTGCCGAAC GGCTGGCAGA ACTGGTGCCT GAAGCACGGA TTGCCATCGG TCACGGGCAG ATGCGCGAGC GCGAACTGGA ACGGGTGATG AATGATTTCC ATCATCAACG TTTCAACGTG CTGGTTTGTA CCACCATTAT CGAAACCGGG ATCGACATCC CGACAGCCAA CACCATTATC ATTGAACGTG CGGATCACTT CGGTCTGGCG CAGCTGCACC AGTTACGCGG TCGCGTCGGA CGTTCACATC ATCAGGCATA TGCATGGCTG CTGACGCCGC ATCCAAAAGC GATGACTACC GATGCACAAA AACGTCTTGA AGCGATTGCC TCGCTGGAAG ATCTCGGTGC AGGCTTTGCG CTGGCAACGC ACGATCTGGA GATCCGCGGC GCGGGTGAAC TGCTTGGCGA AGAACAAAGT GGCTCAATGG AAACCATCGG TTTCTCGCTG TATATGGAGT TGCTGGAAAA CGCCGTCGAT GCACTGAAAG CCGGACGCGA GCCGTCGCTG GAAGATCTCA CCAGCCAGCA AACAGAAGTC GAGCTGCGGA TGCCGTCGCT ATTGCCAGAT GATTTCATTC CTGACGTGAA TACGCGTTTG TCGTTCTATA AACGTATTGC CAGCGCCAAA ACGGAAAACG AACTGGAAGA GATCAAAGTC GAGCTTATCG ATCGCTTCGG CCTGCTGCCG GATCCGGCGC GTACCCTGCT GGATGTTGCC CGTCTGCGCC AGCAAGCGCA GAAACTGGGG ATCAGGAAGC TGGAAGGTAA TGAGAAAGGC GGGGTGATCG AATTTGCCGA GAAGAATCAC GTTAATCCGG CCTGGTTGAT TGGTTTGCTG CAAAAACAGC CGCAGCATTA CCGCCTTGAT GGTCCGACGC GCCTGAAATT TATTCAGGAT TTGAGTGAGC GGAAAACGCG TATCGAATGG GTACGCCAGT TTATGCGTGA ACTGGAAGAG AACGCGATCG CTTAA
|
Protein sequence | MPPYVEAYPN ENLTTVMPEQ YRYTLPVKAG EQRLLGELTG AACATLVAEI AERHAGPVVL IAPDMQNALR LHDEISQFTD QMVMNLADWE TLPYDSFSPH QDIISSRLST LYQLPTMQRG VLIVPVNTLM QRVCPHSFLH GHALVMKKGQ RLSRDALRTQ LDSAGYRHVD QVMEHGEYAT RGALLDLFPM GSELPYRLDF FDDEIDSLRV FDVDSQRTLE EVEAINLLPA HEFPTDKAAI ELFRSQWRDT FEVKRDPEHI YQQVSKGTLP AGIEYWQPLF FSEPLPPLFS YFPANTLLVN TGDLETSAER FQADTLARFE NRGVDPMRPL LPPQSLWLRV DELFSELKNW PRVQLKTEHL PTKAANANLG FQKLPDLAVQ AQQKAPLDAL RKFLETFDGP VVFSVESEGR REALGELLAR IKIAPQRIMR LDEASDRGRY LMIGAAEHGF VDTVRNLALI CESDLLGERV ARRRQDSRRT INPDTLIRNL AELHIGQPVV HLEHGVGRYA GMTTLEAGGI TGEYLMLTYA NDAKLYVPVS SLHLISRYAG GAEENAPLHK LGGDAWSRAR QKAAEKVRDV AAELLDIYAQ RAAKEGFAFK HDREQYQLFC DSFPFETTPD QAQAINAVLS DMCQPLAMDR LVCGDVGFGK TEVAMRAAFL AVDNHKQVAV LVPTTLLAQQ HYDNFRDRFA NWPVRIEMIS RFRSAKEQTQ ILAEVAEGKI DILIGTHKLL QSDVKFKDLG LLIVDEEHRF GVRHKERIKA MRANVDILTL TATPIPRTLN MAMSGMRDLS IIATPPARRL AVKTFVREYD SLVVREAILR EILRGGQVYY LYNDVENIQK AAERLAELVP EARIAIGHGQ MRERELERVM NDFHHQRFNV LVCTTIIETG IDIPTANTII IERADHFGLA QLHQLRGRVG RSHHQAYAWL LTPHPKAMTT DAQKRLEAIA SLEDLGAGFA LATHDLEIRG AGELLGEEQS GSMETIGFSL YMELLENAVD ALKAGREPSL EDLTSQQTEV ELRMPSLLPD DFIPDVNTRL SFYKRIASAK TENELEEIKV ELIDRFGLLP DPARTLLDVA RLRQQAQKLG IRKLEGNEKG GVIEFAEKNH VNPAWLIGLL QKQPQHYRLD GPTRLKFIQD LSERKTRIEW VRQFMRELEE NAIA
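The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last few codons. The sketch below builds the codon table from the conventional TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons, so this table is adequate for internal codons. The function name `translate` is a hypothetical helper, and only two short excerpts copied from the record are translated, not the full gene.

```python
# Build a codon -> amino acid map using the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored, '*' marks a stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 15 nt of the gene sequence, as printed in the record.
head = "ATGCCCCCAT ATGTTGAGGC ATATCCTAAC"
tail = "AACGCGATCG CTTAA"

print(translate(head))  # MPPYVEAYPN
print(translate(tail))  # NAIA*
```

The translated head matches the first ten residues of the protein sequence, and the tail matches its last four residues followed by the TAA stop codon.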