Gene Information |
Locus tag | NATL1_10511 |
Symbol | mfd |
ID | 4779323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 968831 |
End bp | 972334 |
Gene Length | 3504 bp |
Protein Length | 1167 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640084330 |
Product | transcriptional-repair coupling factor |
Protein accession | YP_001014874 |
Protein GI | 124025758 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
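The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 968831
end_bp = 972334
protein_length_aa = 1167

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 3504
print(remainder)                # 0
print(expected_protein_length)  # 1167
```

Under these assumptions the computed values agree with the Gene Length (3504 bp) and Protein Length (1167 aa) fields.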
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00605187 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTCTTTAG AGTCAATAGC AAAGTATTTA GAAAAGCATC ATTTAACAAC TGAGTTGATT GAACGAACAA ATAGAGAAGA AAGATTAACA TTAACAGGAG CATCACGGAC AGCAAAGGCA TTAATAACAA CTTCACTTGC TAAAAATGAG TCCAAAAGAT TATTAGTAAT TGTTCCAACA TTAGAAGAAG CAACTAGATG GTATCCCCTT GTAAAAGACT GCGGTTGGAC TAAGACATGT TTATATCCAA CAAGTGAAGT CTCACCATAT GAAACTACTC AAGTTACTTC AGAAATCATT TGGGGTCAAT TACAAGTACT AAGCGATATA TTGGAATTAA AAGATGATGA GAATATCGCA ATAATTGCAA CAGAAAGGTC TTTACAACCA CATCTGCCCC CATTTGAATA CCTTAAAGAA AAGTGTATTA AATTAAACGT GGGTGATGAA ATAAATCTAA GTGATTTATC TTTAAAATTG AGTGAAAGTG GATATATCAA GTCTAATAAT ATAGATCAAG AAGGAACATG GACAAGACGC GGAGATATTA TTGATATTTA CCCTGTTAGT AGTGAACTTC CAATTAGATT AGAGTTATTT GGTGATCTAT TAGATAAGAT TAAAGAATTT GATCCAATTT CACAAAGGTC ATTAGATCAA ATCAACAATG TATGCATAAC ACCCACAGGT TTTGATCCAC TAATCATTAA TAAGCTTATA TCAACTGACA ACAAGGATAT ATCGAGTTTA TTTACTAATG ATGAGTTCTC TGAGTTAGTA AATTCAAATA AATTGGATTC AGCAAAGAAA TATTTAGGAG TTGCATTTGA TAAGCCTTCA TCATTATTAG ATTATTTAGA TGATAAGACA TTTATTGTTG TTGATGAGAG GCTTCAAGGT ATATCTCATG GAAAAGCTTG GTATAATATC GTTAATGAAA ATTATACAGA TGTAATTACT ACTATAAAAG GTAGTGAAGG AATAAAGACT ATATTTAAAC CTAATCTTCA TAAAGACATT AATGATATAT ATGATTCTCT AAATAATTAT AAAGGTATTG ATATAACAGA TTTAGAAGAC ACTACAAAGA AAACGAATGT TTTTAGTATT TCAAGCAAAG TTCATAATTG GCTACCTAAT CAATACGGTA AAATAAGTTT ATCTTTAAAA GATTACATTA AGGATAAATA TTCTATTTGG ATAATTTCAG CACAACCTAG TCGTGCAGTT TCTTTATTAG AAGAACATGA ATGTATCTCA AAGTTTATAC CTAACAATAC CGATCTTAAT GGTATCAAAA ATATTATCGA TGACAATATT CCAGTAGCTA TTAAAAATAA AAATGAGGGT GAAATTGAAG GATTTTATCT TCCTGCATGG AAAATTGCAC TATTAACGGA TAAGGAGTTT TTCGGACAAC AAAATATTTC TACGACTGGT TATATAAGAA GAAGAAAACA ATCTCAAAGT AAAAAGATAG ATCCTAATAA GATGAAACCA GGCGATTATG TTGTTCATAG AAATCATGGA ATTGGTTTAT TTCAGAAAAT TGAAAAATTA AATATTAATG GAGAGTCAAG AGATTATTTG GTAATAAAAT ATATGGATGG AAAGTTAAGT GTTGCCGCAG ATCAACTTGG AAGTTTAGGT AGGTATAGAA GTTCAAATGC AAAGACTCCT ACAATTAGTA AATTAGGAGG GGCTAATTGG AACAAAATAA AGGAAAAGGC AAAGAAATCC GTTAAAAAAG TTGCTATTGA TTTAATTAAG TTATATGCAG AAAGAAGTAA AGAAAAAGGG 
TATAAATTTC CATGTGATGG TCCCTGGCAA AGCGAATTAG AAGACTCATT TCCATACGCA CTTACACCTG ATCAAGCAAC AGCTACATCT CAAGTTAAAT CTGATATGGA AAGTGAAAAG CCTATGGATA GATTGGTTTG CGGCGATGTT GGATTTGGAA AAACAGAAGT TGCTATACGA GCAATATTTA AGGCTATTAC CTCAGGAAAA CAAATAGCTT TATTAGCACC AACGACTGTA TTATCTCAAC AACATTGGAG AACTATTTCT GATCGATTTG CTCCTTATCC TATAAAAGTT TCATTACTCA ACAGATTTAA AACAAATAGT GAAAAAAAAC ATATAGTTAG TGGCCTGAAA GCTGGACAAA TTGATGCAGT TGTTGGTACA CATCAGCTCT TGAACAAAAA ATTAGTTTAT AAGGACTTGG GACTTCTAGT TATAGATGAA GAACAACGTT TTGGAGTTAA TCAAAAAGAG AAAATAAAGG AGTTAAAAAA AAGTGTAGAT GTATTAACTC TTTCAGCGAC TCCAATTCCA AGAACACTCT ATATGAGTCT TTCTGGTGTC CGTGAAATGA GTTTAATAAC AACACCGCCT CCCCTACGAA GGCCGATTAA AACACACTTA GCACCCCTCG ATAATGAAAT AATAAGAAGT GCAATTTCGC AAGAGATTGA TAGAGGTGGC CAAATATTTT ATATTGTTCC TCGAATAAAA GGAATAGAAG ATGTAGCAGA GAAATTAAAA ATTATGATCC CAAATGTGAA ATTATTGATT GCACATGGTC AAATGGAGGA GGGAGCATTA GAGAATGCAA TGCTTGCATT TAATGCAGGA GAAGCCGATA TTTTGCTTTG TACAACTATT GTAGAAAGTG GATTAGATAT TCCTAGAGTA AATACTATTT TAATTGAAGA TTCTCACAAG TTTGGTTTAT CTCAACTTTA CCAATTGAGA GGCAGAGTAG GCCGAAGTGG AGTACAAGCA CATGCTTGGT TATTTTATCC AAGCGATGAG AAATTAAATG AGACCTCAAG GCAACGTTTA AAGGCTATAA AAGAATTTAG TGATTTGGGC AGTGGTTATC AGTTAGCCAT GAGGGACATG GAAATTAGAG GCGTTGGAAA TATCTTAGGT ATTGAACAAA GCGGACAAAT GGAAACAATA GGATTTGATT TGTATATGGA ATTATTGCAG GAAACTATTG CCGAAATACA GGGGCAAGAC ATTCCTAGTG TTGACGATAC TCAAATAGAT CTACCTGTTA CAGCTTTTAT ACCGGGAGAT TGGATAACTG ATCCAGATGA AAAAATAAAT GCATATAGAT TAGCCACACA ATGCGAAAAC AATGATTCAT TAGTTCAATT TGCTAGCAAC TTGGTTGATA GATATGGAAC ATTACCAAAA GCAGTTGAAT CATTAATAGA AGTAATGAAA TTAAAAATAA TCGCTAAAAA GTGTGGCTTC TCAAGAATCA AGTTATCCAA ACCAAATGTT GAGCTTGAGA CCATGATGGA TGAGCCAGCA TTCAAGTTAC TAAGAAAAGG TTTGGCTAAT CATCTTCATG GAAGATTTAT TTACAAGAAA GGGGATAGGT GTTCAACGGT GACTATTCGA GGACTCGGAA TCTTGGATAG CGATAAACTT CTAGATCAAT TAACAGAATG GCTAAAACTT ATGAATTCAG AAATAAACGC TTAA
|
Protein sequence | MSLESIAKYL EKHHLTTELI ERTNREERLT LTGASRTAKA LITTSLAKNE SKRLLVIVPT LEEATRWYPL VKDCGWTKTC LYPTSEVSPY ETTQVTSEII WGQLQVLSDI LELKDDENIA IIATERSLQP HLPPFEYLKE KCIKLNVGDE INLSDLSLKL SESGYIKSNN IDQEGTWTRR GDIIDIYPVS SELPIRLELF GDLLDKIKEF DPISQRSLDQ INNVCITPTG FDPLIINKLI STDNKDISSL FTNDEFSELV NSNKLDSAKK YLGVAFDKPS SLLDYLDDKT FIVVDERLQG ISHGKAWYNI VNENYTDVIT TIKGSEGIKT IFKPNLHKDI NDIYDSLNNY KGIDITDLED TTKKTNVFSI SSKVHNWLPN QYGKISLSLK DYIKDKYSIW IISAQPSRAV SLLEEHECIS KFIPNNTDLN GIKNIIDDNI PVAIKNKNEG EIEGFYLPAW KIALLTDKEF FGQQNISTTG YIRRRKQSQS KKIDPNKMKP GDYVVHRNHG IGLFQKIEKL NINGESRDYL VIKYMDGKLS VAADQLGSLG RYRSSNAKTP TISKLGGANW NKIKEKAKKS VKKVAIDLIK LYAERSKEKG YKFPCDGPWQ SELEDSFPYA LTPDQATATS QVKSDMESEK PMDRLVCGDV GFGKTEVAIR AIFKAITSGK QIALLAPTTV LSQQHWRTIS DRFAPYPIKV SLLNRFKTNS EKKHIVSGLK AGQIDAVVGT HQLLNKKLVY KDLGLLVIDE EQRFGVNQKE KIKELKKSVD VLTLSATPIP RTLYMSLSGV REMSLITTPP PLRRPIKTHL APLDNEIIRS AISQEIDRGG QIFYIVPRIK GIEDVAEKLK IMIPNVKLLI AHGQMEEGAL ENAMLAFNAG EADILLCTTI VESGLDIPRV NTILIEDSHK FGLSQLYQLR GRVGRSGVQA HAWLFYPSDE KLNETSRQRL KAIKEFSDLG SGYQLAMRDM EIRGVGNILG IEQSGQMETI GFDLYMELLQ ETIAEIQGQD IPSVDDTQID LPVTAFIPGD WITDPDEKIN AYRLATQCEN NDSLVQFASN LVDRYGTLPK AVESLIEVMK LKIIAKKCGF SRIKLSKPNV ELETMMDEPA FKLLRKGLAN HLHGRFIYKK GDRCSTVTIR GLGILDSDKL LDQLTEWLKL MNSEINA
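The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial code), GTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this on the first 30 and last 24 nucleotides of the gene sequence. The codon table is a hand-written subset covering only the codons in these two fragments, and the function name `translate` and its `is_cds_start` flag are hypothetical, not part of any library.

```python
# Subset of the genetic code, limited to the codons that occur in the
# two fragments below. This is NOT a complete table 11.
CODON_TABLE = {
    "GTG": "V", "TCT": "S", "TTA": "L", "GAG": "E", "TCA": "S",
    "ATA": "I", "GCA": "A", "AAG": "K", "TAT": "Y",
    "ATG": "M", "AAT": "N", "GAA": "E", "AAC": "N", "GCT": "A",
    "TAA": "*",  # stop codon
}


def translate(seq, is_cds_start=False):
    """Translate a nucleotide fragment, stopping at the first stop codon.

    If is_cds_start is True, the first codon is treated as an initiation
    codon and emitted as M regardless of its usual meaning.
    """
    seq = seq.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for index in range(0, usable, 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":
            break
        if index == 0 and is_cds_start:
            amino_acid = "M"
        residues.append(amino_acid)
    return "".join(residues)


# First 30 and last 24 nucleotides, copied from the gene sequence above.
first_30 = "GTGTCTTTAG AGTCAATAGC AAAGTATTTA"
last_24 = "ATGAATTCAG AAATAAACGC TTAA"

print(translate(first_30, is_cds_start=True))  # MSLESIAKYL
print(translate(last_24))                      # MNSEINA
```

The two outputs match the first ten and last seven residues of the protein sequence listed above. For a full translation, a complete codon table (for example from an established bioinformatics library) would be needed.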
|