Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_09271 |
Symbol | mfd |
ID | 5730520 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 821147 |
End bp | 824656 |
Gene Length | 3510 bp |
Protein Length | 1169 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641285293 |
Product | transcriptional-repair coupling factor |
Protein accession | YP_001550812 |
Protein GI | 159903468 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.254503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0000230767 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCATTAG AACCAATAAT TGACAAATTA AAAGAATCGT CTTTAACTTC TCAATTAATA GAACGGATAC AACGAAATAA CAGATTAATC TTAACGGGAG GTTCAAAAAC TGCTAAAACT ATTATATCAA CCACAATTTC AAAAGCTGAG AAACTACCAC TAATAGTCAT TGTTCCAACA TTAGAAGAAA CAACAAGATG GTATTCAATT CTAAATAATT TCTCTTGGGA TTCACTTTAT ATATATCCAA CATCAGAAAA TTCCCCATAT GAATCTATAC CACCGACAAC TGAGATATTA TGGGGACAAT TACAAGTTTT GACTGAGCTT TGCTCTAATG ATACTAATAA CATCGCCATA GTTACAACAG AAAGAGCTCT TCAACCACAT TTACCTCCAA AGGACTCTTT TTTAACATCA TGCCTATCTT TAGTTAAGGA AAATATATTT GATCTAGATA AGTTAGCCAT TAATTTAACA AACCTTGGAT ATTCTAAAAC AACTACTACC GAAGAGGAAG GTCAATGGAG TCGAAGAGGC GATATCTTGG ATATTTATCC GGTCAACTAT GAATCTCCAA TACGGTTAGA ATTTTATGGC GATAATATAG ATAAAATAAA GGAATTCGAC CCAGTTTCTC AACGTTCTCT TGATGAAATA AATCAGGTAA TAATCTCACC GGTCAACTTT GATATATTAA TAGCGAATAA GCTATCTTCT TTCACTCCAG AAATTCTGGC TAAATATTTT GACGCCGACT CTATCGATAA GCTTAAAAAT AATATAATAC CTTCTGGTAT AAGACGTTAT CTAGGACTAG CCTGGACAAG CCCTTCTTCC CTTATAGATT TTATAGATAA TTCATCACTA ATTATTACAG ATGAACCAAA TCAATGTAAT TCCCATTCAA CAGCCTGGAC AGAGCATGTC TCAGAAACCT ATCATCAATT AGAAACTAAT TTAGAGTTAA ATGATAATAA TTTAGCTCTT CCTCCAAATA ACCTTCATTC TACTTTTACT ACTAATTATG ACTTGCTAAA TGGATTTTAT GGCTTAGATA CTACAGATTT TATTGATCAA TATAATAGAG AGAATATATT TGATATTTCC TCCAAACAAA TTCTTACCTA TCCAAATCAA TTCGGAAGGT TAAGTGAAAT GCTCAAGAAA TATCAAAATG ATAAGTTTAA AATCTTTATT TATTCAGCCC AACCTAGTAG AACTAGCTCT CTGCTAAATG AACACGATTG TATTTCAGTA TTCGTAGAAA ATTCTAAAGA TAGCTTACGC ATTAAGACAT TACTAGATCA AAATACACCT GTTGCTCTCA GAAGTTCATC AAACTTTGAT TTAGAAGGGA TTAATTTTCT TCCTTGGAAA ATACTTCTAC TAACAGATAA GGAATTTTAT GGACAACAAC TTGTCAGTCA TAGTGGTTAC GTAAGGAGAA GAAAAAGATC AGCTAGTAAA AGTATTGATC ATAATAAATT AAGAACTGGG GACTATGTTG TACATAGAAA TCATGGTATA GGTAAATTCA TTAAAATAGA AAAATTTGTT ATCTCTCAAG AAAGTAGAGA TTATTTATTA GTCCAGTATC AAGACGGTAC TTTACGTGTA GCCGCAGATC AACTTGGTTC TCTTGGCCGC TATAGATCAT CTTCTGACAA ATCACCTAGG ATTGGAAAGT TAGGAGGTAC AGCTTGGCTT AATGCAAAGG AAAAAGCACG AAAATCAATT AACAAAGTTG CTATAGATTT AATTAGGCTT TACGCAGAGA GGAATAAAAC TGAGGGCTAT AGTTTCCCTC CTGATGCTCC TTGGCAATCT GAACTTGAAG ATGCATTCCA ATATGAGCCT ACTCATGATC AACTCACTGC AATTAAGGAT GTCAAAAATG ATATGGAAAA ACCCAAACCC ATGGATAGAC TTGTTTGTGG CGATGTTGGC TATGGAAAAA CGGAAGTAGC GATTAGGGCT CTTTTTAAAG CTATTATATC CGGAAAACAA GCAGCATTAC TCGCTCCAAC AACTATCCTA TCTCAACAGC ATTGGCGTAC GTTATCTGAT CGATTTGCTC CTTATCCAAT TAAAATTGCG CTTTTAAATC GATTCAAGAC TTCAAGAGAA AAGAATGCAA TTGTAGAAGA ACTGAAAAGC GGAACTATAG ATTTAGTTGT AGGTACCCAT TTAATACTTT CCAACAAGGT TTGTTTTAAA GATTTAGGTT TATTAGTAGT TGATGAAGAG CAACGGTTTG GTGTAAAGCA AAAGGAGCGT ATTAAACAAT TTAAAAAAAA TATAGATGTT CTTACTCTTA CTGCTACACC AATTCCTAGG ACTTTGTATA TGAGTCTATC GGGAGTTAGA GAAATGAGTC TTATTACTAC CCCCCCTCCT CTACGAAGGG CTATTAAAAC TCATTTAATT CCCTATGAGG AAGAAGCAAT CAGAAGTGCT ATATGCCAGG AGATTGATAG AGGAGGTCAA ATATTTTATG TCGTCCCACG AATTGAAGGT ATAACTGATA TTGCTACAAA ATTAAGTAAT ATGATTCCGA AAATAAGAAT ATTAATAGCA CATGGTCAAA TGGATGAAGG TGAATTAGAA AGCTCAATGA TAGCTTTTAA TGACTGGGAA GCAGACTTAA TGCTTTGTAC TACAATAGTT GAGAGCGGTT TAGACATTCC TAGAGTTAAT ACAATATTAA TTGAGGATGC TCAGCAGTTT GGCCTGTCAC AGCTTTATCA ATTAAGAGGA AGAGTTGGTC GCAGTGGTGT TCAGGCTCAT GCTTGGCTAC TTTATCCAAG TAATACGACA ATTAATGATA AAGCTAAGCA ACGATTACAA GCAATCCAGG AATTTAGCCA ATTAGGAAGT GGGTATCAAT TGTCTATGCG TGATATGGAA ATAAGAGGGG TTGGAAACTT AATAGGCCTA CAGCAGAGTG GACAGATGGA AGCAATTGGT TTTGATATGT ATATGGAAAT GCTACAAGAA TGTATTTCAG ATCTAGAAGG ACATGAAATA CCAAAAGTTG ATGAAACTCT TATAGATTTA CCAATTAACG CATTTATACC AGGTAATTGG ATAGTTGATA ACCAAGAAAA AATTTCAGCT TATAAGGCAG CAACTGATTG CCATACATCA GGAAAACTTA TTGAATTGGG TCTTGCATGG TCAGACAGAT ATGGTGCCTT GCCTAAACCT GTCTCCTCGT TAATGCAAGT AATGCAAATT AAACTAGTCG GTAAAAGTTT AGGATTTTCC CGTATTCGGC AGATAAAACC AAATATTATT TTAGAAACTA AAATGAAGGA ATCTACTTTT AAAGTTCTTA GGAATGGAAT CGATAAGAGC TTACACAGTA GGATACTATA TAAGAAAGGC AATTCTTCTT CGGAGGTATT ACTTAGAGGA CTTGCGAATC AACCCATCGA AAAACAGTTG GATATCCTAT TTGAATGGCT ATCCAAAATG AAGGATGCTA CAACAAACCT AGAATTATAA
|
Protein sequence | MSLEPIIDKL KESSLTSQLI ERIQRNNRLI LTGGSKTAKT IISTTISKAE KLPLIVIVPT LEETTRWYSI LNNFSWDSLY IYPTSENSPY ESIPPTTEIL WGQLQVLTEL CSNDTNNIAI VTTERALQPH LPPKDSFLTS CLSLVKENIF DLDKLAINLT NLGYSKTTTT EEEGQWSRRG DILDIYPVNY ESPIRLEFYG DNIDKIKEFD PVSQRSLDEI NQVIISPVNF DILIANKLSS FTPEILAKYF DADSIDKLKN NIIPSGIRRY LGLAWTSPSS LIDFIDNSSL IITDEPNQCN SHSTAWTEHV SETYHQLETN LELNDNNLAL PPNNLHSTFT TNYDLLNGFY GLDTTDFIDQ YNRENIFDIS SKQILTYPNQ FGRLSEMLKK YQNDKFKIFI YSAQPSRTSS LLNEHDCISV FVENSKDSLR IKTLLDQNTP VALRSSSNFD LEGINFLPWK ILLLTDKEFY GQQLVSHSGY VRRRKRSASK SIDHNKLRTG DYVVHRNHGI GKFIKIEKFV ISQESRDYLL VQYQDGTLRV AADQLGSLGR YRSSSDKSPR IGKLGGTAWL NAKEKARKSI NKVAIDLIRL YAERNKTEGY SFPPDAPWQS ELEDAFQYEP THDQLTAIKD VKNDMEKPKP MDRLVCGDVG YGKTEVAIRA LFKAIISGKQ AALLAPTTIL SQQHWRTLSD RFAPYPIKIA LLNRFKTSRE KNAIVEELKS GTIDLVVGTH LILSNKVCFK DLGLLVVDEE QRFGVKQKER IKQFKKNIDV LTLTATPIPR TLYMSLSGVR EMSLITTPPP LRRAIKTHLI PYEEEAIRSA ICQEIDRGGQ IFYVVPRIEG ITDIATKLSN MIPKIRILIA HGQMDEGELE SSMIAFNDWE ADLMLCTTIV ESGLDIPRVN TILIEDAQQF GLSQLYQLRG RVGRSGVQAH AWLLYPSNTT INDKAKQRLQ AIQEFSQLGS GYQLSMRDME IRGVGNLIGL QQSGQMEAIG FDMYMEMLQE CISDLEGHEI PKVDETLIDL PINAFIPGNW IVDNQEKISA YKAATDCHTS GKLIELGLAW SDRYGALPKP VSSLMQVMQI KLVGKSLGFS RIRQIKPNII LETKMKESTF KVLRNGIDKS LHSRILYKKG NSSSEVLLRG LANQPIEKQL DILFEWLSKM KDATTNLEL
|
| |