Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_14371 |
Symbol | mfd |
ID | 4778142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1231596 |
End bp | 1235177 |
Gene Length | 3582 bp |
Protein Length | 1193 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640086946 |
Product | transcriptional-repair coupling factor |
Protein accession | YP_001017448 |
Protein GI | 124023141 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCTGA GCTCATTAGT TCGTCAGCTT CAGATGTCGA CTCTCACTGG TGAGTTGGTG GACAGAAGTA ATCGAAGCGA TCGCTTGCTG ATGCGTGGAG CCGGGCGAGT GGGGCGTGCA TTAATTGCAA GTGCAATAGC TCGTCATCAG AGCCGTCCAC TGGTTGTGAT CGTGCCAACA CTAGAAGAGG CGAATCGCTG GTCTTCGCTG CTTGCCTTGA TGGGTTGGTC GCATAATCAT CTCTACCCAA CAAGTGAAGG ATCACCTTAT GAGCCATTTG ACCCAACAAC AGAAATCGTC TGGGGCCAAT TGCAGGTTCT AAGTGAACTA CTTGGTGAAT CAACAAGAAG GTGGGATCGA GCCATAGTGG CCACAGAAAG AGCTTTACAA CCACATCTAC CTCCTGTTGA GGCACTCGCA TCACAGTGTG AAATCCTCAG TCGTGGAGAG AATATTGACC TGGAGAGCTT AGCCAGTACT CTTACCAAGC TTGGATACGA TCGAGTTACA GCTGTTGATC AGGAAGCTAC CTGGAGTCGT CGTGGTGACA TTGTTGATAT ATTTCCAGTA AGCAGTGAAT TGCCAGTTCG CCTCGAGTTA TTCGGCGATG AACTTGACAA ATTAAGAGAA TTCGACCCAG TCACTCAACG TTCCCTGGAT GAAGTTAATG AACTTTGCCT AACTCCTTCA GGGTTTAGTC CATTAATTGC TCATCAACTA CGACAGTCCA TGCCCGATGG GCTTGACCGT CTTGTTAGTG AGGAGACTTT AGATCAACTA TTCGAAGGTT CTACACCTGA TGGTATAAGA AGGTTAATGG GAATTGCATG GAACAAACCT GCCTCGCTAC TTGACTACAT TCCTGCCAAC TCCTTTATCG CAATAGATGA GAAGCGTCAT GGCTCTGCAC ATGGAAAGCT ATGGCTTCAA CATGCAGAAG AACATCACAA TGATGTCGGT CAATCAATGG GCTTGTCCAT TGATGAACAA AAGAAGTATT GGCCTCCATT GCTTCATCGC AGCATCGGAG AAAGCTATGC AAACACAGAT GGCTTTGCCG GTATTGATCT TGCTGAACTC CATGAAGATG ATGGTTATGC AAATAGTTTT GATCTTGCCA GTCGCCCAAT TCCGGCCAAC CCAAACCAAT TCGGAAGGTT AGGAGAGCAA ATCAAAAATT ATCAAAAAGC ACATCATCCC GTTTGGCTTT TGTCAGCGCA ACCAAGCCGT GCTGTGGCTC TTCTTGAGGA ACATGACTGC ATCACACGCT TCGTCCCAAA CGCTAAAGAT CACCCAGCCA TTGAACGTTT GCTCGAGCAA AACACGCCAG TAGCATTAAA AACAAGTGGT TCTGTGGATT TAGAGGGGCT GATCTTGCCA GCCTGGCGAG TTGTTTTGAT GACTGACCAT GAATTTTTTG GTCAAAAAAA CCTTGGCTCT ACCGGTTATG TTCGACGACG ACGACGGGCC GCGAGTCGTA CGGTTGACCC CAACAAAATG TGCTCTGGGG ACTTCGTCGT ACATCGCAAT CACGGCATCG GTCGTTTTCT GAAATTAGAA AAACTGGCCA TCAGTGGTGA GGTCCGTGAC TATTTGGTTA TCCAATATTT GGATGGAACA CTCAGCGTGG CCGCCGATCA GCTCGGCAGC CTTGGTCGCT ATCGATCAAC AAGTGAATCG CCACCAAAAC TCAATCGCAT GGGAGGAACA GCGTGGCAAA AAATTAAAGA GCGAACCCGA AAGTTAGTTC GCAAAGTTGC GATGGATCTG GTCAAGCTCT ATGCAGAGCG ACTCCAGGCC CCTGGATATG CCTTCCCACC AGATGGACCT TGGCAGATTG AACTAGAAGA ATCATTTCCC TATGAACCAA CACCTGATCA AGTCAAGGCA GTCGTTGATG TAAAACGCGA TATGGAAGCA GCACAACCTA TGGATCGGCT TGTGTGCGGA GATGTTGGTT TCGGAAAAAC GGAAGTAGCA ATACGAGCCA TCTTCAAAGC AATCACGTCT GGACGCCAGA TAGCCATGCT TGCCCCCACA ACAGTGCTAG CCCAACAACA CTGGAGAACA CTTTCGGACC GCTTCGCTCC CTACCCAATC AAGGTCGCTT TACTGAACAG ATTCAGAACA AGCTCAGAAC GAAAATCAAT ACTTAATGGC CTCAAAGAAG GGACAATCGA TGCAGTTGTC GGTACCCACC AGCTACTCAG TAAAAACACA ACATTCCAAA AACTAGGGTT GTTGGTTGTT GATGAGGAAC AGCGTTTTGG AGTCAATCAA AAGGAAAAGA TCAAAGTTCT TCGTAAGGAT GTAGATGTTT TGACCCTTTC AGCTACACCA ATTCCGCGGA CCTTATACAT GAGCCTTTCA GGGGTAAGGG AAATGAGTCT GATCACAACC CCTCCGCCAT TGCGCCGTCC TATCAAAACC CACCTAGCTG CTTTTGATGA AGAAGCAGTT CGTAGTTCTA TCCGCCAGGA ACTTGATCGA GGCGGACAGG TGTTCTATGT CGTTCCACGT GTTGAGGGTA TTGAAGATGT AGCCAGTCAA CTTCAACAGA TGCTGCCCGA TTTGAAGTTG TTGGTAGCCC ATGGTCAGAT GGCAGAAGGC GAACTTGAGA GCTCGATGGT CGCCTTTAAT GCAGGGGAGG CCGACTTGAT GCTATGCACC ACGATCGTTG AAAGTGGCCT CGATATCCCA CGTGTGAACA CTATCCTCAT TGAGGATGCT CATAAATTTG GACTAGCACA GCTCTACCAA CTACGTGGAC GTGTGGGTAG AAGCGGTGTT CAAGCGCATG CATGGTTGTT CTATCCGGGT GACGCATCCC TGAGTGATGC CGCTAGACAA CGCCTAAGAG CAATCCAAGA ATTTGCACAG CTAGGCAGTG GCTATCAACT AGCCATGCGA GACATGGAAA TCCGTGGTGT GGGAAACCTT CTCGGGGTTG AACAAAGCGG ACAAATGGAA ACCATTGGTT TCGATCTTTA CATGGAAATG TTGCAGGAAT CACTTGCTGA AATCCAAGGA CAGGGCATTC CATCTGTAGA TGACACTCAA ATCGATCTAC CGGTAACAGC ATTCGTGCCA GCGGAATGGA TTGTTGATGG TGACGAAAAG ATCGCTGCTT ACCGAGCTGC AGCAAATTGT GCTTCTCATG AATCACTGAT TGAGTTGGCA GCTAGCTGGA CAGACCGCTA CGGAGCCATT CCTGGTCCTG TGCAATCACT TCTTCAACTC ATGGAGCTCA AACTCTTAGC TCGTCGCTGC GGGATCTCGA GAATTAAACC AGAAAAGCCA AATATTGCGA TGGAAACTCC GATGGAGGAG CCCGCCTTCC GGCTACTTAG GCAAGGTTTA CCGCAACACC TGCACGGCCG ACTGATTTAC CAGACTGGAA GTGGAAATAA AGCCAAGGTG CTGGCAAGAG GTCTAAGCGT CTTGCCTATG GAAAAACAGC TAGAACAACT GATGGAGTGG TTGCGTCTCA TGGCCACTCA GATTCCTTGC GAGGATGGAT TAACTGCAAG TCAGCAAAAG CAGCAAGCCG TAGAGCGAGA TGAGGCCGTC ATTACTCCCT AA
|
Protein sequence | MPLSSLVRQL QMSTLTGELV DRSNRSDRLL MRGAGRVGRA LIASAIARHQ SRPLVVIVPT LEEANRWSSL LALMGWSHNH LYPTSEGSPY EPFDPTTEIV WGQLQVLSEL LGESTRRWDR AIVATERALQ PHLPPVEALA SQCEILSRGE NIDLESLAST LTKLGYDRVT AVDQEATWSR RGDIVDIFPV SSELPVRLEL FGDELDKLRE FDPVTQRSLD EVNELCLTPS GFSPLIAHQL RQSMPDGLDR LVSEETLDQL FEGSTPDGIR RLMGIAWNKP ASLLDYIPAN SFIAIDEKRH GSAHGKLWLQ HAEEHHNDVG QSMGLSIDEQ KKYWPPLLHR SIGESYANTD GFAGIDLAEL HEDDGYANSF DLASRPIPAN PNQFGRLGEQ IKNYQKAHHP VWLLSAQPSR AVALLEEHDC ITRFVPNAKD HPAIERLLEQ NTPVALKTSG SVDLEGLILP AWRVVLMTDH EFFGQKNLGS TGYVRRRRRA ASRTVDPNKM CSGDFVVHRN HGIGRFLKLE KLAISGEVRD YLVIQYLDGT LSVAADQLGS LGRYRSTSES PPKLNRMGGT AWQKIKERTR KLVRKVAMDL VKLYAERLQA PGYAFPPDGP WQIELEESFP YEPTPDQVKA VVDVKRDMEA AQPMDRLVCG DVGFGKTEVA IRAIFKAITS GRQIAMLAPT TVLAQQHWRT LSDRFAPYPI KVALLNRFRT SSERKSILNG LKEGTIDAVV GTHQLLSKNT TFQKLGLLVV DEEQRFGVNQ KEKIKVLRKD VDVLTLSATP IPRTLYMSLS GVREMSLITT PPPLRRPIKT HLAAFDEEAV RSSIRQELDR GGQVFYVVPR VEGIEDVASQ LQQMLPDLKL LVAHGQMAEG ELESSMVAFN AGEADLMLCT TIVESGLDIP RVNTILIEDA HKFGLAQLYQ LRGRVGRSGV QAHAWLFYPG DASLSDAARQ RLRAIQEFAQ LGSGYQLAMR DMEIRGVGNL LGVEQSGQME TIGFDLYMEM LQESLAEIQG QGIPSVDDTQ IDLPVTAFVP AEWIVDGDEK IAAYRAAANC ASHESLIELA ASWTDRYGAI PGPVQSLLQL MELKLLARRC GISRIKPEKP NIAMETPMEE PAFRLLRQGL PQHLHGRLIY QTGSGNKAKV LARGLSVLPM EKQLEQLMEW LRLMATQIPC EDGLTASQQK QQAVERDEAV ITP
|
| |