Gene P9303_14371 details


Gene Information

Locus tag: P9303_14371
Symbol: mfd
ID: 4778142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1231596
End bp: 1235177
Gene Length: 3582 bp
Protein Length: 1193 aa
Translation table: 11
GC content: 48%
IMG OID: 640086946
Product: transcriptional-repair coupling factor
Protein accession: YP_001017448
Protein GI: 124023141
COG category: [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG1197] Transcription-repair coupling factor (superfamily II helicase)
TIGRFAM ID: [TIGR00580] transcription-repair coupling factor (mfd)
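
The start, end, gene length and protein length values listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1231596
END_BP = 1235177


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "3582 1193"
```

Under these assumptions the result agrees with the listed Gene Length (3582 bp) and Protein Length (1193 aa).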


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCTGA GCTCATTAGT TCGTCAGCTT CAGATGTCGA CTCTCACTGG TGAGTTGGTG 
GACAGAAGTA ATCGAAGCGA TCGCTTGCTG ATGCGTGGAG CCGGGCGAGT GGGGCGTGCA
TTAATTGCAA GTGCAATAGC TCGTCATCAG AGCCGTCCAC TGGTTGTGAT CGTGCCAACA
CTAGAAGAGG CGAATCGCTG GTCTTCGCTG CTTGCCTTGA TGGGTTGGTC GCATAATCAT
CTCTACCCAA CAAGTGAAGG ATCACCTTAT GAGCCATTTG ACCCAACAAC AGAAATCGTC
TGGGGCCAAT TGCAGGTTCT AAGTGAACTA CTTGGTGAAT CAACAAGAAG GTGGGATCGA
GCCATAGTGG CCACAGAAAG AGCTTTACAA CCACATCTAC CTCCTGTTGA GGCACTCGCA
TCACAGTGTG AAATCCTCAG TCGTGGAGAG AATATTGACC TGGAGAGCTT AGCCAGTACT
CTTACCAAGC TTGGATACGA TCGAGTTACA GCTGTTGATC AGGAAGCTAC CTGGAGTCGT
CGTGGTGACA TTGTTGATAT ATTTCCAGTA AGCAGTGAAT TGCCAGTTCG CCTCGAGTTA
TTCGGCGATG AACTTGACAA ATTAAGAGAA TTCGACCCAG TCACTCAACG TTCCCTGGAT
GAAGTTAATG AACTTTGCCT AACTCCTTCA GGGTTTAGTC CATTAATTGC TCATCAACTA
CGACAGTCCA TGCCCGATGG GCTTGACCGT CTTGTTAGTG AGGAGACTTT AGATCAACTA
TTCGAAGGTT CTACACCTGA TGGTATAAGA AGGTTAATGG GAATTGCATG GAACAAACCT
GCCTCGCTAC TTGACTACAT TCCTGCCAAC TCCTTTATCG CAATAGATGA GAAGCGTCAT
GGCTCTGCAC ATGGAAAGCT ATGGCTTCAA CATGCAGAAG AACATCACAA TGATGTCGGT
CAATCAATGG GCTTGTCCAT TGATGAACAA AAGAAGTATT GGCCTCCATT GCTTCATCGC
AGCATCGGAG AAAGCTATGC AAACACAGAT GGCTTTGCCG GTATTGATCT TGCTGAACTC
CATGAAGATG ATGGTTATGC AAATAGTTTT GATCTTGCCA GTCGCCCAAT TCCGGCCAAC
CCAAACCAAT TCGGAAGGTT AGGAGAGCAA ATCAAAAATT ATCAAAAAGC ACATCATCCC
GTTTGGCTTT TGTCAGCGCA ACCAAGCCGT GCTGTGGCTC TTCTTGAGGA ACATGACTGC
ATCACACGCT TCGTCCCAAA CGCTAAAGAT CACCCAGCCA TTGAACGTTT GCTCGAGCAA
AACACGCCAG TAGCATTAAA AACAAGTGGT TCTGTGGATT TAGAGGGGCT GATCTTGCCA
GCCTGGCGAG TTGTTTTGAT GACTGACCAT GAATTTTTTG GTCAAAAAAA CCTTGGCTCT
ACCGGTTATG TTCGACGACG ACGACGGGCC GCGAGTCGTA CGGTTGACCC CAACAAAATG
TGCTCTGGGG ACTTCGTCGT ACATCGCAAT CACGGCATCG GTCGTTTTCT GAAATTAGAA
AAACTGGCCA TCAGTGGTGA GGTCCGTGAC TATTTGGTTA TCCAATATTT GGATGGAACA
CTCAGCGTGG CCGCCGATCA GCTCGGCAGC CTTGGTCGCT ATCGATCAAC AAGTGAATCG
CCACCAAAAC TCAATCGCAT GGGAGGAACA GCGTGGCAAA AAATTAAAGA GCGAACCCGA
AAGTTAGTTC GCAAAGTTGC GATGGATCTG GTCAAGCTCT ATGCAGAGCG ACTCCAGGCC
CCTGGATATG CCTTCCCACC AGATGGACCT TGGCAGATTG AACTAGAAGA ATCATTTCCC
TATGAACCAA CACCTGATCA AGTCAAGGCA GTCGTTGATG TAAAACGCGA TATGGAAGCA
GCACAACCTA TGGATCGGCT TGTGTGCGGA GATGTTGGTT TCGGAAAAAC GGAAGTAGCA
ATACGAGCCA TCTTCAAAGC AATCACGTCT GGACGCCAGA TAGCCATGCT TGCCCCCACA
ACAGTGCTAG CCCAACAACA CTGGAGAACA CTTTCGGACC GCTTCGCTCC CTACCCAATC
AAGGTCGCTT TACTGAACAG ATTCAGAACA AGCTCAGAAC GAAAATCAAT ACTTAATGGC
CTCAAAGAAG GGACAATCGA TGCAGTTGTC GGTACCCACC AGCTACTCAG TAAAAACACA
ACATTCCAAA AACTAGGGTT GTTGGTTGTT GATGAGGAAC AGCGTTTTGG AGTCAATCAA
AAGGAAAAGA TCAAAGTTCT TCGTAAGGAT GTAGATGTTT TGACCCTTTC AGCTACACCA
ATTCCGCGGA CCTTATACAT GAGCCTTTCA GGGGTAAGGG AAATGAGTCT GATCACAACC
CCTCCGCCAT TGCGCCGTCC TATCAAAACC CACCTAGCTG CTTTTGATGA AGAAGCAGTT
CGTAGTTCTA TCCGCCAGGA ACTTGATCGA GGCGGACAGG TGTTCTATGT CGTTCCACGT
GTTGAGGGTA TTGAAGATGT AGCCAGTCAA CTTCAACAGA TGCTGCCCGA TTTGAAGTTG
TTGGTAGCCC ATGGTCAGAT GGCAGAAGGC GAACTTGAGA GCTCGATGGT CGCCTTTAAT
GCAGGGGAGG CCGACTTGAT GCTATGCACC ACGATCGTTG AAAGTGGCCT CGATATCCCA
CGTGTGAACA CTATCCTCAT TGAGGATGCT CATAAATTTG GACTAGCACA GCTCTACCAA
CTACGTGGAC GTGTGGGTAG AAGCGGTGTT CAAGCGCATG CATGGTTGTT CTATCCGGGT
GACGCATCCC TGAGTGATGC CGCTAGACAA CGCCTAAGAG CAATCCAAGA ATTTGCACAG
CTAGGCAGTG GCTATCAACT AGCCATGCGA GACATGGAAA TCCGTGGTGT GGGAAACCTT
CTCGGGGTTG AACAAAGCGG ACAAATGGAA ACCATTGGTT TCGATCTTTA CATGGAAATG
TTGCAGGAAT CACTTGCTGA AATCCAAGGA CAGGGCATTC CATCTGTAGA TGACACTCAA
ATCGATCTAC CGGTAACAGC ATTCGTGCCA GCGGAATGGA TTGTTGATGG TGACGAAAAG
ATCGCTGCTT ACCGAGCTGC AGCAAATTGT GCTTCTCATG AATCACTGAT TGAGTTGGCA
GCTAGCTGGA CAGACCGCTA CGGAGCCATT CCTGGTCCTG TGCAATCACT TCTTCAACTC
ATGGAGCTCA AACTCTTAGC TCGTCGCTGC GGGATCTCGA GAATTAAACC AGAAAAGCCA
AATATTGCGA TGGAAACTCC GATGGAGGAG CCCGCCTTCC GGCTACTTAG GCAAGGTTTA
CCGCAACACC TGCACGGCCG ACTGATTTAC CAGACTGGAA GTGGAAATAA AGCCAAGGTG
CTGGCAAGAG GTCTAAGCGT CTTGCCTATG GAAAAACAGC TAGAACAACT GATGGAGTGG
TTGCGTCTCA TGGCCACTCA GATTCCTTGC GAGGATGGAT TAACTGCAAG TCAGCAAAAG
CAGCAAGCCG TAGAGCGAGA TGAGGCCGTC ATTACTCCCT AA
 
Protein sequence
MPLSSLVRQL QMSTLTGELV DRSNRSDRLL MRGAGRVGRA LIASAIARHQ SRPLVVIVPT 
LEEANRWSSL LALMGWSHNH LYPTSEGSPY EPFDPTTEIV WGQLQVLSEL LGESTRRWDR
AIVATERALQ PHLPPVEALA SQCEILSRGE NIDLESLAST LTKLGYDRVT AVDQEATWSR
RGDIVDIFPV SSELPVRLEL FGDELDKLRE FDPVTQRSLD EVNELCLTPS GFSPLIAHQL
RQSMPDGLDR LVSEETLDQL FEGSTPDGIR RLMGIAWNKP ASLLDYIPAN SFIAIDEKRH
GSAHGKLWLQ HAEEHHNDVG QSMGLSIDEQ KKYWPPLLHR SIGESYANTD GFAGIDLAEL
HEDDGYANSF DLASRPIPAN PNQFGRLGEQ IKNYQKAHHP VWLLSAQPSR AVALLEEHDC
ITRFVPNAKD HPAIERLLEQ NTPVALKTSG SVDLEGLILP AWRVVLMTDH EFFGQKNLGS
TGYVRRRRRA ASRTVDPNKM CSGDFVVHRN HGIGRFLKLE KLAISGEVRD YLVIQYLDGT
LSVAADQLGS LGRYRSTSES PPKLNRMGGT AWQKIKERTR KLVRKVAMDL VKLYAERLQA
PGYAFPPDGP WQIELEESFP YEPTPDQVKA VVDVKRDMEA AQPMDRLVCG DVGFGKTEVA
IRAIFKAITS GRQIAMLAPT TVLAQQHWRT LSDRFAPYPI KVALLNRFRT SSERKSILNG
LKEGTIDAVV GTHQLLSKNT TFQKLGLLVV DEEQRFGVNQ KEKIKVLRKD VDVLTLSATP
IPRTLYMSLS GVREMSLITT PPPLRRPIKT HLAAFDEEAV RSSIRQELDR GGQVFYVVPR
VEGIEDVASQ LQQMLPDLKL LVAHGQMAEG ELESSMVAFN AGEADLMLCT TIVESGLDIP
RVNTILIEDA HKFGLAQLYQ LRGRVGRSGV QAHAWLFYPG DASLSDAARQ RLRAIQEFAQ
LGSGYQLAMR DMEIRGVGNL LGVEQSGQME TIGFDLYMEM LQESLAEIQG QGIPSVDDTQ
IDLPVTAFVP AEWIVDGDEK IAAYRAAANC ASHESLIELA ASWTDRYGAI PGPVQSLLQL
MELKLLARRC GISRIKPEKP NIAMETPMEE PAFRLLRQGL PQHLHGRLIY QTGSGNKAKV
LARGLSVLPM EKQLEQLMEW LRLMATQIPC EDGLTASQQK QQAVERDEAV ITP
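
The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons are permitted as starts). The following is a minimal sketch of that translation and of a GC-fraction calculation. The function names `translate` and `gc_content` are illustrative, and alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
# Codon table built from the compact NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in 10s."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCCTCTGA GCTCATTAGT TCGTCAGCTT"))  # prints "MPLSSLVRQL"
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence, and `gc_content` can be compared against the GC content field reported for the gene.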