Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_10301 |
Symbol | mfd |
ID | 4717741 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 890248 |
End bp | 893760 |
Gene Length | 3513 bp |
Protein Length | 1170 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640078745 |
Product | transcriptional-repair coupling factor |
Protein accession | YP_001009421 |
Protein GI | 123968563 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1197] Transcription-repair coupling factor (superfamily II helicase) |
TIGRFAM ID | [TIGR00580] transcription-repair coupling factor (mfd) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTAA ATACTTTAGT TGATTATATT TCGAACTCAC AAATTACTTC TGAATTAGTA AAAAGAATTT CAAAAAATAA TGAATTAAAT ATTGTTGGTT CAAGTAGATA TGCTAAATCA ATAATCTTAG ATAGCATCGC AAAAAAAGAG AAAAAAAATA TATTATTAAT TTGTCCTAAT GTAGAAATTG CCTACAAATG GATTGGTTAT TTTGAAAGTA TAAATGATAA AGCAGTTTTA TATTATCCTC CAAAAGAACA TCTACCATAC TCATCAATTA ATAAATCCAA AGAGATTGAA TTTAGTCAGC TTACTGTTTT ATCCAAATTA ATAAAAAAAG AGAAAAATGA ACTTAATATT GTTATATCAA CAGAGAGATC ACTACAACCT CATCTAATAA ATAAAAACTT ATTAATTGAA AATAAGTTGA ATTTACAAAA AGGGGTTCAA ATCGAGATTC AAGAATTAGC AAATAAACTT TCTTTACTTG GTTATACGAA GGATAATGTA ACTTCAACAG AGGGATTCTG GAGTAGGAGA GGGGAAATAA TAGATATTTA TCCCGTCAAT AATGAGTTTC CTATAAGATT AGAATTTTTT GATAATGTAA TTGAGAAGAT AAGAGAATAT GATCCCCATA CACAAAAAAC ATTAGAAAGT ATTAATAATA TTGAAATAAT ACAGGCTGGA TTTAATTTGC TAATTAAAGA TAAGTTAAAT AATTTTTCTA AGAACGGTAT TTTTAATTCA GAAGATATAA ATAAAAATAA TCTTGATCGG TATTTAGGAA TAATTGAAAA AACCCCCTCA AATATAATAG ATTTTATTGA TAGGGAAACA ATTCTCGTAA TTGATGAATT AGAAGATTGT ACTAAATTTG CAAATAATTG GTATCAAGAT TCAGAAAGTA ATTTTGATAA TTGTGAGTAT GAATTAAATG AGAACCTTAA AAATAATGAT ATTAATTTAC AGGCCAAACC TAATTTACAT TTAAAGTTTG ACGAAATATT AAATTCACTA GGAAATTTTA ATTTGATAAA ATTTTATGAA TTTGAATCTA AAACCAATAT TGATAATAAG TTTTTGTTAA ATGATAAAAG AATAAATTCA TACTCTAAAA ATATAGGAAA ATTATCCAAT GATATAAATA AAAATATAAA AAATAATGAG AAAGTATGGA TATTATCAGC ACAGCCATTG AGGACTAGGA CTTTACTTTT TGAGCACGAA TGTAATACAA ATTTCTTAAA CAATCCTAAT GATATTGATG AAGCATTTAA GTCAATTAAT AATTCAACTC CTTTAATTTT AAAAAATAAG AACAATTATG AAATCGAGGG TTTTTATCTT CCAATTTGGA AAGTTGTCCT AATAACAGAT AAAGAATTAT TTTCACAACA ATCTCTTTTT CATAATGTAT TCATAAGAAG AAAAAAAAGA AGTGTAAATT CAAATATAAA CGTAAACAAG ATTAGTCCCG GTGATTTTAT AGTTCATAAA AATCATGGAA TAGGAAAATT TTTAAAAATA GAAAAAATAA ATATAACTGG AGATTCAAGA GATTATTTAG TCATTCAGTA TCAGGATGGG AAGATAAGTG TTGCCGCTGA TCAACTTGGT AGTGTTAACA GATATAGATC AAGTGGAAAA ATAAAGCCAA AAATAAATAA ATTAGGAGGG ACGGAATGGG AAAGAATAAA AGACAAAAAT AAGAAACAAA TCAAAAAAGT TGCTGTCGAT ATTTTAAAAC TTTATGCAAA GAGAGAGAAA TTAAAGGGTT ACATTTACCC AGAAGATGGT CCTTGGCAAG ATGAATTAGA GGAATCATTC CCTTATCAAC CAACACCTGA TCAAATTACT GCTGTAGAAG AAATAAAATC TGATATGGAA AGCGAAAAGC CAATGGACAG GCTAGTTTGT GGAGATGTAG GATTTGGCAA AACAGAAGTC GCTGTTCGGG CTATTTTTAA GGCTATTACA TCAGGCAAAC AGGTAATATT ACTTGCTCCC ACAACAATCC TAGCTCAGCA ACATTGGAGA ACAATAAGCA ATAGATTTTC ACCTTACCCA ATAAAAGTAT CATTACTCAA TAGATTCAAA ACCGTTAATG AAAGAAAGGA AATCTATGCT GGTTTGAAAA ATAACAAAAT TGATTTAGTT GTAGCAACGC ACCAAATTTT AGGAAAGGAA ATAGAGATAA AAAACTTAGG ACTACTTGTA ATTGATGAAG AACAAAGATT TGGAGTAAGG CAAAAGGAGA AAATTAAAAA AATCAAAACA AGCATAGACG TATTAACTCT ATCGGCAACT CCAATTCCAA GAACTCTTTA TATGAGTTTA TCTGGACTAA GACAAATGAG CTTACTAAAT ACTCCTCCTC CATCAAGAAG ATCAATAAAA ACCTATTTAG CTGAAATAGA TATGGATGTT ATAAGAACTG CCATTAATCA AGAACTTGAT AGGGGAGGTC AAATTTTTTA TGTTCTTCCA AGAATTTCTG ATATTAATCA AGCTTTAAAT AAATTAAAAA ATATTTTTCC AAGCTTAAAA TTTATTGTTG CTCATGGGCA AATGAACGAA ACAGAGCTTG AAAATGCAAT GATTGCTTTT AATAATGGAG AAGTAGATCT TATGATATGC ACAACAATAA TTGAAAGTGG ATTAGACATA CCTAAAGTAA ATACAATCAT TATTGAAGAT TCTCACAAAT TTGGCCTTTC ACAACTTTAT CAACTTAGAG GAAGAGTTGG TAGAAGCAGT GTACAAGCAC ATGCTTGGTT GTTTTATCCA GATATAAATA AAATTAATGA CGCTGCAAAA CAAAGATTGA AAGCTATAAA AGATTTTTCA GAACTAGGAA GTGGTTACCA ACTTGCAATG AAAGATATGG AAATAAGAGG TGTTGGTAGT TTATTAGGAG AAGAACAAAG TGGAAAGGTT AATGCTATTG GATATGATTT ATATATAGAA ATGCTCCATG AGGCTATTTC AGAAATCAGT GGGCAAGAGA TACCTGAAGT TAACGATACT CAAATTGATT TACCAATAAA TGCTTTTATA CCTGCAACAT GGATATTAAA CAGAGAAGAG AAGCTTGAGG CTTACAAATC TGCTACTGAA TGTTCAAAAA ATGATGAATT AACTGAATTA GCTACAGACT GGGTAAATAG ATATGGAAAC TTACCCAAAC CTGTTGAGTC CTTAATTATG ATAATGAGAC TAAAATTACT AGCTAAAAAA TGTGGTTTTA GTAAGATCAA GCTCAAAAAG CCAAACATCT TGATAGAGAC AAAATTAAAA AATTCTACTT TTAAAATTCT TAAAAATTCT TTGGCAAGTA GTGTTCAAAA TAAATTTAAT TTTAATGAAG GCGAACAATT ATCAATCATC ACTATAAGGG GTTTAGGTGC AACTGAAATT CAAAATCAAA TTGATCAACT TATGTTGTGG TTCGAATCTT TTGAAAGAGA AATAAAGAAT TTCGATAAAG AACTTCTTAT GAAAAAAGAA TAA
|
Protein sequence | MSLNTLVDYI SNSQITSELV KRISKNNELN IVGSSRYAKS IILDSIAKKE KKNILLICPN VEIAYKWIGY FESINDKAVL YYPPKEHLPY SSINKSKEIE FSQLTVLSKL IKKEKNELNI VISTERSLQP HLINKNLLIE NKLNLQKGVQ IEIQELANKL SLLGYTKDNV TSTEGFWSRR GEIIDIYPVN NEFPIRLEFF DNVIEKIREY DPHTQKTLES INNIEIIQAG FNLLIKDKLN NFSKNGIFNS EDINKNNLDR YLGIIEKTPS NIIDFIDRET ILVIDELEDC TKFANNWYQD SESNFDNCEY ELNENLKNND INLQAKPNLH LKFDEILNSL GNFNLIKFYE FESKTNIDNK FLLNDKRINS YSKNIGKLSN DINKNIKNNE KVWILSAQPL RTRTLLFEHE CNTNFLNNPN DIDEAFKSIN NSTPLILKNK NNYEIEGFYL PIWKVVLITD KELFSQQSLF HNVFIRRKKR SVNSNINVNK ISPGDFIVHK NHGIGKFLKI EKINITGDSR DYLVIQYQDG KISVAADQLG SVNRYRSSGK IKPKINKLGG TEWERIKDKN KKQIKKVAVD ILKLYAKREK LKGYIYPEDG PWQDELEESF PYQPTPDQIT AVEEIKSDME SEKPMDRLVC GDVGFGKTEV AVRAIFKAIT SGKQVILLAP TTILAQQHWR TISNRFSPYP IKVSLLNRFK TVNERKEIYA GLKNNKIDLV VATHQILGKE IEIKNLGLLV IDEEQRFGVR QKEKIKKIKT SIDVLTLSAT PIPRTLYMSL SGLRQMSLLN TPPPSRRSIK TYLAEIDMDV IRTAINQELD RGGQIFYVLP RISDINQALN KLKNIFPSLK FIVAHGQMNE TELENAMIAF NNGEVDLMIC TTIIESGLDI PKVNTIIIED SHKFGLSQLY QLRGRVGRSS VQAHAWLFYP DINKINDAAK QRLKAIKDFS ELGSGYQLAM KDMEIRGVGS LLGEEQSGKV NAIGYDLYIE MLHEAISEIS GQEIPEVNDT QIDLPINAFI PATWILNREE KLEAYKSATE CSKNDELTEL ATDWVNRYGN LPKPVESLIM IMRLKLLAKK CGFSKIKLKK PNILIETKLK NSTFKILKNS LASSVQNKFN FNEGEQLSII TIRGLGATEI QNQIDQLMLW FESFEREIKN FDKELLMKKE
|
| |