Gene A9601_10301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_10301 
Symbolmfd 
ID4717741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp890248 
End bp893760 
Gene Length3513 bp 
Protein Length1170 aa 
Translation table11 
GC content28% 
IMG OID640078745 
Producttranscriptional-repair coupling factor 
Protein accessionYP_001009421 
Protein GI123968563 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTAA ATACTTTAGT TGATTATATT TCGAACTCAC AAATTACTTC TGAATTAGTA 
AAAAGAATTT CAAAAAATAA TGAATTAAAT ATTGTTGGTT CAAGTAGATA TGCTAAATCA
ATAATCTTAG ATAGCATCGC AAAAAAAGAG AAAAAAAATA TATTATTAAT TTGTCCTAAT
GTAGAAATTG CCTACAAATG GATTGGTTAT TTTGAAAGTA TAAATGATAA AGCAGTTTTA
TATTATCCTC CAAAAGAACA TCTACCATAC TCATCAATTA ATAAATCCAA AGAGATTGAA
TTTAGTCAGC TTACTGTTTT ATCCAAATTA ATAAAAAAAG AGAAAAATGA ACTTAATATT
GTTATATCAA CAGAGAGATC ACTACAACCT CATCTAATAA ATAAAAACTT ATTAATTGAA
AATAAGTTGA ATTTACAAAA AGGGGTTCAA ATCGAGATTC AAGAATTAGC AAATAAACTT
TCTTTACTTG GTTATACGAA GGATAATGTA ACTTCAACAG AGGGATTCTG GAGTAGGAGA
GGGGAAATAA TAGATATTTA TCCCGTCAAT AATGAGTTTC CTATAAGATT AGAATTTTTT
GATAATGTAA TTGAGAAGAT AAGAGAATAT GATCCCCATA CACAAAAAAC ATTAGAAAGT
ATTAATAATA TTGAAATAAT ACAGGCTGGA TTTAATTTGC TAATTAAAGA TAAGTTAAAT
AATTTTTCTA AGAACGGTAT TTTTAATTCA GAAGATATAA ATAAAAATAA TCTTGATCGG
TATTTAGGAA TAATTGAAAA AACCCCCTCA AATATAATAG ATTTTATTGA TAGGGAAACA
ATTCTCGTAA TTGATGAATT AGAAGATTGT ACTAAATTTG CAAATAATTG GTATCAAGAT
TCAGAAAGTA ATTTTGATAA TTGTGAGTAT GAATTAAATG AGAACCTTAA AAATAATGAT
ATTAATTTAC AGGCCAAACC TAATTTACAT TTAAAGTTTG ACGAAATATT AAATTCACTA
GGAAATTTTA ATTTGATAAA ATTTTATGAA TTTGAATCTA AAACCAATAT TGATAATAAG
TTTTTGTTAA ATGATAAAAG AATAAATTCA TACTCTAAAA ATATAGGAAA ATTATCCAAT
GATATAAATA AAAATATAAA AAATAATGAG AAAGTATGGA TATTATCAGC ACAGCCATTG
AGGACTAGGA CTTTACTTTT TGAGCACGAA TGTAATACAA ATTTCTTAAA CAATCCTAAT
GATATTGATG AAGCATTTAA GTCAATTAAT AATTCAACTC CTTTAATTTT AAAAAATAAG
AACAATTATG AAATCGAGGG TTTTTATCTT CCAATTTGGA AAGTTGTCCT AATAACAGAT
AAAGAATTAT TTTCACAACA ATCTCTTTTT CATAATGTAT TCATAAGAAG AAAAAAAAGA
AGTGTAAATT CAAATATAAA CGTAAACAAG ATTAGTCCCG GTGATTTTAT AGTTCATAAA
AATCATGGAA TAGGAAAATT TTTAAAAATA GAAAAAATAA ATATAACTGG AGATTCAAGA
GATTATTTAG TCATTCAGTA TCAGGATGGG AAGATAAGTG TTGCCGCTGA TCAACTTGGT
AGTGTTAACA GATATAGATC AAGTGGAAAA ATAAAGCCAA AAATAAATAA ATTAGGAGGG
ACGGAATGGG AAAGAATAAA AGACAAAAAT AAGAAACAAA TCAAAAAAGT TGCTGTCGAT
ATTTTAAAAC TTTATGCAAA GAGAGAGAAA TTAAAGGGTT ACATTTACCC AGAAGATGGT
CCTTGGCAAG ATGAATTAGA GGAATCATTC CCTTATCAAC CAACACCTGA TCAAATTACT
GCTGTAGAAG AAATAAAATC TGATATGGAA AGCGAAAAGC CAATGGACAG GCTAGTTTGT
GGAGATGTAG GATTTGGCAA AACAGAAGTC GCTGTTCGGG CTATTTTTAA GGCTATTACA
TCAGGCAAAC AGGTAATATT ACTTGCTCCC ACAACAATCC TAGCTCAGCA ACATTGGAGA
ACAATAAGCA ATAGATTTTC ACCTTACCCA ATAAAAGTAT CATTACTCAA TAGATTCAAA
ACCGTTAATG AAAGAAAGGA AATCTATGCT GGTTTGAAAA ATAACAAAAT TGATTTAGTT
GTAGCAACGC ACCAAATTTT AGGAAAGGAA ATAGAGATAA AAAACTTAGG ACTACTTGTA
ATTGATGAAG AACAAAGATT TGGAGTAAGG CAAAAGGAGA AAATTAAAAA AATCAAAACA
AGCATAGACG TATTAACTCT ATCGGCAACT CCAATTCCAA GAACTCTTTA TATGAGTTTA
TCTGGACTAA GACAAATGAG CTTACTAAAT ACTCCTCCTC CATCAAGAAG ATCAATAAAA
ACCTATTTAG CTGAAATAGA TATGGATGTT ATAAGAACTG CCATTAATCA AGAACTTGAT
AGGGGAGGTC AAATTTTTTA TGTTCTTCCA AGAATTTCTG ATATTAATCA AGCTTTAAAT
AAATTAAAAA ATATTTTTCC AAGCTTAAAA TTTATTGTTG CTCATGGGCA AATGAACGAA
ACAGAGCTTG AAAATGCAAT GATTGCTTTT AATAATGGAG AAGTAGATCT TATGATATGC
ACAACAATAA TTGAAAGTGG ATTAGACATA CCTAAAGTAA ATACAATCAT TATTGAAGAT
TCTCACAAAT TTGGCCTTTC ACAACTTTAT CAACTTAGAG GAAGAGTTGG TAGAAGCAGT
GTACAAGCAC ATGCTTGGTT GTTTTATCCA GATATAAATA AAATTAATGA CGCTGCAAAA
CAAAGATTGA AAGCTATAAA AGATTTTTCA GAACTAGGAA GTGGTTACCA ACTTGCAATG
AAAGATATGG AAATAAGAGG TGTTGGTAGT TTATTAGGAG AAGAACAAAG TGGAAAGGTT
AATGCTATTG GATATGATTT ATATATAGAA ATGCTCCATG AGGCTATTTC AGAAATCAGT
GGGCAAGAGA TACCTGAAGT TAACGATACT CAAATTGATT TACCAATAAA TGCTTTTATA
CCTGCAACAT GGATATTAAA CAGAGAAGAG AAGCTTGAGG CTTACAAATC TGCTACTGAA
TGTTCAAAAA ATGATGAATT AACTGAATTA GCTACAGACT GGGTAAATAG ATATGGAAAC
TTACCCAAAC CTGTTGAGTC CTTAATTATG ATAATGAGAC TAAAATTACT AGCTAAAAAA
TGTGGTTTTA GTAAGATCAA GCTCAAAAAG CCAAACATCT TGATAGAGAC AAAATTAAAA
AATTCTACTT TTAAAATTCT TAAAAATTCT TTGGCAAGTA GTGTTCAAAA TAAATTTAAT
TTTAATGAAG GCGAACAATT ATCAATCATC ACTATAAGGG GTTTAGGTGC AACTGAAATT
CAAAATCAAA TTGATCAACT TATGTTGTGG TTCGAATCTT TTGAAAGAGA AATAAAGAAT
TTCGATAAAG AACTTCTTAT GAAAAAAGAA TAA
 
Protein sequence
MSLNTLVDYI SNSQITSELV KRISKNNELN IVGSSRYAKS IILDSIAKKE KKNILLICPN 
VEIAYKWIGY FESINDKAVL YYPPKEHLPY SSINKSKEIE FSQLTVLSKL IKKEKNELNI
VISTERSLQP HLINKNLLIE NKLNLQKGVQ IEIQELANKL SLLGYTKDNV TSTEGFWSRR
GEIIDIYPVN NEFPIRLEFF DNVIEKIREY DPHTQKTLES INNIEIIQAG FNLLIKDKLN
NFSKNGIFNS EDINKNNLDR YLGIIEKTPS NIIDFIDRET ILVIDELEDC TKFANNWYQD
SESNFDNCEY ELNENLKNND INLQAKPNLH LKFDEILNSL GNFNLIKFYE FESKTNIDNK
FLLNDKRINS YSKNIGKLSN DINKNIKNNE KVWILSAQPL RTRTLLFEHE CNTNFLNNPN
DIDEAFKSIN NSTPLILKNK NNYEIEGFYL PIWKVVLITD KELFSQQSLF HNVFIRRKKR
SVNSNINVNK ISPGDFIVHK NHGIGKFLKI EKINITGDSR DYLVIQYQDG KISVAADQLG
SVNRYRSSGK IKPKINKLGG TEWERIKDKN KKQIKKVAVD ILKLYAKREK LKGYIYPEDG
PWQDELEESF PYQPTPDQIT AVEEIKSDME SEKPMDRLVC GDVGFGKTEV AVRAIFKAIT
SGKQVILLAP TTILAQQHWR TISNRFSPYP IKVSLLNRFK TVNERKEIYA GLKNNKIDLV
VATHQILGKE IEIKNLGLLV IDEEQRFGVR QKEKIKKIKT SIDVLTLSAT PIPRTLYMSL
SGLRQMSLLN TPPPSRRSIK TYLAEIDMDV IRTAINQELD RGGQIFYVLP RISDINQALN
KLKNIFPSLK FIVAHGQMNE TELENAMIAF NNGEVDLMIC TTIIESGLDI PKVNTIIIED
SHKFGLSQLY QLRGRVGRSS VQAHAWLFYP DINKINDAAK QRLKAIKDFS ELGSGYQLAM
KDMEIRGVGS LLGEEQSGKV NAIGYDLYIE MLHEAISEIS GQEIPEVNDT QIDLPINAFI
PATWILNREE KLEAYKSATE CSKNDELTEL ATDWVNRYGN LPKPVESLIM IMRLKLLAKK
CGFSKIKLKK PNILIETKLK NSTFKILKNS LASSVQNKFN FNEGEQLSII TIRGLGATEI
QNQIDQLMLW FESFEREIKN FDKELLMKKE