Gene Information |
Locus tag | P9515_00371 |
Symbol | |
ID | 4719868 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 36830 |
End bp | 39763 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640079698 |
Product | hypothetical protein |
Protein accession | YP_001010353 |
Protein GI | 123965272 |
COG category | [R] General function prediction only |
COG ID | [COG4889] Predicted helicase |
TIGRFAM ID | |
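The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which the listed Gene Length is consistent with) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the record above; names are illustrative only.
START_BP = 36830
END_BP = 39763


def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2934, matches "Gene Length"
print(protein_length(length_bp))  # 977, matches "Protein Length"
```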
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCAACCT TTGATGAATT TTATGCTTCC TTAGATCCTG ATATTGGGAT TAGAGGAAAG CAATTTGAAA AATTCGTTAA ATGGTTTCTT AAGACTGATC CGACATGGGC AAGTCAAATA GATGAAGTGT GGTTATGGAA TGATTATCCA AAGAGATGGG GTGCTGATTG CGGAATAGAT TTAATCTTTA CTCAAAAGAA TGGGAAGACC TGGGCAGTTC AATCAAAATG TGTTTCACCA AATAATGATA TTAAGAAATC TGAAATAGAT AGTTTTTTAA GTGAATCGAG TGACTCGAAA ATTGATGGAA GATTATTAAT CGCAAGTACT GATGGAATTG GAAAGAATGC TCAACAGGTA ATTAATCGTC AAGAAAAGCA AGTTGTCTGT TTTTTATTAG AACAGTTCAG ACAATCTGAA ATTGAATTTC CTTCATCAAT GGAAGACCTT AATCAAGGTA AAAGAAAAAA GAAAAAGACT CCTAGACCAC ATCAAATCGA AGCAATTGAA AAAGTTTCTG AAAGATTAAA AACTGCTGAT CGTGGTCAAG TTCTAATGGC ATGTGGAACT GGTAAAACTC TTACATCTTT ATGGATTAAA GAAAAAATGA AAGCGAAACA AGTTTTAGTT TTAGTTCCAT CTTTAAGTCT TCTTTCTCAA ACTCTGAAAG AATGGAATTC TGAAGCAAAT CATGACTTTA AGTGGATTTG TGTTTGCTCT GATAAATCAG TTGCAAAAGA TAAAAAGGAA GACGAATGGA TTTCAAATAC GTCAGAAATT GGGGTTCCAG TTACAAATGA TCCATTAGAA ATTAAACTTT TTTTAGATGA AAGTAGTCCT AAAGTTGTTT TCTCAACTTA TCAATCTGCA CAATTGATAG TAGAAGCACA AGAGCATCAT GACACCGATG ATTTTGATCT CGTAATTGCA GATGAAGCAC ATCGTTGTGC AGGAAAAGTA TCTGATTCTT TTGGTTCAGT ATTAGATGAA AGAAAAATAA AAGCATCAAA AAGATTATTC TTTACTGCTA CTCCAAGAAT TCTCTCAAAA CAAATTAAAA CACAAGCAAA AATTAATGAG ATTTCAGTTA TTTCAATGGA TGATAAATCT TTATTTGGAG ATATTTTTTA TCAACTTAAT TTCTCAAAAG CAATTGAAAA GAAACTTTTA TCTGATTATC AAGTTGTAGT GGTTGGAGTT GATGATCCTA TGGTTCATGA AAAAATAAAC AATCGAGACT TGGTTACAAT ACCTGATGAA TTAAATACTG ACGCAGAGAC TCTTGCAAGT CATATTGCAC TTGCAAAAGC AACAAAAGAT CATAAGTTAA ATAGATTAAT TACTTTTCAT AGTCGAATCA ATAGTGCTAG AGAATTTGCA CATCAGCATA ATAAAATCAT CAATTGGTTA TCACAATTTG AAAATTTTCA AAGTAATAAT TTTATTACGA ATATTTCGGG AGATATGCCC GCAAAAGAGA GAAATAAACG TATTAACAAA TTAAAAAATA TAGATGGAAA TGAATTAGGA ATTTTATGTA ATGCAAGGTG CTTATCAGAG GGTGTTGATG TTCCTAATCT TGATGGAATT GCTTTTATTG ATCCCAGAAA AAGTCAAATT GATATTATTC AAGCAGTTGG CAGGGCAATT CGGAAAAGTG AAGACAAATC AATTGGAACT ATCGTCATAC CAGTTTATCT TGCAGATATG GATAAACCAG AAGAAAAAAT ACTTGAATCA AAATTTAAGG ATGTTTGGCA AATAATTCTT GCTCTTAAAT GTCAAGATGA TTCTCTTCTC 
CAAACAATTG ATCTATTACG AGTAAATTTA GGAATCGATC AAAGACAAAC AGGAGGTAAA AGTGGTTTAG AAAAAATTAT TTTTGATCTA CCTAACAAAA TTAGTAAAAA TTTTGCTAAT TCTATTCAAA CTTTATTAAT AAGAAATACT TCCGATGATT GGTTAGAAAA ATTTGGAGAA TATAAGTCTT TTGTAGATAC TAATAATCTC ATGATTGCTA ATAGAGATCC TGCTTTTTTG AATTGGGTCA AAGATCAAAG AAAATTTAAA AATAAGGGTT TTTTATCCAA AGATCGAATC AATCTTTTAG ACAGTATAAA TTTTAATTGG AAACCAGATG AAGAGAACTG GGAGAATAAG TTAAAGCAAT TAAAAGAGTT TAAATTAAAA CATGGGCATG TTATTCCACC CCATAGGAGT GAAGTTGGTA GATGGTTACA TGGTCAAAAA AAATTATATA AGAACGGAAA ATTGCCAAAA AAATATATAA ATCTCTTAAA TAATTTGAAT ATTAATTGGG ATATCAAAAT TTCTGAAGGA TGGGATTCAA ATTTCGAAAA ATTAAAACTA TTCAAGTTAG AGCATGGTCA CTCTAATCCA CCAAAAGAAC ATTCACTATA TCTTTGGACA ATGTCAGAAA GAAGCAGAAG TAAAGGTAAA AATTATCCAA AAAAACGTTT GGAATTACTT AGAGATATTG GTTTTGTTTT TGACCTTAGG AAAGACTATT TCAATCAAAA GATTAAGGAT TTAAAAGAAT TTAAACTTAA ATATGGTCAT GCAAACCCTC CTCAGAGTGA CGAGGCACTT GGAAAGTGGG TAAATCGTTT GAGGAACGAT TATAAAAAAA ATAAATTATT GCAATCTGAA ATTAATCTTT TAGAGGAATT AGGATTTGTT TTTGATACCC AAAAAGAATT TTTAAATCGC AAAATTAAAG ATTTAAGAGA ATTTCAGTCA AGTAATGGTA ATACATTTCC GCCAAAATAT AGTCCTCTAG GGAAGTGGGT TACAAGACGA AGATTAGATT ATAAAAATGG GAAATTATCT AAAGAGATAA AAAATCTATT AGAGAGTATT AGAGGATGGG TTTGGGAAGA AAAAAAATCT CACGAATTCC CTAAAAGAAA ATAA
|
Protein sequence | MATFDEFYAS LDPDIGIRGK QFEKFVKWFL KTDPTWASQI DEVWLWNDYP KRWGADCGID LIFTQKNGKT WAVQSKCVSP NNDIKKSEID SFLSESSDSK IDGRLLIAST DGIGKNAQQV INRQEKQVVC FLLEQFRQSE IEFPSSMEDL NQGKRKKKKT PRPHQIEAIE KVSERLKTAD RGQVLMACGT GKTLTSLWIK EKMKAKQVLV LVPSLSLLSQ TLKEWNSEAN HDFKWICVCS DKSVAKDKKE DEWISNTSEI GVPVTNDPLE IKLFLDESSP KVVFSTYQSA QLIVEAQEHH DTDDFDLVIA DEAHRCAGKV SDSFGSVLDE RKIKASKRLF FTATPRILSK QIKTQAKINE ISVISMDDKS LFGDIFYQLN FSKAIEKKLL SDYQVVVVGV DDPMVHEKIN NRDLVTIPDE LNTDAETLAS HIALAKATKD HKLNRLITFH SRINSAREFA HQHNKIINWL SQFENFQSNN FITNISGDMP AKERNKRINK LKNIDGNELG ILCNARCLSE GVDVPNLDGI AFIDPRKSQI DIIQAVGRAI RKSEDKSIGT IVIPVYLADM DKPEEKILES KFKDVWQIIL ALKCQDDSLL QTIDLLRVNL GIDQRQTGGK SGLEKIIFDL PNKISKNFAN SIQTLLIRNT SDDWLEKFGE YKSFVDTNNL MIANRDPAFL NWVKDQRKFK NKGFLSKDRI NLLDSINFNW KPDEENWENK LKQLKEFKLK HGHVIPPHRS EVGRWLHGQK KLYKNGKLPK KYINLLNNLN INWDIKISEG WDSNFEKLKL FKLEHGHSNP PKEHSLYLWT MSERSRSKGK NYPKKRLELL RDIGFVFDLR KDYFNQKIKD LKEFKLKYGH ANPPQSDEAL GKWVNRLRND YKKNKLLQSE INLLEELGFV FDTQKEFLNR KIKDLREFQS SNGNTFPPKY SPLGKWVTRR RLDYKNGKLS KEIKNLLESI RGWVWEEKKS HEFPKRK
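The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with TTG and ends with TAA, so no reverse complement is applied even though the gene is on the minus strand) and uses the amino acid assignments and start codons of translation table 11. Under that table, an alternative start codon such as TTG is read as methionine at the first position. The helper name `translate_cds` is hypothetical.

```python
# Codon table for translation table 11 (same amino acid assignments as the
# standard code), built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a CDS given in coding orientation; spaces are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    residues = ["M"]  # any valid start codon is read as Met
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGCAACCT TTGATGAATT TTATGCTTCC"))  # MATFDEFYAS
```

The printed residues match the first ten residues of the protein sequence in the record.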