Gene Information |
Locus tag | P9303_24101 |
Symbol | |
ID | 4777689 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2120842 |
End bp | 2123019 |
Gene Length | 2178 bp |
Protein Length | 725 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640087931 |
Product | hypothetical protein |
Protein accession | YP_001018408 |
Protein GI | 124024101 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
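The coordinate and length fields in this record can be cross-checked with simple arithmetic. The snippet below is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2120842
end_bp = 2123019
reported_gene_length = 2178      # "Gene Length", in bp
reported_protein_length = 725    # "Protein Length", in aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
codon_count = gene_length // 3
expected_protein_length = codon_count - 1

print(gene_length == reported_gene_length)
print(gene_length % 3 == 0)
print(expected_protein_length == reported_protein_length)
```

Under these assumptions all three checks print `True`. The strand field (`-`) does not affect the length arithmetic; it only indicates that the gene sequence is read from the reverse complement of the replicon.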
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCAAG AGGAGATCAT GCAGCAGCTG CAGGTTGCAG TGCAGGCATA TCAGAGCAAA GATCTTGATG GCGCTGAGGC AGTTTTTAAG CAGATTCTTG CTGTAAACCC AAAAGAGCCC AACGCACTGC ATCTTCTTGG TTGTATTTAC AAAGATCGTG GACAACACCA GCAGGCTGTT GAGTTAATTC AGGCGTCTAT TCGAGAAGAT GAGAGTAATC CAATTCCATT CTTTAATCTT GGCAAGATCC TTGCTATCGC TGGTCAGCAT GAGAATGCAG TGGGCGTCTT CCAGGAGGCA TTGAAGAGAA ACCAGCAGAT CCCTGAAACG TGGTTTTGCT TTGCCAATGC TCTGAGGGAG ATTGGGAAAA CAGAGGAAGC AAAGCAGGCA TATCGAAATG CACTGCAATT AAATCCTGCA CATGCTGGAG CAGCAGGAAA TTTAGGAGCA CTGCTCACTG ATGATGGTGA GTTGGATGAG GCTGAGCAAT TATTTGTAAA GGCTGTAGAC CAGTATCCTA ATAATGTGAA TCTCAGAATT AATTATGGCA GGTTGTTGGC TGAGAAAGCG GAGCATGCTG CGGCAATCAT GCAGTATCAG ATTGCTTTGC CTTTGGCTCC TCAGTCTCCT GAGTTGCACT ACAACTTTGC AAATGCATTG AAGGAAGAGG GGGATGTTGA GGAAGCGATC GCAAGTTATC GGAAGGCGAT TGAGGTGAAG CCCGATTTTG CAGATGCTTA TTTTGCTCTT GGGTTGGTGA TGAAGGAAGA GGGGGATGTT GAGGAAGCGA TCGCAAGTTA TCGGAAGGCG ATTGAGGTGA AGCCCGATTT TGCAGATGCT TATTTTGCTC TTGGGTTGGT GATGAAGGAA GAGGGGGATG TTGAGGAAGC GATCGCAAGT TATCGGAAGG CGATTGAGGT GAAGCCCGAT TTTGCAGATG CTTATTTTGC TCTTGGGTTG GTGATGAAGG AAGAGGGGGA TGTTGAGGAA GCGATCGCAA GTTATCGGAA GGCGATTGAG GTGAAGCCCG ATTTTGCAGA TGCTTATTTT GCTCTTGGGT TGGTGATGAA GGAAGAGGGG GATGTTGAGG AAGCGATCGC AAGTTATCGG AAAGCGATTG AAGTGAAGCC CGATTTTGCA GATGCGTATT TGAATCTGGG TAATGTGTTG AAGGAAGAGG GGGAGATAGA TGAGGCTAGG CAAATCATTA CTACTCTTCG ACAGATGAAG TCTTTCGAGA AGGAGACTTG GACCAGGATT CAAGATAAGA CTTTGGTTTT TGATTGGCAT CACAGAAGAG CTTTGGGGCT TCTCTGGCAG GTCGAGCTTG CTGCCTTTTC TGGAATGGAG CCCGCCTCTT GTCTTCCTGC TGTAAAGCAA GCCGATGCTC TTTGCTTTCC TCCTTCATTC TTGGGAGATG AGATTTCTTG TGAAGATGGG AAGCTGTTGT ATGAAAAGGG ATATTTGGTT GAAGAAGGTC TGGTTTCGCC AGGTTTATGT GCCCAACTGA TCAGTCAATT CAATAATGGG AAAGCGCCGA TGAGTGCTGC GCTAATCGAA TCAGTTGAGG TTAATGGTAT TTTGCGATTT GCTTTGGAAA GAATATTTCT GCATACGGGT TTCCCGCATT TGATTTGGAA TTGCATTTAC TTTGCCAAAG GACCCGACGA TGAAACGGTA TCTGATACTT GGCATTACGA TAATCATTAC AATGCTTGGA CTCCAAAGTT GATGATATAC CTTAACTCTC AGGGTGAAGA GTGTGGTGCC ACTGAATTTG TTGATGCAGG TCTCTCGCGG 
AAAATCTCTG AGAAATCAGA CTATATGGGT CTAGTTTGGC AGCGTAAGTC TTATCCGGAT ATGGTGAAGG ACTTAGTTGA AGATCTAAAT CTCGATCCTG TCTCATTAGA TCCTGAGCAT TACACCTTCT CGCCAGATCA TGTTGGATCG GGTGTGTGGT TTTGTCCTGC CAGGGCTCTG CATCGAGGTG TTAGTCCTAA GAAAGGTCTG AGACATGTCC TTTCTTTTTC ATTGACACCT TTGCCTAAGG ACTGTGGCTG GAGTGTTGAC CAGTGTGAGG AGAAATCAAT TGAAATTCTG AAGGATAAAA TTGAAAAGGG TATGCAAAAG ACTGATGTCA ATCCTTATTG GATGTTGTCT GAGGTTAGGT CTGCTTAA
|
Protein sequence | MNQEEIMQQL QVAVQAYQSK DLDGAEAVFK QILAVNPKEP NALHLLGCIY KDRGQHQQAV ELIQASIRED ESNPIPFFNL GKILAIAGQH ENAVGVFQEA LKRNQQIPET WFCFANALRE IGKTEEAKQA YRNALQLNPA HAGAAGNLGA LLTDDGELDE AEQLFVKAVD QYPNNVNLRI NYGRLLAEKA EHAAAIMQYQ IALPLAPQSP ELHYNFANAL KEEGDVEEAI ASYRKAIEVK PDFADAYFAL GLVMKEEGDV EEAIASYRKA IEVKPDFADA YFALGLVMKE EGDVEEAIAS YRKAIEVKPD FADAYFALGL VMKEEGDVEE AIASYRKAIE VKPDFADAYF ALGLVMKEEG DVEEAIASYR KAIEVKPDFA DAYLNLGNVL KEEGEIDEAR QIITTLRQMK SFEKETWTRI QDKTLVFDWH HRRALGLLWQ VELAAFSGME PASCLPAVKQ ADALCFPPSF LGDEISCEDG KLLYEKGYLV EEGLVSPGLC AQLISQFNNG KAPMSAALIE SVEVNGILRF ALERIFLHTG FPHLIWNCIY FAKGPDDETV SDTWHYDNHY NAWTPKLMIY LNSQGEECGA TEFVDAGLSR KISEKSDYMG LVWQRKSYPD MVKDLVEDLN LDPVSLDPEH YTFSPDHVGS GVWFCPARAL HRGVSPKKGL RHVLSFSLTP LPKDCGWSVD QCEEKSIEIL KDKIEKGMQK TDVNPYWMLS EVRSA
|
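The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the database's own method. It relies on translation table 11 using the standard codon assignments while also accepting alternative start codons such as GTG, which are read as methionine when they occupy the first position. The function names `translate_cds` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence as displayed in the record.
print(translate_cds("GTGAATCAAG AGGAGATCAT GCAGCAGCTG"))  # MNQEEIMQQL
```

The printed prefix matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.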