Gene Information |
Locus tag | OSTLU_14940 |
Symbol | PAFC3501 |
ID | 5001313 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | + |
Start bp | 216029 |
End bp | 219379 |
Gene Length | 3351 bp |
Protein Length | 1059 aa |
Translation table | |
GC content | 59% |
IMG OID | 640416734 |
Product | predicted protein |
Protein accession | XP_001417181 |
Protein GI | 145345359 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0306125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCCGC GAAACGCGGT CGCCGTCGCG CTGCTGAACG ACCCGAACGA ATGCGTCGTC GTCGACCTCG ACGCGCTGCC GACGGACGTC GAGGACGTCG CGAGCGTCCT CCAGGCGGAG CTCGCGCCGC TCGAGGCGTG GATCGAGGTG ACGGAGGCGT ACCTCAGGCG AGGCGACGCG CGAGGGTTCG AGACGCTGAT GGAGATGGTG TGCGCGCCGG GTGCGTGGAC GCGCGCGCCG CGACCGACGG GACGCGACGC GCGCGCGAAC GAACGAACGA ACGAACGAAA GCGCGCGAAC GTTTGCGCGA TCGAGCGATG ATGGATCGAC GGGCTCAACG GCGTCGACCG AACGCGGTGA CTGACGAGAC CGCGCGCGCG GACGACGATA GAGATCGAGG AGGTGTATAG GGAACAAGCG TACGGACGCG CGGCGGTGCT GTGCCTGTAC GCGTCGTATT GGGCGAATAG GGCGGCGCGG GAGACGGACG CGGTGAGCCG AGAGGAGGGG TTCATCAAGG CGGGCGCGTA CTTGAATCAA GCGGCGGGAA TTCACAGGAA ACCGGAACAG ATCGTGGCGA TCGGACGCGC GCACCTGGCG TTCGCGCGCG GGGCGACGGG AGAGGGCGAG AAACTCATCG ACCAGGCGCT CGGGCTCAAG GACGACGGTA GGGATAACAT CACGCCGATG CTGTGGAAGG CGGTTTTATT GTATAAGAAG GAGCGCTACC AAGACGCGTT GACGTGGTAC AAACGCGCAC TTCGCGCGTT TCCTTCCGCG CCCGCACCGG TGCGTTTGGG GATCGGGGCG TGCCAATACA AGCTGGGCGA CTTCAAGACG GCGAAGCTTG CGTTTGCGCG CGTGCTCAAG TTGGATGAGC GCAACGTCGA AGCCATGCTC GGTTTGGCGC TGTGCGAATT GAGCTTGCAT GACATTCGTA GTCAGCAACA CTTGGATTCC GTAGCTGCGG CCATGCGCCT GCTCGAGCGC GCTTTCATGC ACGACCCTCA CAACCAAGCG GTGAACAACG TCATCTCGGA CAATTTGCTC ATGGCGGATG ATTACGAAAA AGTAGAGAAA CTCACGCGAC TGGCGCTTCA GAACAACGCG GAGACGCCGA GGAACCGAGC GAAGGCGGCG TTTAACCAAG CGCGCGCCTT GCACGCCCGA GGACAGGTGC CTCAAGCGCA AGCGTTGTAC TTGACGGCGA CGAACCTCGA CGAACACTAC GTGCCGCCGT ACTTTGGTTT GGGTCAAATC GCCCTTGCAA AGGGTGATGT GAAGTTAGCG TGGAACTACA TGGATAAAGC ACACGGGGAA TTCGGAGAGT CCATGACCGT GACTAGAATG TTTGCCCATT TATGCGCGTC GACTGGTAGA AGCGAGCAGG CGGCGGAGAT GTTCCGCGAA GTAGTGAAAC AGGGTGGAAA CGACGTCGAC GCGATGTTAG AACTCGGAGA ACTCTTAGAG ACTCAAGATC CGAAGGCTGC GTTGAAGGCT TACAGTGCCG CCCTGAAAAT GCTAGCTGCC AAGGGTGAAG AAGGGCCAAT TACCGCCATC AAGAACAACA TCGGCGTGTT GAACGTTCAG TTGGGCAAAT TCGACGAGGC GCGCGAGGCG TTCACCGAGG CGCTGCAGGC GCTCGGTGGC GATGCCGATC AACTCGAGGG CAAACTGAAA GGTGCTAAGG CGAAGAAGGC GTTACAGCCA GGCGTTGCGC CCATCGCGTT CAACTTGGCT TTATTAGAGG AACAGCAAGG AAACAACGCC GCGGCGGAGG CACGATACGA CGCCATCCTC 
GCCGCGCAAC CGGATTACAT TGATTCCATC TTGCGTCAAG CGAAAATTCG GGCCGAACGG GGCGATTACG ACATGGCGCT CGAACGCACG AATGAAGCCA TTGCGGCGAA AAGTGATAGT GCCGATGCCC TCGCCCTCGC TGGTTGGGTT TTGCTCAAGG CAAAGCGATG GAGTGAGGCG GAGCAACAAT TCGCCGCGCT GCGCAACTTG CCCAAACCCG ACGCCGCGGC GAATGCGAAG GAAAAAACAT TGACGCACGA CGAGTATGCG ATGGTAAGCG CGGCGAACGC GGCGTATTAC AGCGCTATCA AAGAGGGTGT GCTCAAGCGA AACGATCCCA AGGTGCTCAA GCGCGAGGAG GAGCACTACG AACGGGCTTA CTCGTTGTTC CAAAAGACGC TCCAGAAGAA TGGTTCCAAC GTCTACGCCG CTAATGGTCT CGGGATTATT CTTGCTGAGC GAGGGCGAAT CGACGAAGCC AAGACGGTGT TTCAGATCGT GCAAGAAGGC ATGGCTGCGA AAGGCTCAAT CAACCCGGAT ATTCTCATCA ACCAAGGTCA CGTGTACTTG GCCAAGGCGC AATATGTGCA GGCGAGCAAG CTTTACGAGC GCGCTCAAAG CCAGTTCTAC TTTAACCAGA ACGAAAATGT CATGCTTTAT CAAGCGAGGG CGCATTATGA GAATGGCAAC TTGGAAGAAG CGCGCAAGAT TCTTCGCAAA GCGTTGTTAA TCGCGCCCTG GAACCACAGA ATTCGATTCA ACTTGGCGTA CGTCATTCAA GAGATGGCGC AGCGCACGTT GAACAGAACG ATGAAGAGCA CGAGCTCGGA CGGACGTTTG GCGCAAGTGG AAAGTGCGAT AGAAGACTTG ACCACTGCGC TCAAGCTCTT CGAGCAATTG CAAACGCTTG GGAACCAGGC GGAGTTTGGT TTCGACGCCA AACGCACGAG CGTTCACGTG TCCTTCTGCA AGCAAGCGCT CACCAAGTCC AAACCGCACT TGGAAGCGGC GCAGAAGGAA GAGGCTTCGA TCTCTGCAGC GAAAAACGCG CAACTCACCG CGCGCCGCGC GATCGAAGAA GGCCGCGCCG CCCAAAAAGC TGCCGAAGAA CTCGCCAAGG AGACGCACGC CAAGGAACTC GAAGCCATCG CTGCGCAGTC TGAGCGTCGC TTCAAAGAAA GCCAAGCGCG ATGGATGTCA GAACAAGCTG TCGAGCGACC GACGAAGAAA GGTGCCAAGG GTCTCGGTGC GGCTCCTGTC GGTGAGGCCA CGAGCGATCT TTCCGAAGAC GATGACGAGC CCGCACCCGA GACGCGCGCG CCGCCCACTG CCGAGGAACT CGCGCGTCAG AAAGAAGCCC TCGCGGCGGC CGGTTTAGCC GACAGCGACG ACGAAGACGA AGACGAAGAC GAAGACGCGC AACCGAGCGC CGACGTCGAA GCACCGGCAG AAAAGAAGCG TTCCGCTGAC GAAACTGACG AAGCCCAAGC TGAAGCCGCC GCTCCCAAGC GTCGCCGACG CGCCGTCGTC GACGACGACG ACGACGAGTG A
|
Protein sequence | MSPRNAVAVA LLNDPNECVV VDLDALPTDV EDVASVLQAE LAPLEAWIEV TEAYLRRGDA RGFETLMEMV CAPEIEEVYR EQAYGRAAVL CLYASYWANR AARETDAVSR EEGFIKAGAY LNQAAGIHRK PEQIVAIGRA HLAFARGATG EGEKLIDQAL GLKDDGRDNI TPMLWKAVLL YKKERYQDAL TWYKRALRAF PSAPAPVRLG IGACQYKLGD FKTAKLAFAR VLKLDERNVE AMLGLALCEL SLHDIRSQQH LDSVAAAMRL LERAFMHDPH NQAVNNVISD NLLMADDYEK VEKLTRLALQ NNAETPRNRA KAAFNQARAL HARGQVPQAQ ALYLTATNLD EHYVPPYFGL GQIALAKGDV KLAWNYMDKA HGEFGESMTV TRMFAHLCAS TGRSEQAAEM FREVVKQGGN DVDAMLELGE LLETQDPKAA LKAYSAALKM LAAKGEEGPI TAIKNNIGVL NVQLGKFDEA REAFTEALQA LGGDADQLEG KLKGAKAKKA LQPGVAPIAF NLALLEEQQG NNAAAEARYD AILAAQPDYI DSILRQAKIR AERGDYDMAL ERTNEAIAAK SDSADALALA GWVLLKAKRW SEAEQQFAAL RNLPKPDAAA NAKEKTLTHD EYAMVSAANA AYYSAIKEGV LKRNDPKVLK REEEHYERAY SLFQKTLQKN GSNVYAANGL GIILAERGRI DEAKTVFQIV QEGMAAKGSI NPDILINQGH VYLAKAQYVQ ASKLYERAQS QFYFNQNENV MLYQARAHYE NGNLEEARKI LRKALLIAPW NHRIRFNLAY VIQEMAQRTL NRTMKSTSSD GRLAQVESAI EDLTTALKLF EQLQTLGNQA EFGFDAKRTS VHVSFCKQAL TKSKPHLEAA QKEEASISAA KNAQLTARRA IEEGRAAQKA AEELAKETHA KELEAIAAQS ERRFKESQAR WMSEQAVERP TKKGAKGLGA APVGEATSDL SEDDDEPAPE TRAPPTAEEL ARQKEALAAA GLADSDDEDE DEDEDAQPSA DVEAPAEKKR SADETDEAQA EAAAPKRRRR AVVDDDDDE
|
| |
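The length fields in this record can be cross-checked against each other with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes 1-based inclusive coordinates for Start bp and End bp, and assumes the coding sequence includes the stop codon. The function names (`gene_length`, `cds_length`, `gc_fraction`) are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 216029
END_BP = 219379
PROTEIN_LENGTH_AA = 1059


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue, plus an optional stop codon)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters (such as the spaces that
    separate the 10-base blocks in the record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


span = gene_length(START_BP, END_BP)      # genomic span of the gene
coding = cds_length(PROTEIN_LENGTH_AA)    # coding bases, stop codon included
non_coding = span - coding                # bases not accounted for by the CDS

print(f"gene span: {span} bp")
print(f"coding sequence: {coding} bp")
print(f"non-coding remainder: {non_coding} bp")
```

Under these assumptions the span matches the listed Gene Length of 3351 bp. The remainder not accounted for by the coding sequence is consistent with the record marking the gene as spliced. `gc_fraction` can be applied to the Gene sequence field to compare against the listed GC content.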