Gene OSTLU_14940 details

Gene Information

Locus tag: OSTLU_14940
Symbol: PAFC3501
ID: 5001313
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 216029
End bp: 219379
Gene Length: 3351 bp
Protein Length: 1059 aa
Translation table:
GC content: 59%
IMG OID: 640416734
Product: predicted protein
Protein accession: XP_001417181
Protein GI: 145345359
COG category: [N] Cell motility
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
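
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length does not count the stop codon. The helper names `gene_length` and `coding_length` are hypothetical and do not belong to any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 216029
END_BP = 219379
PROTEIN_LENGTH_AA = 1059


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein.

    Assumption: three bases per residue, plus one stop codon if requested.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


span = gene_length(START_BP, END_BP)      # genomic span of the gene
cds = coding_length(PROTEIN_LENGTH_AA)    # coding bases including the stop codon
noncoding = span - cds                    # bases in the span that are not coding

print(f"span={span} bp, coding={cds} bp, non-coding={noncoding} bp")
```

Under these assumptions the span equals the listed gene length of 3351 bp. The coding sequence accounts for 3180 bp, leaving 171 bp, which would be consistent with the gene being marked as spliced.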


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0306125
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCCGC GAAACGCGGT CGCCGTCGCG CTGCTGAACG ACCCGAACGA ATGCGTCGTC 
GTCGACCTCG ACGCGCTGCC GACGGACGTC GAGGACGTCG CGAGCGTCCT CCAGGCGGAG
CTCGCGCCGC TCGAGGCGTG GATCGAGGTG ACGGAGGCGT ACCTCAGGCG AGGCGACGCG
CGAGGGTTCG AGACGCTGAT GGAGATGGTG TGCGCGCCGG GTGCGTGGAC GCGCGCGCCG
CGACCGACGG GACGCGACGC GCGCGCGAAC GAACGAACGA ACGAACGAAA GCGCGCGAAC
GTTTGCGCGA TCGAGCGATG ATGGATCGAC GGGCTCAACG GCGTCGACCG AACGCGGTGA
CTGACGAGAC CGCGCGCGCG GACGACGATA GAGATCGAGG AGGTGTATAG GGAACAAGCG
TACGGACGCG CGGCGGTGCT GTGCCTGTAC GCGTCGTATT GGGCGAATAG GGCGGCGCGG
GAGACGGACG CGGTGAGCCG AGAGGAGGGG TTCATCAAGG CGGGCGCGTA CTTGAATCAA
GCGGCGGGAA TTCACAGGAA ACCGGAACAG ATCGTGGCGA TCGGACGCGC GCACCTGGCG
TTCGCGCGCG GGGCGACGGG AGAGGGCGAG AAACTCATCG ACCAGGCGCT CGGGCTCAAG
GACGACGGTA GGGATAACAT CACGCCGATG CTGTGGAAGG CGGTTTTATT GTATAAGAAG
GAGCGCTACC AAGACGCGTT GACGTGGTAC AAACGCGCAC TTCGCGCGTT TCCTTCCGCG
CCCGCACCGG TGCGTTTGGG GATCGGGGCG TGCCAATACA AGCTGGGCGA CTTCAAGACG
GCGAAGCTTG CGTTTGCGCG CGTGCTCAAG TTGGATGAGC GCAACGTCGA AGCCATGCTC
GGTTTGGCGC TGTGCGAATT GAGCTTGCAT GACATTCGTA GTCAGCAACA CTTGGATTCC
GTAGCTGCGG CCATGCGCCT GCTCGAGCGC GCTTTCATGC ACGACCCTCA CAACCAAGCG
GTGAACAACG TCATCTCGGA CAATTTGCTC ATGGCGGATG ATTACGAAAA AGTAGAGAAA
CTCACGCGAC TGGCGCTTCA GAACAACGCG GAGACGCCGA GGAACCGAGC GAAGGCGGCG
TTTAACCAAG CGCGCGCCTT GCACGCCCGA GGACAGGTGC CTCAAGCGCA AGCGTTGTAC
TTGACGGCGA CGAACCTCGA CGAACACTAC GTGCCGCCGT ACTTTGGTTT GGGTCAAATC
GCCCTTGCAA AGGGTGATGT GAAGTTAGCG TGGAACTACA TGGATAAAGC ACACGGGGAA
TTCGGAGAGT CCATGACCGT GACTAGAATG TTTGCCCATT TATGCGCGTC GACTGGTAGA
AGCGAGCAGG CGGCGGAGAT GTTCCGCGAA GTAGTGAAAC AGGGTGGAAA CGACGTCGAC
GCGATGTTAG AACTCGGAGA ACTCTTAGAG ACTCAAGATC CGAAGGCTGC GTTGAAGGCT
TACAGTGCCG CCCTGAAAAT GCTAGCTGCC AAGGGTGAAG AAGGGCCAAT TACCGCCATC
AAGAACAACA TCGGCGTGTT GAACGTTCAG TTGGGCAAAT TCGACGAGGC GCGCGAGGCG
TTCACCGAGG CGCTGCAGGC GCTCGGTGGC GATGCCGATC AACTCGAGGG CAAACTGAAA
GGTGCTAAGG CGAAGAAGGC GTTACAGCCA GGCGTTGCGC CCATCGCGTT CAACTTGGCT
TTATTAGAGG AACAGCAAGG AAACAACGCC GCGGCGGAGG CACGATACGA CGCCATCCTC
GCCGCGCAAC CGGATTACAT TGATTCCATC TTGCGTCAAG CGAAAATTCG GGCCGAACGG
GGCGATTACG ACATGGCGCT CGAACGCACG AATGAAGCCA TTGCGGCGAA AAGTGATAGT
GCCGATGCCC TCGCCCTCGC TGGTTGGGTT TTGCTCAAGG CAAAGCGATG GAGTGAGGCG
GAGCAACAAT TCGCCGCGCT GCGCAACTTG CCCAAACCCG ACGCCGCGGC GAATGCGAAG
GAAAAAACAT TGACGCACGA CGAGTATGCG ATGGTAAGCG CGGCGAACGC GGCGTATTAC
AGCGCTATCA AAGAGGGTGT GCTCAAGCGA AACGATCCCA AGGTGCTCAA GCGCGAGGAG
GAGCACTACG AACGGGCTTA CTCGTTGTTC CAAAAGACGC TCCAGAAGAA TGGTTCCAAC
GTCTACGCCG CTAATGGTCT CGGGATTATT CTTGCTGAGC GAGGGCGAAT CGACGAAGCC
AAGACGGTGT TTCAGATCGT GCAAGAAGGC ATGGCTGCGA AAGGCTCAAT CAACCCGGAT
ATTCTCATCA ACCAAGGTCA CGTGTACTTG GCCAAGGCGC AATATGTGCA GGCGAGCAAG
CTTTACGAGC GCGCTCAAAG CCAGTTCTAC TTTAACCAGA ACGAAAATGT CATGCTTTAT
CAAGCGAGGG CGCATTATGA GAATGGCAAC TTGGAAGAAG CGCGCAAGAT TCTTCGCAAA
GCGTTGTTAA TCGCGCCCTG GAACCACAGA ATTCGATTCA ACTTGGCGTA CGTCATTCAA
GAGATGGCGC AGCGCACGTT GAACAGAACG ATGAAGAGCA CGAGCTCGGA CGGACGTTTG
GCGCAAGTGG AAAGTGCGAT AGAAGACTTG ACCACTGCGC TCAAGCTCTT CGAGCAATTG
CAAACGCTTG GGAACCAGGC GGAGTTTGGT TTCGACGCCA AACGCACGAG CGTTCACGTG
TCCTTCTGCA AGCAAGCGCT CACCAAGTCC AAACCGCACT TGGAAGCGGC GCAGAAGGAA
GAGGCTTCGA TCTCTGCAGC GAAAAACGCG CAACTCACCG CGCGCCGCGC GATCGAAGAA
GGCCGCGCCG CCCAAAAAGC TGCCGAAGAA CTCGCCAAGG AGACGCACGC CAAGGAACTC
GAAGCCATCG CTGCGCAGTC TGAGCGTCGC TTCAAAGAAA GCCAAGCGCG ATGGATGTCA
GAACAAGCTG TCGAGCGACC GACGAAGAAA GGTGCCAAGG GTCTCGGTGC GGCTCCTGTC
GGTGAGGCCA CGAGCGATCT TTCCGAAGAC GATGACGAGC CCGCACCCGA GACGCGCGCG
CCGCCCACTG CCGAGGAACT CGCGCGTCAG AAAGAAGCCC TCGCGGCGGC CGGTTTAGCC
GACAGCGACG ACGAAGACGA AGACGAAGAC GAAGACGCGC AACCGAGCGC CGACGTCGAA
GCACCGGCAG AAAAGAAGCG TTCCGCTGAC GAAACTGACG AAGCCCAAGC TGAAGCCGCC
GCTCCCAAGC GTCGCCGACG CGCCGTCGTC GACGACGACG ACGACGAGTG A
 
Protein sequence
MSPRNAVAVA LLNDPNECVV VDLDALPTDV EDVASVLQAE LAPLEAWIEV TEAYLRRGDA 
RGFETLMEMV CAPEIEEVYR EQAYGRAAVL CLYASYWANR AARETDAVSR EEGFIKAGAY
LNQAAGIHRK PEQIVAIGRA HLAFARGATG EGEKLIDQAL GLKDDGRDNI TPMLWKAVLL
YKKERYQDAL TWYKRALRAF PSAPAPVRLG IGACQYKLGD FKTAKLAFAR VLKLDERNVE
AMLGLALCEL SLHDIRSQQH LDSVAAAMRL LERAFMHDPH NQAVNNVISD NLLMADDYEK
VEKLTRLALQ NNAETPRNRA KAAFNQARAL HARGQVPQAQ ALYLTATNLD EHYVPPYFGL
GQIALAKGDV KLAWNYMDKA HGEFGESMTV TRMFAHLCAS TGRSEQAAEM FREVVKQGGN
DVDAMLELGE LLETQDPKAA LKAYSAALKM LAAKGEEGPI TAIKNNIGVL NVQLGKFDEA
REAFTEALQA LGGDADQLEG KLKGAKAKKA LQPGVAPIAF NLALLEEQQG NNAAAEARYD
AILAAQPDYI DSILRQAKIR AERGDYDMAL ERTNEAIAAK SDSADALALA GWVLLKAKRW
SEAEQQFAAL RNLPKPDAAA NAKEKTLTHD EYAMVSAANA AYYSAIKEGV LKRNDPKVLK
REEEHYERAY SLFQKTLQKN GSNVYAANGL GIILAERGRI DEAKTVFQIV QEGMAAKGSI
NPDILINQGH VYLAKAQYVQ ASKLYERAQS QFYFNQNENV MLYQARAHYE NGNLEEARKI
LRKALLIAPW NHRIRFNLAY VIQEMAQRTL NRTMKSTSSD GRLAQVESAI EDLTTALKLF
EQLQTLGNQA EFGFDAKRTS VHVSFCKQAL TKSKPHLEAA QKEEASISAA KNAQLTARRA
IEEGRAAQKA AEELAKETHA KELEAIAAQS ERRFKESQAR WMSEQAVERP TKKGAKGLGA
APVGEATSDL SEDDDEPAPE TRAPPTAEEL ARQKEALAAA GLADSDDEDE DEDEDAQPSA
DVEAPAEKKR SADETDEAQA EAAAPKRRRR AVVDDDDDE