Gene Information |
Locus tag | OSTLU_13904 |
Symbol | NFF3501 |
ID | 4999610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | + |
Start bp | 798414 |
End bp | 801191 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | |
GC content | 55% |
IMG OID | 640415031 |
Product | predicted protein |
Protein accession | XP_001415603 |
Protein GI | 145340998 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
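The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 798414
end_bp = 801191
gene_length_bp = 2778
protein_length_aa = 925

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and does not encode an amino acid.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # 2778 0 925
```

Under these assumptions the reported Gene Length and Protein Length agree with the coordinates.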
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTCGC CGACGCGCGC GTCTAAGTCG CCGACGCGCG CGGTAGCGAC AATCGACTTG ACGAATTCAA ATTTCGGTCC GAACGCGACG ACGACGACGA CGAGTAAGAA GCGCAAGGTG GACGCGGCGA AGGAGGCGGA GAAGCTCGCC AAGGCGCAGG CGGCGCAGGC GGCGAAGGAG GCGAAGGAGG CGGCCAAGGC GGAGAAGGCG CGACGCCGCG AGGAGGAGAA GGCGGCCAAG GAAGCGGCCA AGGCGGAGAA GGAACTAGAA ATGGCGAGGG CGAAGGCGGC GAAGGAAGCG GCCAAGGTGG AGAAGGAACT AGAAATGGCG AGGGCGAAGG CGGAGAAGGA AGCGGCCAAT GCGGAGAAGG AACGCGAAGC GGCGGCGCTG AAGGCAGAGA AGGCGAAATC GAAGGCGGAG AAGGAGGCGG CGGAGAAGGC GAAATTAGAG GAGAAGCGGC GACTCGAGGC GAAAAAGGCC AAGGAAGCGA ACGTTTTCGC GCAATTTTTT GTGAAGTCGC CAGCGGTGAA ATCGAAACCC GTGGTGACGC CGGAAGTGCG AACGCCGGAG GTTTCGCGAG AGGTGCGGGA AAAGTTGGAT GATATCGTGC GCGCGGAGGA TGCCGTCGAG GATATGGACG CGATTCGTGA AACGTCTTTG AAACGTTGGA AGACCAAGCG TAAAGAATGT AGAATTGAGA AACGATGGGG TGCGCGACGC ATCCAACGGG ACGTCGAGGT GACATCTGTG CTTTGCTTTT CTTTGCTGAG TAAGCGCAAG CGCGACGATG ACGGGATGGT TCACAAGAGT GCGCGTCAAC GCCGCTTGTT TGATATCGAT GTCGCCTTAT ACGAGCGCCC CGCATTTTGG GGAACTGGAC CGTTTCCAAA CCGTCCGGCG AACGCATCCG TTGTGACTGG AAGAAGGCCG TTCGGACAAG AAAAAGATGT TGATTACGAA TATGATTCCG CGGAGGAATG GGAAGGCGCT GACCAAGGCG AAAGCCTTTC GGATGAAGAC ATCGACGAAG AAGACGACAT GCCGCAGGCT TCAGATGATG AGGACGATGG TTTTATCGCA GGCGACGACG AGATGGCCGA TGAACCCAGG CATTTCGACG CTGCAGCTAT TGGCGATGAT GCGGAAATGA CGCAAAAGCG AAGCACGATG GCGATGCTTG CCAACCGTTC ACGCCGCTCG GCGGCTCCGC TCGTGATTTC TAAATTAGCC ACGACAGTTG CTGAAAATGG CGGAGAATCA AGTTTGTTAC GCCTGTTCGC GTTGGAGGCG CCGTTTTCGA ACGCGCCGCG CATTAGCCTC AAAGTGTCAA CTTCACAGCC GGCGTCGGCG GTGAAAGCGT CGAAGACGGT TTCGAAAGCG GCCAAGACTG CCACAAAAAA GAGTGCGGAC GACATACTTC AGGAAAATCT CCGAATGCTG GTCATCTTCC TTTTGAGAAA CCCGTCGTTG AAAGTGAACC AGGTGAAAGC CAAGTTCCTC GAAGAAGCTT GTCACTCAAT CGCCGGTTTG AATCACTCGG CGGTGAAACG AAAGATCATG GAAATTGCGA CGCATGTGTC GAATCGCTGG ATCATTACGG AGAAGGCTAT GCAAGACGCC GGTGTGAGTG ACGAAGAGGT TGCAGAGCTG CGAGCTAACG CTGTCGTTCC AGCAAAGAGT ACGGTGAAAA AACGCAAAAT CGAGCAAGGC GATGGCGCCG CTGCGGCTCC ACGAACTTTG GAGACATTCT TCGGCAAAAC AGAGCAGGAA AAACTTCCCA CCGCGACGGA CGCGCCAATC 
TGGAATCTCG CGATTGCGAG TATGTCGAGG TCGAAAGACT CCAAAGGGAT GTTCCGGGAT GATTACAAGC ACATTTTTGA CGAGTCGAAC TTGAAAGCGT GCGTTGAACA GGGGGTTGTT CCCGATTCGT TTGTTTCATG TCTGATCAGA AGCGTCGGCG CCAAATCGCA AAAGACGGCA TTCCGTATTG CGTGTGAGAA ATTGCTCGTG GTTGTGTTTC GCACGCTCGG GAACGGCCAA AATGGTGTTT CGGAGCGCAA GCTCGTCACA CCAGCCCGTG CGAGCGTCGA TGCCGCTTGC GGTGGAGATG CGCTCGTGAG CGCGATAACT TCTTGCATAG AAAGTGGACA GGAGTCGTTG AAACTCGCGG CATTGGAGAT AATGGATGCG CTTTTGTGTG ACCCTACGGC GGGAATGAAA TTCGTGACCA ATCGCATGTT CTCCAAACAA GTGATGCAGC TCATGATTGA TGCGCTGGGC AGACGCAACG AAGAGTTTTC CGTTCTCGCC ATGCGCATTC TCTGCCGCGC CCTCGGCAGT CAAACTGCGA TTGAGACGTG CTCTGCCTTA CGCCCGGAGG AGTTCCTGAC GTTATCGCGA GAACTTGCAA AGGGCATACG TTACCAGTCG CCGGACATTC GCGACAATGT CCGCCACGTC TCGTCCGGTT TGAACTTTCT CGAGAAATGT TTCTCGATGG AAGCCTGGAC GAGCGCGGTC GACCGCAACG CTTGCCGCAA AACTCTCGTC ACCGTGTTGG GTGCGTGCAT CGTACATCGC GAAGACGCCC TCGATTGGAC GAACGAAGTC ACGACTGTTT TGCTTAATAT TTTGATCGGA ACGATGGAGG TGCTTGAAAT TACGTCCGAG GACAAGATAC AAATGCGGAA CGCTCTCAAC GCGCTGTGGA CGTCGAACGA TGGGCAAGTC AAGCAACTCG TCGAGAAGGT CCAAAACGCA CTAGAAAATG TCTGTTGA
|
Protein sequence | MPSPTRASKS PTRAVATIDL TNSNFGPNAT TTTTSKKRKV DAAKEAEKLA KAQAAQAAKE AKEAAKAEKA RRREEEKAAK EAAKAEKELE MARAKAAKEA AKVEKELEMA RAKAEKEAAN AEKEREAAAL KAEKAKSKAE KEAAEKAKLE EKRRLEAKKA KEANVFAQFF VKSPAVKSKP VVTPEVRTPE VSREVREKLD DIVRAEDAVE DMDAIRETSL KRWKTKRKEC RIEKRWGARR IQRDVEVTSV LCFSLLSKRK RDDDGMVHKS ARQRRLFDID VALYERPAFW GTGPFPNRPA NASVVTGRRP FGQEKDVDYE YDSAEEWEGA DQGESLSDED IDEEDDMPQA SDDEDDGFIA GDDEMADEPR HFDAAAIGDD AEMTQKRSTM AMLANRSRRS AAPLVISKLA TTVAENGGES SLLRLFALEA PFSNAPRISL KVSTSQPASA VKASKTVSKA AKTATKKSAD DILQENLRML VIFLLRNPSL KVNQVKAKFL EEACHSIAGL NHSAVKRKIM EIATHVSNRW IITEKAMQDA GVSDEEVAEL RANAVVPAKS TVKKRKIEQG DGAAAAPRTL ETFFGKTEQE KLPTATDAPI WNLAIASMSR SKDSKGMFRD DYKHIFDESN LKACVEQGVV PDSFVSCLIR SVGAKSQKTA FRIACEKLLV VVFRTLGNGQ NGVSERKLVT PARASVDAAC GGDALVSAIT SCIESGQESL KLAALEIMDA LLCDPTAGMK FVTNRMFSKQ VMQLMIDALG RRNEEFSVLA MRILCRALGS QTAIETCSAL RPEEFLTLSR ELAKGIRYQS PDIRDNVRHV SSGLNFLEKC FSMEAWTSAV DRNACRKTLV TVLGACIVHR EDALDWTNEV TTVLLNILIG TMEVLEITSE DKIQMRNALN ALWTSNDGQV KQLVEKVQNA LENVC
|
| |
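The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. The Translation table field of this record is empty, so the example assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, and only the first and last few codons of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), assumed here because
# the record does not state a translation table. Codons are enumerated
# in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
first_30 = "ATGCCGTCGC CGACGCGCGC GTCTAAGTCG"
last_18 = "CTAGAAAATG TCTGTTGA"

print(translate(first_30))  # MPSPTRASKS
print(translate(last_18))   # LENVC
```

Both fragments match the start (MPSPTRASKS) and end (LENVC) of the protein sequence listed above, and the final TGA codon is read as a stop.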