Gene Information |
Locus tag | OSTLU_16923 |
Symbol | |
ID | 5003872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009363 |
Strand | + |
Start bp | 647923 |
End bp | 652602 |
Gene Length | 4680 bp |
Protein Length | 1559 aa |
Translation table | |
GC content | 58% |
IMG OID | 640419293 |
Product | predicted protein |
Protein accession | XP_001419661 |
Protein GI | 145350540 |
COG category | [K] Transcription |
COG ID | [COG5179] Transcription initiation factor TFIID, subunit TAF1 |
TIGRFAM ID | |
| |
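The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 647923
END_BP = 652602
GENE_LENGTH_BP = 4680
PROTEIN_LENGTH_AA = 1559


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the record marks the gene as not spliced, the genomic span and the coding length can be compared directly; for a spliced gene the intron lengths would have to be subtracted first.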
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCGCG CGGACGACGA TGACGATTAC GATGACGACG TCGATGACGA CGATCGAGGA CGCGGCGCGA CGTACGCGGA CGCGGGGGCG CGAGACGACG ACGACGACGA CGATTACGAC GGCGCGTCCG AAGACGCGAT GGACGCGAGC GACGACGGCG ACGCGTACGA AGACGAGAGC GAGAGCGAGA GCGAGGACGA TGATTACGAC GAAGACGTCG GGAAGACGAA GAAGCGCGGG AAAACGGGAC GGACGACGCG GGCGAGCGGG GGGGCGGTGG TGGTGGTGCC GACGACGACG GGACGCGGCG AGCGAGGCGT GAAGGTGGCG TCGGTGAACG CGGACGACGC CCTGAGGGCG CAGTTGAGTG GGAAACAGGA TCCCATGGCG TTCGTCATCT CGAGCTCGGT GCCGGAAACG GGAGAGGTGT TGAAGTTTAC GTCGTTGTTG CACAACGCGC GAGACGGACG AACGTACGCG GCGGCGTTCG GGGACGCCAG ACGAGGCGCG GGAGAGGACG ACGCGTCCAA GGCGACGAAT CGAAGATTAG ACATGGAACA ACGAGAAGAC GACGGGGGGG GGAGCGAGAA CGACGAAGAT TACGATGAAG ACGCCGAGGA CGGGGACGAA GACGACGCCG CGTGGTTCGC CAAGGTTGAC GAAAATTTTC CGCTCGAGCG GTACGCGACT TTAAATCCCA CGCTGAGCGA CGACGAGGAC GACGATGAGT ACTACGCCGA TAACGAGGCC GCGCGGCCGG CGTTAGACGT CGGCGAAGCC GAGGCTCGCA TGAAACCGCG TAGAGGCGAG AGCGGGTTGG AAAACGAAGA GTGGGAGCGC GGAGTGGTCT GGGGCGCAGA TTCCGACGAA GACGAAGAGA TTCGTCTCGC CGCGGTCGGT ATCGACACCG TGGGCGCGTT CAAAAAGCCG CGACGCGGCG AAGAGACGGA CTTGTCATCG CAAGTGACGG TTCCGTCGTT ACCCCCGGCC ACCGCGCTCG CGTCGGCGAC GTCGAGATCT TGGGGAGAGC TCGACGGGGA GAGCCCGAGC GGCGGCGCGA GCTCGTTCGA TAGCGCCGTC TCGGATTACA TGCGACATCT CAGCTTGTAC GGCGTCGTAG ACGAGAAGCA GCAGAGCGGT GATGTGGACA TGACGGACGC CAACGCCGCG CGTGGGGCGG GTGAAATGCT GCGCATGCGA AATCACGACT TCGCGAACGG GCGTTGGTTG GGCGACGTCG AGTGGGAGGG TAAACCGCGT CAAGTCATCG ATCCACATCG CAAGTACATC TTGCCAAAGG TGCAAGTGAA CCCGAACGAT CCGAGATTGG TGTTGCAGTA CAAGGGTGAT TTAGAAGCGA CGGTGTGGAA ATGGGCCAAC GCCGTCATGG CGCCGAGCTG GGATCTCGAT CCCGTCAACG ACATCGACGA GGCGTTGAAT ATCAGCATGG ATGACAAGTA CAAGGAGGAA AACGAGCAAA CTGTAGACAC TGGCGAGCGT CCCCCGAGAC AAGGACGAAT TGACGGCATT CAACACGCCG AATTCCTTCT CACGTCGGTG CCCCAACTCG TTCGCGATCC CCTCAACGAT TATCCCGCGC CCAGGCCAAT GCTGCGACCG CCGACGGTGC TCCCGAAACA CGCCACGGCG TCGATGAAAA TTAAAATCGC GGGGTCGGAG TTGCAAACGT TTAAGGTTTG CATCAAATCG CTCAATATTC ACAGTGCGGT GATAAACTTG AAAGTTCGCG GTACCGACAC CATCGGAAGC GTCATCCCTC GCATTCGCAA GCGCTGGACC 
GACTTAGACG GTCCTATTCA CTTGTACTAC CCGGGAAGTC CGAAGCCGAA CGAAAAGTTG AACGAAGAAA TGACGCTCGT CGATGCGGGC TTGCAAGCGC AAGTCGGCTT GCCGATCATC TATCTCGTGG CTCCGAAGGT TACGTTCATC AGCGAAGAGG CGGCGTTGGC GCCCCTCGCT TCCGACGCCG TTCTCGCGCC CCCGGGCGCG TACAAAAAAC CCTCGGATTT GTCGGCGCGA AGTGGTACGC TGATGATGGT GCAGTACACA GAAGCGCGCC CGCCGCTCAT CGCTAAGCCT GGTATGGGAG CAAAGAAGGT TGTGTACTAT CGCAGAAAGA CGCTCGGCGA TCAAGGCGCG CGTCCATACG CCAACGCGAC GACGACTGTC ATTGATCTCA AGCCGAACGC GCCGTCACCC TTCCTCGCCG AACTTCCTCC TGGGCGAGGC ATAACGGCCC TTGAAACGAG CATGTTCAGA GCACCGTTGT TTCAGCAAGC AAAGTCTGAT GACTACGTTG ATTTTCTTCT CATCAGAGCT CCGAACGGCA AGTTGACGCT TCGCGAAGCG CCGTCGCTTC ACCTGGCTGG ACAACAAGAG CCACACGTAG AGATTCCGGC GCCGGGGAGT GACATGCTCA AAGACTTTGA AGAGCGTCTC GTCAACGCCA CCGTACTCCG CTACTTCTTG AACCTGCAAG CGAAAGGGCT CATCGAACCC GGCACGATGC CGACGGTGAA ATCGACCGAC GTCGCGGCGA CGCTCTCGCA CGCGCTTTCC GTTCGAGATG TTCGCAAAAA TATTCGCCGA AAGATTTGCG CGGTGCGCCG CGGCGCCGAG GCAAATGAGG ATGAGTATGT TCTCAATCCT TCGTACCGAT TCGAGCACGA AAAAGAGGTG CAACGGATGG CGACGCCAGA AGAAGTGTGC GCGTACGAGT CGTATCGTCA CGCGGTTGCT GTACTCTGCA AGGACCGCGA TCGCGACGAA CAGGAGCGAA TTTTACGATT GTCGTCTCTT TCGCTTCAAC AGTTGAAGAA TGCGGTGACC ATCTTGATCC GACACAGCGA GGGCAAGCGC AAGCGTCAAC TGGAGAACTT GGAGTTGTGG CTGCAAATTC AGCCGTGGGC GCAAACGCAC GAATTCCTTC AGGCTTCGCT GGGCACGAAG GGAATTTTAC ATCTCGAGAC GTCGAGGAGA ATCCAGCGAC TCACTGGAAA GTTCTACAAC TATATTCGAC GTATGACTGT ACCAGATCCT CCGGAAGATC GTCGACCTCG CCGCGAGCCC GGAACCGTGA CGGGGACGAA TGCCGATCTC CGTAAGCTTA CGATGCCTCA GGCGCAACGC ATCCTTGAAA ACTTTGGGGT AGAGAAGGAA GTGATTCTCA GGCTTGAACG ATGGAAGAGA ATCGGTTTGA TTCGTGAATT GTCTGGTGCT GCGACCGCGG ATGGCACGAA CTCTCACGCG GGCATGGCGC GTTTCGCCCG TCGCTTGCGA GTGAGCGAAG CGGACCAACT CAAAGAGATT CGCGAACATT CTGATTTGCT CTTCAAAAGA CAGATGAAGC AATATTCTCA GCGACACACG CGTCACGAAG ATTCTTCGAG CGGTGGCGAG ACGGATGAAG ACGATTCGTC GAGCGACAGT GAAAGTGAAA GCGATTCTTT GGCAGATGAG TTGGAAAAGC AACTCGAAGA AAAGGCTGCC AAGGATGCGC CGAACGAAGA AGACGAACGC GCCGAGCTCG AGGCGCTTCG TAACGCCCTT AAGGACGGTA GTTCCATCGA GCCATCCGTT GACCCGATGA 
AGGCTTTGGC GCAAGGCATT TTGGTGCCCG GAAAGAAGCT GAAATTGAAG CGAGTTTCGA CCCACATCTT CCCAGACGGT CGATCCGCCA AGGTGGAGGA CGACATCACC GAGTACGCGG GAGAGGCTTG GCTTCAAGCG CGCGCGCAAG GTGTCGAAGC CGCAGACGCT GCGACGACTG CTGCACTCAT CGCATGCGGC ATGGATAAGC CGCCAGAGCC GCCGAAGGAA GCCTTGGAGA GGAAGGAGCT CGACGACGAA GAAACTGGTC TTACTCCGGA AGCAAGAGCG CGTTACGAAA TGCTTCGTCA GCGGAAACGT GCCAAGGAGC GCGTGCGACG AGCCGGCGAG AAGATCAAGC GTTTGCAGTC CATGGGCGTC ACCGATGAAG GCGCGGACTT GGCGAACGCG CAAAAGTCGC CCGGAATCAC GCCAAGCGAG AGGCCCGCAC CCGTGGTTCC AGGTCCCGGT GGACTGAAAA TTAAGCTTGG CGTCAGCAAG AAGACGATGG ATAAAGTGTT CAGCTCTGGC GCGACGCCGC AGAAGAAATC CACGCTCGGT CGCAAGTTGC CTTGGGATAA AGTACTCATA AAAGTCACCG AAAGCGCAAT GAAACATGGA ACATACGGCC AAGTGTTTAC GCCGGCGGTG ACGTTGTCAG ACTACGCAAA GTTTGTTGAG CAACCCATGG ATCTGGGCGC CATCATGGCA CGCCTTCGCG AGGGCTTGTA CAATGATCCC AGCGACTGGG CATCGGACGT CAAGTTGATC GCAGTCAACG CGCAGGCGTA CCACGGGAGC GAAGAACCGG TTGAGCTTCG TCTGGATTGG GTTCCTGGGA TGGCGGCGGA CATGGTCAAC TACATAGCCG ATCAAGCGAA ACGTCACGAA GCGGACATCA AGGCTTCGTT TTCGGATTTC TCCGCCGACT CTCTGCGCGT TGCGCGTCAG GGAGCCACGG CCGGTGGTGA TAGCGCTCCA GCCGAAGTCA AAACCGAAGT TCCAACAGAG CCCGCGGCGC CGCCGAAGCC GCTTCCAAAG CTCAAGTTTT CGTTTGGAAA GAAATCGTAG
|
Protein sequence | MARADDDDDY DDDVDDDDRG RGATYADAGA RDDDDDDDYD GASEDAMDAS DDGDAYEDES ESESEDDDYD EDVGKTKKRG KTGRTTRASG GAVVVVPTTT GRGERGVKVA SVNADDALRA QLSGKQDPMA FVISSSVPET GEVLKFTSLL HNARDGRTYA AAFGDARRGA GEDDASKATN RRLDMEQRED DGGGSENDED YDEDAEDGDE DDAAWFAKVD ENFPLERYAT LNPTLSDDED DDEYYADNEA ARPALDVGEA EARMKPRRGE SGLENEEWER GVVWGADSDE DEEIRLAAVG IDTVGAFKKP RRGEETDLSS QVTVPSLPPA TALASATSRS WGELDGESPS GGASSFDSAV SDYMRHLSLY GVVDEKQQSG DVDMTDANAA RGAGEMLRMR NHDFANGRWL GDVEWEGKPR QVIDPHRKYI LPKVQVNPND PRLVLQYKGD LEATVWKWAN AVMAPSWDLD PVNDIDEALN ISMDDKYKEE NEQTVDTGER PPRQGRIDGI QHAEFLLTSV PQLVRDPLND YPAPRPMLRP PTVLPKHATA SMKIKIAGSE LQTFKVCIKS LNIHSAVINL KVRGTDTIGS VIPRIRKRWT DLDGPIHLYY PGSPKPNEKL NEEMTLVDAG LQAQVGLPII YLVAPKVTFI SEEAALAPLA SDAVLAPPGA YKKPSDLSAR SGTLMMVQYT EARPPLIAKP GMGAKKVVYY RRKTLGDQGA RPYANATTTV IDLKPNAPSP FLAELPPGRG ITALETSMFR APLFQQAKSD DYVDFLLIRA PNGKLTLREA PSLHLAGQQE PHVEIPAPGS DMLKDFEERL VNATVLRYFL NLQAKGLIEP GTMPTVKSTD VAATLSHALS VRDVRKNIRR KICAVRRGAE ANEDEYVLNP SYRFEHEKEV QRMATPEEVC AYESYRHAVA VLCKDRDRDE QERILRLSSL SLQQLKNAVT ILIRHSEGKR KRQLENLELW LQIQPWAQTH EFLQASLGTK GILHLETSRR IQRLTGKFYN YIRRMTVPDP PEDRRPRREP GTVTGTNADL RKLTMPQAQR ILENFGVEKE VILRLERWKR IGLIRELSGA ATADGTNSHA GMARFARRLR VSEADQLKEI REHSDLLFKR QMKQYSQRHT RHEDSSSGGE TDEDDSSSDS ESESDSLADE LEKQLEEKAA KDAPNEEDER AELEALRNAL KDGSSIEPSV DPMKALAQGI LVPGKKLKLK RVSTHIFPDG RSAKVEDDIT EYAGEAWLQA RAQGVEAADA ATTAALIACG MDKPPEPPKE ALERKELDDE ETGLTPEARA RYEMLRQRKR AKERVRRAGE KIKRLQSMGV TDEGADLANA QKSPGITPSE RPAPVVPGPG GLKIKLGVSK KTMDKVFSSG ATPQKKSTLG RKLPWDKVLI KVTESAMKHG TYGQVFTPAV TLSDYAKFVE QPMDLGAIMA RLREGLYNDP SDWASDVKLI AVNAQAYHGS EEPVELRLDW VPGMAADMVN YIADQAKRHE ADIKASFSDF SADSLRVARQ GATAGGDSAP AEVKTEVPTE PAAPPKPLPK LKFSFGKKS
|
| |
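The gene and protein sequences above can be related by translating codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1), since the Translation table field of this record is blank. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the example input is only the first 30 bases of the gene sequence, not the full record.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence listed above.
    first_30 = "ATGGCGCGCG CGGACGACGA TGACGATTAC"
    print(translate(first_30))   # compare with the start of the protein sequence
    print(gc_fraction(first_30))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.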