Gene OSTLU_16923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16923 
Symbol 
ID5003872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp647923 
End bp652602 
Gene Length4680 bp 
Protein Length1559 aa 
Translation table 
GC content58% 
IMG OID640419293 
Productpredicted protein 
Protein accessionXP_001419661 
Protein GI145350540 
COG category[K] Transcription 
COG ID[COG5179] Transcription initiation factor TFIID, subunit TAF1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGCG CGGACGACGA TGACGATTAC GATGACGACG TCGATGACGA CGATCGAGGA 
CGCGGCGCGA CGTACGCGGA CGCGGGGGCG CGAGACGACG ACGACGACGA CGATTACGAC
GGCGCGTCCG AAGACGCGAT GGACGCGAGC GACGACGGCG ACGCGTACGA AGACGAGAGC
GAGAGCGAGA GCGAGGACGA TGATTACGAC GAAGACGTCG GGAAGACGAA GAAGCGCGGG
AAAACGGGAC GGACGACGCG GGCGAGCGGG GGGGCGGTGG TGGTGGTGCC GACGACGACG
GGACGCGGCG AGCGAGGCGT GAAGGTGGCG TCGGTGAACG CGGACGACGC CCTGAGGGCG
CAGTTGAGTG GGAAACAGGA TCCCATGGCG TTCGTCATCT CGAGCTCGGT GCCGGAAACG
GGAGAGGTGT TGAAGTTTAC GTCGTTGTTG CACAACGCGC GAGACGGACG AACGTACGCG
GCGGCGTTCG GGGACGCCAG ACGAGGCGCG GGAGAGGACG ACGCGTCCAA GGCGACGAAT
CGAAGATTAG ACATGGAACA ACGAGAAGAC GACGGGGGGG GGAGCGAGAA CGACGAAGAT
TACGATGAAG ACGCCGAGGA CGGGGACGAA GACGACGCCG CGTGGTTCGC CAAGGTTGAC
GAAAATTTTC CGCTCGAGCG GTACGCGACT TTAAATCCCA CGCTGAGCGA CGACGAGGAC
GACGATGAGT ACTACGCCGA TAACGAGGCC GCGCGGCCGG CGTTAGACGT CGGCGAAGCC
GAGGCTCGCA TGAAACCGCG TAGAGGCGAG AGCGGGTTGG AAAACGAAGA GTGGGAGCGC
GGAGTGGTCT GGGGCGCAGA TTCCGACGAA GACGAAGAGA TTCGTCTCGC CGCGGTCGGT
ATCGACACCG TGGGCGCGTT CAAAAAGCCG CGACGCGGCG AAGAGACGGA CTTGTCATCG
CAAGTGACGG TTCCGTCGTT ACCCCCGGCC ACCGCGCTCG CGTCGGCGAC GTCGAGATCT
TGGGGAGAGC TCGACGGGGA GAGCCCGAGC GGCGGCGCGA GCTCGTTCGA TAGCGCCGTC
TCGGATTACA TGCGACATCT CAGCTTGTAC GGCGTCGTAG ACGAGAAGCA GCAGAGCGGT
GATGTGGACA TGACGGACGC CAACGCCGCG CGTGGGGCGG GTGAAATGCT GCGCATGCGA
AATCACGACT TCGCGAACGG GCGTTGGTTG GGCGACGTCG AGTGGGAGGG TAAACCGCGT
CAAGTCATCG ATCCACATCG CAAGTACATC TTGCCAAAGG TGCAAGTGAA CCCGAACGAT
CCGAGATTGG TGTTGCAGTA CAAGGGTGAT TTAGAAGCGA CGGTGTGGAA ATGGGCCAAC
GCCGTCATGG CGCCGAGCTG GGATCTCGAT CCCGTCAACG ACATCGACGA GGCGTTGAAT
ATCAGCATGG ATGACAAGTA CAAGGAGGAA AACGAGCAAA CTGTAGACAC TGGCGAGCGT
CCCCCGAGAC AAGGACGAAT TGACGGCATT CAACACGCCG AATTCCTTCT CACGTCGGTG
CCCCAACTCG TTCGCGATCC CCTCAACGAT TATCCCGCGC CCAGGCCAAT GCTGCGACCG
CCGACGGTGC TCCCGAAACA CGCCACGGCG TCGATGAAAA TTAAAATCGC GGGGTCGGAG
TTGCAAACGT TTAAGGTTTG CATCAAATCG CTCAATATTC ACAGTGCGGT GATAAACTTG
AAAGTTCGCG GTACCGACAC CATCGGAAGC GTCATCCCTC GCATTCGCAA GCGCTGGACC
GACTTAGACG GTCCTATTCA CTTGTACTAC CCGGGAAGTC CGAAGCCGAA CGAAAAGTTG
AACGAAGAAA TGACGCTCGT CGATGCGGGC TTGCAAGCGC AAGTCGGCTT GCCGATCATC
TATCTCGTGG CTCCGAAGGT TACGTTCATC AGCGAAGAGG CGGCGTTGGC GCCCCTCGCT
TCCGACGCCG TTCTCGCGCC CCCGGGCGCG TACAAAAAAC CCTCGGATTT GTCGGCGCGA
AGTGGTACGC TGATGATGGT GCAGTACACA GAAGCGCGCC CGCCGCTCAT CGCTAAGCCT
GGTATGGGAG CAAAGAAGGT TGTGTACTAT CGCAGAAAGA CGCTCGGCGA TCAAGGCGCG
CGTCCATACG CCAACGCGAC GACGACTGTC ATTGATCTCA AGCCGAACGC GCCGTCACCC
TTCCTCGCCG AACTTCCTCC TGGGCGAGGC ATAACGGCCC TTGAAACGAG CATGTTCAGA
GCACCGTTGT TTCAGCAAGC AAAGTCTGAT GACTACGTTG ATTTTCTTCT CATCAGAGCT
CCGAACGGCA AGTTGACGCT TCGCGAAGCG CCGTCGCTTC ACCTGGCTGG ACAACAAGAG
CCACACGTAG AGATTCCGGC GCCGGGGAGT GACATGCTCA AAGACTTTGA AGAGCGTCTC
GTCAACGCCA CCGTACTCCG CTACTTCTTG AACCTGCAAG CGAAAGGGCT CATCGAACCC
GGCACGATGC CGACGGTGAA ATCGACCGAC GTCGCGGCGA CGCTCTCGCA CGCGCTTTCC
GTTCGAGATG TTCGCAAAAA TATTCGCCGA AAGATTTGCG CGGTGCGCCG CGGCGCCGAG
GCAAATGAGG ATGAGTATGT TCTCAATCCT TCGTACCGAT TCGAGCACGA AAAAGAGGTG
CAACGGATGG CGACGCCAGA AGAAGTGTGC GCGTACGAGT CGTATCGTCA CGCGGTTGCT
GTACTCTGCA AGGACCGCGA TCGCGACGAA CAGGAGCGAA TTTTACGATT GTCGTCTCTT
TCGCTTCAAC AGTTGAAGAA TGCGGTGACC ATCTTGATCC GACACAGCGA GGGCAAGCGC
AAGCGTCAAC TGGAGAACTT GGAGTTGTGG CTGCAAATTC AGCCGTGGGC GCAAACGCAC
GAATTCCTTC AGGCTTCGCT GGGCACGAAG GGAATTTTAC ATCTCGAGAC GTCGAGGAGA
ATCCAGCGAC TCACTGGAAA GTTCTACAAC TATATTCGAC GTATGACTGT ACCAGATCCT
CCGGAAGATC GTCGACCTCG CCGCGAGCCC GGAACCGTGA CGGGGACGAA TGCCGATCTC
CGTAAGCTTA CGATGCCTCA GGCGCAACGC ATCCTTGAAA ACTTTGGGGT AGAGAAGGAA
GTGATTCTCA GGCTTGAACG ATGGAAGAGA ATCGGTTTGA TTCGTGAATT GTCTGGTGCT
GCGACCGCGG ATGGCACGAA CTCTCACGCG GGCATGGCGC GTTTCGCCCG TCGCTTGCGA
GTGAGCGAAG CGGACCAACT CAAAGAGATT CGCGAACATT CTGATTTGCT CTTCAAAAGA
CAGATGAAGC AATATTCTCA GCGACACACG CGTCACGAAG ATTCTTCGAG CGGTGGCGAG
ACGGATGAAG ACGATTCGTC GAGCGACAGT GAAAGTGAAA GCGATTCTTT GGCAGATGAG
TTGGAAAAGC AACTCGAAGA AAAGGCTGCC AAGGATGCGC CGAACGAAGA AGACGAACGC
GCCGAGCTCG AGGCGCTTCG TAACGCCCTT AAGGACGGTA GTTCCATCGA GCCATCCGTT
GACCCGATGA AGGCTTTGGC GCAAGGCATT TTGGTGCCCG GAAAGAAGCT GAAATTGAAG
CGAGTTTCGA CCCACATCTT CCCAGACGGT CGATCCGCCA AGGTGGAGGA CGACATCACC
GAGTACGCGG GAGAGGCTTG GCTTCAAGCG CGCGCGCAAG GTGTCGAAGC CGCAGACGCT
GCGACGACTG CTGCACTCAT CGCATGCGGC ATGGATAAGC CGCCAGAGCC GCCGAAGGAA
GCCTTGGAGA GGAAGGAGCT CGACGACGAA GAAACTGGTC TTACTCCGGA AGCAAGAGCG
CGTTACGAAA TGCTTCGTCA GCGGAAACGT GCCAAGGAGC GCGTGCGACG AGCCGGCGAG
AAGATCAAGC GTTTGCAGTC CATGGGCGTC ACCGATGAAG GCGCGGACTT GGCGAACGCG
CAAAAGTCGC CCGGAATCAC GCCAAGCGAG AGGCCCGCAC CCGTGGTTCC AGGTCCCGGT
GGACTGAAAA TTAAGCTTGG CGTCAGCAAG AAGACGATGG ATAAAGTGTT CAGCTCTGGC
GCGACGCCGC AGAAGAAATC CACGCTCGGT CGCAAGTTGC CTTGGGATAA AGTACTCATA
AAAGTCACCG AAAGCGCAAT GAAACATGGA ACATACGGCC AAGTGTTTAC GCCGGCGGTG
ACGTTGTCAG ACTACGCAAA GTTTGTTGAG CAACCCATGG ATCTGGGCGC CATCATGGCA
CGCCTTCGCG AGGGCTTGTA CAATGATCCC AGCGACTGGG CATCGGACGT CAAGTTGATC
GCAGTCAACG CGCAGGCGTA CCACGGGAGC GAAGAACCGG TTGAGCTTCG TCTGGATTGG
GTTCCTGGGA TGGCGGCGGA CATGGTCAAC TACATAGCCG ATCAAGCGAA ACGTCACGAA
GCGGACATCA AGGCTTCGTT TTCGGATTTC TCCGCCGACT CTCTGCGCGT TGCGCGTCAG
GGAGCCACGG CCGGTGGTGA TAGCGCTCCA GCCGAAGTCA AAACCGAAGT TCCAACAGAG
CCCGCGGCGC CGCCGAAGCC GCTTCCAAAG CTCAAGTTTT CGTTTGGAAA GAAATCGTAG
 
Protein sequence
MARADDDDDY DDDVDDDDRG RGATYADAGA RDDDDDDDYD GASEDAMDAS DDGDAYEDES 
ESESEDDDYD EDVGKTKKRG KTGRTTRASG GAVVVVPTTT GRGERGVKVA SVNADDALRA
QLSGKQDPMA FVISSSVPET GEVLKFTSLL HNARDGRTYA AAFGDARRGA GEDDASKATN
RRLDMEQRED DGGGSENDED YDEDAEDGDE DDAAWFAKVD ENFPLERYAT LNPTLSDDED
DDEYYADNEA ARPALDVGEA EARMKPRRGE SGLENEEWER GVVWGADSDE DEEIRLAAVG
IDTVGAFKKP RRGEETDLSS QVTVPSLPPA TALASATSRS WGELDGESPS GGASSFDSAV
SDYMRHLSLY GVVDEKQQSG DVDMTDANAA RGAGEMLRMR NHDFANGRWL GDVEWEGKPR
QVIDPHRKYI LPKVQVNPND PRLVLQYKGD LEATVWKWAN AVMAPSWDLD PVNDIDEALN
ISMDDKYKEE NEQTVDTGER PPRQGRIDGI QHAEFLLTSV PQLVRDPLND YPAPRPMLRP
PTVLPKHATA SMKIKIAGSE LQTFKVCIKS LNIHSAVINL KVRGTDTIGS VIPRIRKRWT
DLDGPIHLYY PGSPKPNEKL NEEMTLVDAG LQAQVGLPII YLVAPKVTFI SEEAALAPLA
SDAVLAPPGA YKKPSDLSAR SGTLMMVQYT EARPPLIAKP GMGAKKVVYY RRKTLGDQGA
RPYANATTTV IDLKPNAPSP FLAELPPGRG ITALETSMFR APLFQQAKSD DYVDFLLIRA
PNGKLTLREA PSLHLAGQQE PHVEIPAPGS DMLKDFEERL VNATVLRYFL NLQAKGLIEP
GTMPTVKSTD VAATLSHALS VRDVRKNIRR KICAVRRGAE ANEDEYVLNP SYRFEHEKEV
QRMATPEEVC AYESYRHAVA VLCKDRDRDE QERILRLSSL SLQQLKNAVT ILIRHSEGKR
KRQLENLELW LQIQPWAQTH EFLQASLGTK GILHLETSRR IQRLTGKFYN YIRRMTVPDP
PEDRRPRREP GTVTGTNADL RKLTMPQAQR ILENFGVEKE VILRLERWKR IGLIRELSGA
ATADGTNSHA GMARFARRLR VSEADQLKEI REHSDLLFKR QMKQYSQRHT RHEDSSSGGE
TDEDDSSSDS ESESDSLADE LEKQLEEKAA KDAPNEEDER AELEALRNAL KDGSSIEPSV
DPMKALAQGI LVPGKKLKLK RVSTHIFPDG RSAKVEDDIT EYAGEAWLQA RAQGVEAADA
ATTAALIACG MDKPPEPPKE ALERKELDDE ETGLTPEARA RYEMLRQRKR AKERVRRAGE
KIKRLQSMGV TDEGADLANA QKSPGITPSE RPAPVVPGPG GLKIKLGVSK KTMDKVFSSG
ATPQKKSTLG RKLPWDKVLI KVTESAMKHG TYGQVFTPAV TLSDYAKFVE QPMDLGAIMA
RLREGLYNDP SDWASDVKLI AVNAQAYHGS EEPVELRLDW VPGMAADMVN YIADQAKRHE
ADIKASFSDF SADSLRVARQ GATAGGDSAP AEVKTEVPTE PAAPPKPLPK LKFSFGKKS