Gene OSTLU_40550 details

Gene Information

Locus tag: OSTLU_40550
Symbol:
ID: 5005665
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 1418
End bp: 4405
Gene Length: 2988 bp
Protein Length: 950 aa
Translation table:
GC content: 54%
IMG OID: 640421086
Product: predicted protein
Protein accession: XP_001421676
Protein GI: 145354827
COG category:
COG ID:
TIGRFAM ID:
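
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1418
end_bp = 4405
protein_length_aa = 950


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


gene_length_bp = span_length(start_bp, end_bp)

# Bases needed to encode the protein, three per amino acid.
coding_bp = 3 * protein_length_aa

# Bases in the gene span that the 950 codons do not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 2988
print(coding_bp)       # 2850
print(non_coding_bp)   # 138
```

The 138 bp difference is consistent with the gene being marked as spliced. The record does not state how those bases divide between intronic sequence and any stop codon.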


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCAAA ACCGACGACT GACTGGGCAT ATATTATTTC ATCGACGGGC GCAGAGAGGA 
CGCGTACACC CGGGAGATAT CGCCGAAGCC GCGTCGCTGG CGAGCATCGA TTTACCACCG
GCCAAGTATC CTTTATTTGA CGCCATCGGC GACGATTTCG TGGAGACTGG AAAACTGAGC
AAGCTCCAAC TGGAGGGTGT CATGTTCGCG TGTCAAAAGC ACTGCGAATT CGCGCCGGAT
GGCAAGCGTT CGGGTTTCAT GATTGGCGAC GGTGCGGGGG TGGGCAAGGG ACGACAAATT
AGTGGAATTA TTATCGATAA TTTCGTGCGA GGCCGTCGGA AAGCGGTTTG GATATCGAGC
TCGGGCGATT TACACCGAGA CGCCGAGCGC GACTTGAGCG ATTTGGGAAG CACTATTAAG
GTTATTAATT CGTGCCCAGA GCTCGATCGC GAAACTCGCG CGTTCGGGTT ATCGAAAGAG
TATCAAGAGG GCGTGTTGTT TACCACCTAC AGCACGTTAG TGAGCAAGAC GCAGAAGAAG
AGTCGTCTCG ATCAAATCGT AAATTGGTTC GGCGGCGAAG ACGCCGAGGG CGTCGTGATT
TTTGACGAGT GCCACAAGGC AAAGAACTTT AGCGCGTCTT CTGATTCGGG TAGTAAAGTT
GCGGCGGCTG TGATTGCGTT TCAGGAACGC TGTCCTCGCG CGCGCGTTGT GTACGCGAGC
GCCACAGGTA TCTCTGAAGT TGGGCACATG ACCTACTTGG CCAGACTCGG TTTCTGGGGC
AAAGGGACGC CATTTAAATC CGCGGATACG TTCATTGAGA GCATGAAGTC TCGCGGTGTA
GGATTTTTAG AAATGCTCGC CATGGAAATG AAGGCAAGTG GGAAGTATGT TAGTCGAGGT
TTATCTTTTA GGCAGGCAGA GTTTGAGCAG TACACGATCG CGCTCAGCAA AGAGCAGCGT
CAAATGTACG ATGACGCGTG CGACTTGATG TCGCTCATCC GTAATGCGTG TATCGAAGCT
GTGCGTCGAA CTGGTTCCGA TGGAAAAGGC GTTTGGAGTG CGTATTGGAG CGTCCATCAA
AGATTCTTCA AGCTCTTGTG CATCAGCATG AAAGTTCCGG CGGTCATCGC CAAGGCGAAC
GAGGCGCTCG CGCGAGAGCA ATGCGTCGTC ATCGGGTTGC AAACGACAGG TGAGTCTCAA
GATGCGTCAG CCAACTTGAC TTTTGGTGAA GACGTCACGG AGTTCGGGTT CCTGAGTACG
ACGCGCGAAA TGCTCGTCAA TTTCTTGAAC GTGCACTTTC CGACTGAAAT CCACGGCGCT
TCAGACGGAG CGGCGGCGGA AAAGTCGGCG CCTTCCGATT GGTCGAGCGA TTGGAGGCGT
GCAGGCGGCG CGGAGGTAGA GAGCGAAGGC TACACTGGTC GTTTTCACAC GAATGCCCCA
GCCAAGGTCG ACGAAGAGCT CGTGGACGCT AAGGAAGAGT TGATCAATAA AGTCATCAAG
TTGAGGTTAC CGCCGAACTT TCTCGATGCT CTGATTGATG GGCTCGGAGG GCCGACAGAA
GTTGCCGAAC TCACGGGAAG AAGCGGTCGC ATCGTGCGTC GAGGCGATAG GCTCATGTAC
GAATCGCGTG GCACGACGGC GTCTAGAAAA GGAACGAGCA TCAGCGGCGA CGGTGATGTG
CTCGGAGTGA ACATCGCCGA GAAGAACGCG TTTATGAATG GTGACAAACG TGTGGCCATA
ATTTCGGACG CGGCTAGTAC CGGGATTTCA TTGCACGCGT CGAAAGGCGC GAAAAATGTC
CGCCGCCGCG TGCACGTCAC GATAGAACTT CCATGGAGTG CGGACAAGGC GATTCAGCAG
CTCGGTCGCA CGCATCGATC CAATCAGATT ACGGGTCCAG TGTACGTGAT GTGCTCGACG
AATATCGGAG GCGAAAGACG CTTCGTCGCC GCCGTCGCAC GACGCTTGCA AAGTCTCGGG
GCGTTGACGA GAGGTGATCG ACGTGCGGCG ACTGGCATTG ATTTATCCGA TGGCAATCTC
GACTCGTCAC TCGGGCGACA CGCGTTGAAG AAGCTCTACG AGGCGCTCGT CGCGCTCGAT
GCGAATCTCC CGTCGGGCGT CACGCTCGAG GGCATCATGT CGCATGTGAA CGACGACGAT
CTCGAGGGGA AACCCATCTC GAGCTGGCAG GATTTGAAGC TTGAACTTCG TTCAGCGTTA
TTGGAGCTCG GAGTGCAGGT TGGACTGCGA AATGACGGCT TTTCGGTCGA AGAAGCGACG
CTGCTGCAGG AAATTGGATA CGGCAACAAA GATCTCGGCG ACGTGCGCAA GTTTCTCAAT
CGCCTACTTG GACTTCCGGT GCGAACGCAA AATATCATGT TTGCATACTT TGCTGAAACC
TTGGATTGCG AAATCAAATT AGCAAAGGCT GAGGGGAAGT ACACGGAGGG CGTGAGCGAT
CTCGGTGGTT CTTCCATTCG CATCGAACCA GAGTCGAAGA CGATTCTTAA GGATCCGTAC
CACGGACAAA AACTCGTGTC GACCAAGGTT TTCATAGATC GCGGTATTTC GTTTGAGCGT
GCGCTGGATA TTTTGGAAAA GAGCAAGAAT CACGATCGAG ACGGCTTCTA CAAGATGAAG
CGCGAGATGT ATGGGCGCGC GCAAGTCGTT CTGGCCATCA CCAAGCCAGG TTCGCGAAAC
TTGTTCATGC TGTGGCGTCC GAACACCGGC GCAAGTCTTT TCGAGATGGA GTACGACGAA
TTGCGCGGCA AATACATGAT GTGCACGCCC GACGTGGCGA AACAACCGTG GGATCTTGTG
CACGAATTGA CCGAGAAGCA TTGCATGCAC GGCGTGAATT GTCACGTTCG ACAAACCATG
GGCGCTTGTA CCGCGGGCAA ACGCACGGTT GACTGCACCG TGATCAGCGG CGCCGTCGTG
CCTTGCTGGG GAGAGCTCGA ACGAACTTTG GATAGAAACG CGCACAAG
 
Protein sequence
MIQNRRLTGH ILFHRRAQRG RVHPGDIAEA ASLASIDLPP AKYPLFDAIG DDFVETGKLS 
KLQLEGVMFA CQKHCEFAPD GKRSGFMIGD GAGVGKGRQI SGIIIDNFVR GRRKAVWISS
SGDLHRDAER DLSDLGSTIK VINSCPELDR ETRAFGLSKE YQEGVLFTTY STLVSKTQKK
SRLDQIVNWF GGEDAEGVVI FDECHKAKNF SASSDSGSKV AAAVIAFQER CPRARVVYAS
ATGISEVGHM TYLARLGFWG KGTPFKSADT FIESMKSRGV GFLEMLAMEM KASGKYVSRG
LSFRQAEFEQ YTIALSKEQR QMYDDACDLM SLIRNACIEA VRRTGSDGKG VWSAYWSVHQ
RFFKLLCISM KVPAVIAKAN EALAREQCVV IGLQTTGESQ DASANLTFGE DVTEFGFLST
TREMLVNFLN VHFPTEIHGA SDGAAAEKSA PSDWSSDWRR AGGAEVESEG YTGRFHTNAP
AKVDEELVDA KEELINKVIK LRLPPNFLDA LIDGLGGPTE VAELTGRSGR IVRRGDRLMY
ESRGTTASRK GTSISGDGDV LGVNIAEKNA FMNGDKRVAI ISDAASTGIS LHASKGAKNV
RRRVHVTIEL PWSADKAIQQ LGRTHRSNQI TGPVYVMCST NIGGERRFVA AVARRLQSLG
ALTRGDRRAA TGIDLSDGNL DSSLGRHALK KLYEALLGVQ VGLRNDGFSV EEATLLQEIG
YGNKDLGDVR KFLNRLLGLP VRTQNIMFAY FAETLDCEIK LAKAEGKYTE GVSDLGGSSI
RIEPESKTIL KDPYHGQKLV STKVFIDRGI SFERALDILE KSKNHDRDGF YKMKREMYGR
AQVVLAITKP GSRNLFMLWR PNTGASLFEM EYDELRGKYM MCTPDVAKQP WDLVHELTEK
HCMHGVNCHV RQTMGACTAG KRTVDCTVIS GAVVPCWGEL ERTLDRNAHK
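
Both sequence blocks above are printed in groups of ten residues, sixty per line, so they need to be joined before use. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample is only the first two groups of the gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First two groups of the gene sequence above, used only as a short sample.
sample = clean_sequence("ATGATTCAAA ACCGACGACT")

print(len(sample))          # 20
print(gc_fraction(sample))  # 0.4
```

Applied to the full blocks, `len(clean_sequence(...))` should match the Gene Length and Protein Length fields, and `gc_fraction` on the full gene sequence can be compared against the reported GC content.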