Gene OSTLU_26149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26149 
Symbol 
ID5004150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp308322 
End bp311564 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table 
GC content53% 
IMG OID640419571 
Productpredicted protein 
Protein accessionXP_001420141 
Protein GI145351561 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0731433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA AATTGAACGA GGACGCCGTG CGCGCGGACG CGGACGCGAA AATGAAATCA 
GACAGGATTG AAACGCTCGC GAGAGAACGC CAACGCGAGG AAGCGAACGC GAGCGACGAC
GAAGACGAGG ACGAAGACGA AGGGTTTCGA ATGAAGCGAA AAGAAGAGGA TGCGGCGGTA
CCGGCGTACA CGTGGACGGC GACGCTGCGA CGACGGGTGC GAGAACTCGA GGAGGGCGCT
GACGTCGAGG CGCGAGAGGC GGGGAGGCTT CGGTTGGACG AGGCGCTCGG CGCCGCCGAC
GACGAAGACG CCGGTGATGT CAGCACGTTT GTGAATGGCG CTCCGACAGT GATTACTAAA
ATGAAGAGTC GAATCAATTC GATGAATGCG CCGACGGGCC CGACCGGTGT GCTGGTGAAG
ACCGCGCACG CAGAGGATAA GCCGATGCAG CAAGTCGTGG AAGACTCGTA CTACAAAGCT
GTCATGAGCG AAAAGCTCCC GGCGTACAAA GCACCGCGCC GACACATCGC GCGATTGGAG
AATCTTGAGT CTGCGGTTTT AGACAAAGAG GGCAACCAGC TTCCTTTGTG GAAATCGACG
AGCGACGATC TCGACGTTCT CGGCCCGGGT ATCAGCTTGT GGTTCGCTCA GTTGAAGACG
TTGGGTAAAA CGTTTGTCTA CATCACAATG CTCGCGTCTG TCGCGCTCTC GCACTATGTG
TATATCGCCA ATAAAAGTAG TAGCGTCGAG GTAGAGCCGG AAATAACGAC GACACTCGGT
GTCACGGCGG CGGGCGTTTA TGCTTCGGCG TATGGTATAG ATATTCGTAC CATCATGGCT
GTTCTCTCGT GCTTGGACGT TTGCATGGTT TTGATGTTCA TGCTCATAAC GGCGATTTTA
TCCAGACGTA TTCGAAGCTT TGTCGTCCGC GTTGATGAAG CTTTACTCAC TTTGGCGGAC
TATTCGCTCC AAGTGACCGG CTTGCCAGTA GACGCGACGG AGGACGAAGT GCGCGAGCAC
TTTGAAGAAT TCGGACCCGT GGCAGATGTC GTCATAGCTC GTTCGTATGG TGCAGTTTTA
CGCGCGCGAA TACATCGTGC GCGCTTATTT AAACGTGCAG AACAGCTGAA ATCTGAACTG
AGCGCGATCC GTCACACGCT GAAGAAGCGC GGCGAGGATT ACAACGCAAA CTACGCCTTC
AAACGCAAAT GGAAACAATT TTACAAAGCA CGCGATAAGA TGAACGCCTT GAAGGAAAAA
ATCGAGCAGC GTATGCTCGA GCCTTTTAAC ATCGTTTACG CTTTCATCAC GTTTGAACAA
GAGTCGGACA AAATGACATG TCTGGACGAG TACGCTCCGT ACTTTAATTT TTTCAGAAGT
AAGCGAACGA GATTTCGCGT CCACCCTCGC GAAGACGGCA AGGGCGACGC ATTTCACTAC
TTGCGGGTTA CCCAAGCCGT GGAAGCCTCT GATATCATGT GGGAAAATAT GGCTAACGTG
GGTTGGAAAC AATACTTCGT ACGACGGGCG ATTACAACGA TTGCTATCCT TGGTCTTCTC
ATCGGCAACA TTATCCTTGT CGTTTTTGCC ACCGACTGGG TAAAATCCGG CGGGCGATTA
CTCGTCAACT GTGGCGACTT GTTCGCCTCG GGGGCTTCTG ATAACATGTA CTGCCCTGCG
ATCTGGAACA TCAACGAAAA CAGTCTCGAA ACCGATTTGG GATTGATTTC GAACGTCAAC
TTCCGAAAAC AAGTCGAGAG CTCAGACTGC AACCCATTCA TCGAGTCGGC AATGTGGTCG
TATGACATGA CGCAATACTC GCCATATTAC GAAGCCGTGG GCGCGTACTC TTCATTAAGC
GTCTCTAGCG CTAACGCTTA TGGATATTCT GGTGGAGTAT GGAATGGAGG CATGGACGCC
TCTACGAAGG CCGATGAGTG TGCGGCGAAA ATTTGTTATG ACTGCATGTG TGAGAGCGCG
ATACGGTCGG GACGCGTGAC AACCATCTGC AAAGACTACT TTTACGATCA ACTACTAATC
TTTGGTTTCG AAGTCGGTCG GCTGTCCGTC GTCGCATTGA CGTCTATGCT CTTGTTGTGG
ACGTCGGGAA AGTTTGCGCT CTTCGAGCGC CACAAGACTG TATCTGCCAC GGAAAGAACG
ACGAGTCGTT TTGCGTTCTT CACCCTCATC ACGAACGCCC TGATTCTTCC GTTACTCGTT
AACATCGAAA TCAAGGGTTT CTCAGGCTTC CCAATCTTGT TCAGAGGAAG CTACGAAGAC
GTGACGTTGG ACTGGTACGC GCTCGTGATG AGATCGCTCA TGATTACTAC ATTCATCAAC
GCCTCGTGGT TTGGACCGTC GCGTCTTGTG CAAATGTGGC TCACGCAACT CTGGCGTTAT
TGGACGGCGC GATGGTGCAC CACGCAGTAC AACTTGAATC GTCTTTATCA ACGACCAAAA
TTTACGCTCG CTGAACGTTA CGGTCAGTGT ATGACGGTGA TATTTACCGC AATCGTTCTC
TTCGCGGCCG CTCCAGTGCT CGTTCCGGTG GCGGCGTTCT ATTGCTTCTT GGCGTACTGG
AGCGACAAGA CGATGATTTT ACGTCACTCG CGCTACCCTT CGCTGTATGA TCACAAGTTG
GCGAGACAGT TTATGTCGTA CGCCCCGCTG GCGTGCTTGG CGCACTTCGC CTTCTCATCT
TGGGCGTTCT CTCAGTGGGA CATACCGTCC TACTTTCTCA CTGGGTTGGA CGGGTGGGCC
GAGGAAATTT ACGACCAACG CGACTCCGAT TTCTGGACCG TTAGAGCGAA TATGACCAAG
TACGAACAGC TCGACTTCAA AGAGCGATTC TATCGCGTCA ATGGATTGAT CCAATTGATC
CCACTCATGG CGTACTTTAT CTACTTGCTC ATCAAAGCCT TCTTCTCGAG CGTCGGTAAT
ACGGCGCTCT ACTTGCTCGG CTGTAAAGGC TTGGGAGCGA GCGAATGGAA GGCTGACGTG
CAATTCAGCA ACTTCTCTGA AGCTCGAGAC AACATGTTAA AGGATGGTGA TCATGAAACT
TCACTCTCTG GTTTGCCGAG TTACCGCGTG CAGGACAATC CGGAATACAC GGCGTTATTC
CCAGAGGCGA AACAGGTGCA CGGCGCGTTC ATCAGTCACG ACGACGCCGA GCGCGCGACC
GACGCCCCCG CGCCGGCTTC GACGCCCAAA TGGGCCAATC GAGTCGCATC ACCCCGCAAG
TGA
 
Protein sequence
MATKLNEDAV RADADAKMKS DRIETLARER QREEANASDD EDEDEDEGFR MKRKEEDAAV 
PAYTWTATLR RRVRELEEGA DVEAREAGRL RLDEALGAAD DEDAGDVSTF VNGAPTVITK
MKSRINSMNA PTGPTGVLVK TAHAEDKPMQ QVVEDSYYKA VMSEKLPAYK APRRHIARLE
NLESAVLDKE GNQLPLWKST SDDLDVLGPG ISLWFAQLKT LGKTFVYITM LASVALSHYV
YIANKSSSVE VEPEITTTLG VTAAGVYASA YGIDIRTIMA VLSCLDVCMV LMFMLITAIL
SRRIRSFVVR VDEALLTLAD YSLQVTGLPV DATEDEVREH FEEFGPVADV VIARSYGAVL
RARIHRARLF KRAEQLKSEL SAIRHTLKKR GEDYNANYAF KRKWKQFYKA RDKMNALKEK
IEQRMLEPFN IVYAFITFEQ ESDKMTCLDE YAPYFNFFRS KRTRFRVHPR EDGKGDAFHY
LRVTQAVEAS DIMWENMANV GWKQYFVRRA ITTIAILGLL IGNIILVVFA TDWVKSGGRL
LVNCGDLFAS GASDNMYCPA IWNINENSLE TDLGLISNVN FRKQVESSDC NPFIESAMWS
YDMTQYSPYY EAVGAYSSLS VSSANAYGYS GGVWNGGMDA STKADECAAK ICYDCMCESA
IRSGRVTTIC KDYFYDQLLI FGFEVGRLSV VALTSMLLLW TSGKFALFER HKTVSATERT
TSRFAFFTLI TNALILPLLV NIEIKGFSGF PILFRGSYED VTLDWYALVM RSLMITTFIN
ASWFGPSRLV QMWLTQLWRY WTARWCTTQY NLNRLYQRPK FTLAERYGQC MTVIFTAIVL
FAAAPVLVPV AAFYCFLAYW SDKTMILRHS RYPSLYDHKL ARQFMSYAPL ACLAHFAFSS
WAFSQWDIPS YFLTGLDGWA EEIYDQRDSD FWTVRANMTK YEQLDFKERF YRVNGLIQLI
PLMAYFIYLL IKAFFSSVGN TALYLLGCKG LGASEWKADV QFSNFSEARD NMLKDGDHET
SLSGLPSYRV QDNPEYTALF PEAKQVHGAF ISHDDAERAT DAPAPASTPK WANRVASPRK