Gene OSTLU_28022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28022 
Symbol 
ID5005874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp117340 
End bp121452 
Gene Length4113 bp 
Protein Length1370 aa 
Translation table 
GC content52% 
IMG OID640421295 
Productpredicted protein 
Protein accessionXP_001421977 
Protein GI145355456 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.647512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.726838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACTT ATTACGGATT TGAACACGCG GAGAGTGGAT TGCTGTTGCA GCGCCGACAG 
CGCGGATCGA GAAAGTTGGT GTTTTGTTCC ACAAAGTTTG GCGTCAACGA ACAGTTTGAT
GCGACCGAGG GCGAGCGCGG GAGCTTGAAG TTGATCAATC GGCGGTGCTT CGGGGCGTGG
GAGGTGATTT TGAAGGCTTT GGAGACTCCG GCCGAAGTGA GCGCGCGAAA GGAATCGTTT
CAAGAGCACT GGCACGCGGC GACGGAGAAG GCTGTGACAC ACGGCTTTGG CGCCATGGAA
AATCTCGTTC GCACGTTTCA AAAGGCGCGA GACGGACACG AGCACGAGTT GACGCATTTG
AATAATGTTA TTATTCGCGC GATGCGTCAA AAGAGAGATC ATAAGATTGC GTACAGAGTC
TTTTTCGCGT GGCGTAACTC GACGATGAAA TCAAAGTCGT ACAACATCTC GGTTCGAAGG
GCCGGTGCTT TTCTCGGCGA ACGCATCGTC ATGACGGTGC GGAACGTCTT CGATGAGTGG
AGGGAGCGAT GCGATCGCAA AAAACGCATG GTGCTAAAGG CTGATGAGCG ATATCAGAAA
ATTAGAATTA GATTTTTGCG CGAGTACTTT TTTGAGTGGA AAAACCGTCT GTCTCGCGAC
AAGTGGTGTC GATTAGCCGT GCAACGATGC TTGAAAAAGT CCGAAAGACA AATGAAACTC
GCGGTTTTGA GTGTGTGGAA AAGCGATGTT GATAAATCAA AGGTCGATCG AGAAAAGAAA
CGAAGAGCCG AGCGTATGAT GCTGGAAATG ATGAACCACA AGCTGTACTC GGCATTTTAC
AGTTGGCGAG ACGCCGTGAC GCAGAGTCGC ATGAACGATG CCAAGGCACG TCAGAGCGTC
GCGAAACTGT CGACTAGACT GATATTTAAG GCATTCGTCG AGTGGCGCTT AGTCGTAGAC
ACTGCGCGCG CGGAAGCGAT GGAAGGTAAA AAGGCCATCA CGTGGTTTTT GTGCTCGACG
CAAAGAAGAG TCTTCACGCA GTGGGTCGGC GTCGCGCGAG AAAGTAAGCG CTTACAGCGA
ATGGCGGCGA GATTTATCAC CCGGCGAACA TCCTTACAGC TGTGCAACGC ATTTTATGAG
TGGAAAGAGA TGCTACACCG AAGTTCTGTG TACAAGGTTG CGATGGAAAA AGCAATCAGA
CGTTGGCAAC AGCGACGTCT CGCCAAGGCA TTTGCGCAAT GGAGCGAAGT CGTGGAGCAC
AAAAAGTACG TGCGCGTGCA AGCTCACAAA ATGGCTGAAA AAATGCGAAT AAATTCCTCG
ACAGCGGCAC TATCAATGTG CTTCTGGGGA TGGCTTTCTA TCGCCCAAGA ATCGCGCAAC
GTGCGCGTGA CGGAGCAGTT GGCGAACGAA CTCTTGGAGC AGCGTTTGGA CATTTTCTGC
AAGATTCATG CGACTCGCAA GGCGAGAGCA GCTTTCGTGT ATTGGTACAA GTATGCCATG
AGCCAGCGAG ATCAACGATT GAAATTGACT CTCGCACTGA ATCGTATGAC GTCTAGACTC
CAGTTCACTG CATTCAACAC GTGGGTGCAA GTGGTTGAGG ACAGGAAGCG TCAACGTGAG
CTCATGCGAA CCGTTTTGAT GCGCGCATCG AATCGTCTCA TCTCATGCGC TTTCAACGCT
TGGCGCGAAG TCACCGCGGA TTCGATCGCG GCAAAAATTC ATTTGAAAAA TATTGAAAAT
ATCGTCAACC TTCAAGCGAA GAATGCCGCG AAAGAACGAC TGAAGAGAAC GTTTTTGCAG
TGGAAAGACT ACGCTGTCCA CACGCGGCGT CAGCGGCGGG TGGTGGCAAA GGCTATCACT
TCCATACGCA AGCAGGCGCA AGCTAAAGCT TTCGCACGAT GGAGAGCGTC GGCAAAAATA
TTTGCGCAGC AGCGAAGAAC GCTCGTTCGT GTGACGCAAA AGATGCAACG AAACAATCTT
CGTATGGCGT TTGATACCTG GGCAGAACGT GTCGACGAAG CCAAGGTGCA TCGAGTCATT
TTCCAGAAAG CGATTCAAAA GATGTCGCAG TGCAAGCTGT ATTACGCGTT TTCAGGATGG
GTTGCGCGCG TCGGCGAAAA GAAGACCCAA CGCGCGTTGC TTAACCGCGC CGTTTCTCGA
TTCAGAGGAA GACGTTTGCA CGTGGCGTTT TACGACTGGT CGAGCACCGC CGCGGCGCTG
CGACATCAAC GACAAGTGAT CGAGAGAGTC GTGTCAAGGA TCAGGAACAG ACTCTTGGCA
GGTGCGTTTG AGCAGTGGAA GCAACGAGCG AGCGAGCAGC GAATCGATCG ATGGAAGATG
GATCGGGCTC TCACACGTCT CACGCAACGA GTCATTTTCA CAGCTTTCAA TACTTGGCTC
GACCACGTTC AAACAAAGAA ACGTTATCAA GCAATCATCG GCAGGTTCTA CGAGAGATTT
AGAGACAGAT CTTTGCGAGG AACGTTTAAA ACGTGGGTTC ACGCCACGCA GGAGGCGAAA
ATGCGTAGGA TGGCTGACAT GAAACAAGAG CAACTTCGAT CGAACAAGTT GGCGCAAATC
TTAGGAAGCG TGAAGCGGCA ATCTCTCGGT TACGCATTCA TGCAGTGGCG CGATCATGTA
CAGGAAATCA AGCAAATGAA AGTAAATGAA AGCAAAGCAC GCGGTGTGCT GGCGCGAGCG
CGAATGCGCG CGGTAGCCCG TGCGTTCAAC CGCTGGGTAT TTTTCATCGA TGAACGACGA
CGCGTCATGG ACGCCGCTCA CATGGTGATT CTTCGAGTGA AGCAGCGTCA CTTAGCGTAT
GCCTTTGACG GTTGGTTAGA CGCCGTGCAC AAAAAGAAGC GAAATCGACT ACTCGTCGCG
AACAGTTTGA GGAAAATGAG GTACAGAATC ACAGTCAGGG CGTTTTACTC TTGGATTGAA
AGCGTGGACG AAGCTCGCGC GTCTCGTGCG TACGAACGTC GCATCGAACG CGCAGTGAAG
ATGTCTTTGA CAAAAGTGCT GAATCGAACG CTTTCTCGAG CTTTCAACGC GTGGAACTAC
AAGATGATCG AACAAAAGCG TCACAGGACG CTCGTTTCAA AGTCTTTGCA CCGCGCGCGC
AATAAAACGC TGGCGCAGGC CTTTGATGGT TGGTCCACGC ACGTGCTGAT GATACGCAGA
CAAAAAGAGC TCGTTTCTAC CAGCCTGCAA CGCATGCGCC GTCGGGCTCT CGTCAAAGCG
TTTAATAGTT GGTCGGGATA CATGAAGCAA ATACGTTCCT TCCGTGTCGT CGAACGGCGA
CTGCAAAATG TCGAGCGCGC CATAGCGCCA CTGCACGTGA CTCATTCCTC GGTTTCAGAC
ATGGTCCGAG TCAACGTAGC CATGCGCTGG GGGCTCGCTC GTAACGAGCG CATCTACAGA
AACCCAATGT TCATGGCCTG GGTGCGGTAC TCGCAAAGAA TTTCTGAGCA CAGAAATCGC
ACGGTGAAGA AGATGCACGA TATTCTTGCA GACAGAGCTC GACGAAAGTT TTTGCGCTCG
TGGCGACAGT TCACGGAAGT GATGAAGTAC CATCGATTGA AAACCGAGCA TAGGCAGAAA
CGAGTCGTTC GCAAGATTTT TAGCGAATGG AAAATGAACG CTCGATCACC CAGCGGCGCG
CAGGAGACGT ATTCGCTCAC CAAATATGAA CGCCCGACGA TTTCGACAGG TTGGGATTTT
GACAAGTCTT ACGAAGAAAA TATTCGTCTT CTCGGACGCA ATCGGGTGGC GTCGAAGGAA
TTCGCGTATG GCTCTCGATA TTCAACTCCG ACAGCATCGC CTCGAACGAT GCCGACGGTC
GTCGAACCAG ATTCGTACGA AAAGCAATAT CGTGCCGTGA TGAGCGACGT TCAAACCATG
GAAGCTGAGG TCGAAGCCCT GTCGACGGTT CGAGAAAGCT TGCAAGACCA ATTTGACCTT
CTCGCGCGCG ATGAAGCCTT GCGCGCCACG TTTGCTCGAG CGACGTCGGC TTCTGTTTCT
CTATTGGAGC AAACTAAGAG CACGTACTAT AGCGACCGGC GATTCAGCGA ACCATTCAGC
CCAGTCAAGG TGAGTCGACA ACCGCCGCGG TGA
 
Protein sequence
MKTYYGFEHA ESGLLLQRRQ RGSRKLVFCS TKFGVNEQFD ATEGERGSLK LINRRCFGAW 
EVILKALETP AEVSARKESF QEHWHAATEK AVTHGFGAME NLVRTFQKAR DGHEHELTHL
NNVIIRAMRQ KRDHKIAYRV FFAWRNSTMK SKSYNISVRR AGAFLGERIV MTVRNVFDEW
RERCDRKKRM VLKADERYQK IRIRFLREYF FEWKNRLSRD KWCRLAVQRC LKKSERQMKL
AVLSVWKSDV DKSKVDREKK RRAERMMLEM MNHKLYSAFY SWRDAVTQSR MNDAKARQSV
AKLSTRLIFK AFVEWRLVVD TARAEAMEGK KAITWFLCST QRRVFTQWVG VARESKRLQR
MAARFITRRT SLQLCNAFYE WKEMLHRSSV YKVAMEKAIR RWQQRRLAKA FAQWSEVVEH
KKYVRVQAHK MAEKMRINSS TAALSMCFWG WLSIAQESRN VRVTEQLANE LLEQRLDIFC
KIHATRKARA AFVYWYKYAM SQRDQRLKLT LALNRMTSRL QFTAFNTWVQ VVEDRKRQRE
LMRTVLMRAS NRLISCAFNA WREVTADSIA AKIHLKNIEN IVNLQAKNAA KERLKRTFLQ
WKDYAVHTRR QRRVVAKAIT SIRKQAQAKA FARWRASAKI FAQQRRTLVR VTQKMQRNNL
RMAFDTWAER VDEAKVHRVI FQKAIQKMSQ CKLYYAFSGW VARVGEKKTQ RALLNRAVSR
FRGRRLHVAF YDWSSTAAAL RHQRQVIERV VSRIRNRLLA GAFEQWKQRA SEQRIDRWKM
DRALTRLTQR VIFTAFNTWL DHVQTKKRYQ AIIGRFYERF RDRSLRGTFK TWVHATQEAK
MRRMADMKQE QLRSNKLAQI LGSVKRQSLG YAFMQWRDHV QEIKQMKVNE SKARGVLARA
RMRAVARAFN RWVFFIDERR RVMDAAHMVI LRVKQRHLAY AFDGWLDAVH KKKRNRLLVA
NSLRKMRYRI TVRAFYSWIE SVDEARASRA YERRIERAVK MSLTKVLNRT LSRAFNAWNY
KMIEQKRHRT LVSKSLHRAR NKTLAQAFDG WSTHVLMIRR QKELVSTSLQ RMRRRALVKA
FNSWSGYMKQ IRSFRVVERR LQNVERAIAP LHVTHSSVSD MVRVNVAMRW GLARNERIYR
NPMFMAWVRY SQRISEHRNR TVKKMHDILA DRARRKFLRS WRQFTEVMKY HRLKTEHRQK
RVVRKIFSEW KMNARSPSGA QETYSLTKYE RPTISTGWDF DKSYEENIRL LGRNRVASKE
FAYGSRYSTP TASPRTMPTV VEPDSYEKQY RAVMSDVQTM EAEVEALSTV RESLQDQFDL
LARDEALRAT FARATSASVS LLEQTKSTYY SDRRFSEPFS PVKVSRQPPR