Gene OSTLU_17845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17845 
Symbol 
ID5004957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009367 
Strand
Start bp262338 
End bp265304 
Gene Length2967 bp 
Protein Length988 aa 
Translation table 
GC content58% 
IMG OID640420378 
Productpredicted protein 
Protein accessionXP_001421101 
Protein GI145353612 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0578537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAACG GAGAGGGGGC GCTGGGCGCG ATTTGGACGA CGGCGACGGC GGCGCTCGGG 
ACGCTGCTGA CGAGAGGGAG CTGGCGGACG AGCGCGACGG TCGGGACGTC GTGCGCGCTG
CTCGCGGTGA ATGGGGTGAA TTTGTTTTTA TTTTGCTCGT GGTGCACGCT GCAATTTCGG
TGGATACACG AGTCGCACGC GGGCGCGGCG GCGACGTGCG AGCGTTTGAT TTTCGCGCTG
TGTCCGCCGA CGACGACGAC GATTCTGACG TGGGCGTTCG CGAGCGCGAG CGCGGGAGCT
GAGAGAGCGG CGTTTTACGG CTCGGTCGTG TCGTTGATCA CGCATCGGAT GTTTTTGTTT
CCGTGCGCGA GCGCGTGCGC GGCGGTGACG AAGACGAACG GGCCTCCGTC GGACAAGCGC
TCGGCGTTGA CCGAGAGCGA CGCCGAGTAC GCCACCGTGG CCACGCTCGG CTTACCCGTC
GCGTTGTATT TATGGACGAA CCTCGACACA CTGTTCAAGT CCCTCGATCA CGTCTTCGCG
TGTGGGGCGT TGGTGACGGT CCCGGTGTTG TATTTAATAG CCGCCGGCAC GGAGAAGTCG
TTGTGGTGGC GCGCGCGCGG CGTCGACGTG GCGTCGAAAA GGATGGAAAC CGCCGTGCTC
CTCTTCGCAC TCACTGGATT TGCGGTGAGC GTTGAAGGTG GAATAATATT TAGTGAATTT
GCCGAGTACA TCGAAATCAT GGCTCCTTTG AATTACATCA TGGTGACGGT TTCCGTCCAC
AGCGCGCTGG CAGTTTTCGC CGCGTGCTAC GCAAACGCCG TCGGCGACGG CGTCCCGACG
GGTGCGGTGA AGGCGACGTT GGCTCTGTCT ACGTCGACCG CAATTTGTGC TTTGGGTGCA
CCTCTGTGGA TGATTCCGAT CCCCATCGCC GGATCATCGT CATTCGTCAA ATATTATTAC
GAGGACCGAG AGCCGAAGGA TTACGGCGTG TTTGCCGCGA GTTGCGTCGG ATGCTTTTCG
TGGTTTCTTT CGAAGAATTT CTGGTCACTC GATGTGCGCG TGGGAGCTTT CGACGTCAAA
CAATTGTGTG TCGCCATATT ACTCCTCGCC GTGGCGGCTT TGGCTTTGCC AGCGGTGTTG
AATACGAAAA GCGCGCGCGC GCCGACGGTT GGCGTCCTCG TCGTCTGTTA CGTCTCGGCG
CTCGCAACGA TTGAACAAAT CCTGAGTCAA GCGACGCATG ACGACGATTC TTTGATTTAT
CCACCATATT TGGTCATCGT CACTTCCATC AGCGGCTTCT TGGCGAGTCG AGGCCTCGTC
ATCAGCGGGA GAATCAGTCG CGAATTTGGC TGGGTGATGC AGAGCGTGTG CGGCGCAAAG
CTCTCCATGC TCTTTGTTCG CGGTTTGAAG GAAATGTTCA GCGTCTTGGT GGTCGTGCTC
GCCATCACCG CTCCGCACGC GATGTCACGG CGAATGACTC AATTATCTCC GGGCGCGAGC
GTTGGTTATT GCGTCGCTTT GGTGTTCTCT TTGGTATTCG CTCGGTTTGC GATGTTTGAC
GTTATATTTG AACTCTCGGG TCACCGACCG ACGGACGCGA CGCTCTTCGG TGGACTCCTC
CTGATCACGG GGGCGAGCTT GGCGTCCGTG GTAACGCGAC AAAGTTACGG TGATGACATG
TTTAGCAAAC GATTGATGAT GCTCTTGAGT TTCTGTGGCG TCTTTCTCAT CACCTTCCGA
CCGCCGATGC CTTGGAAGGG CGAGGTGGGC ATGTGGTATG ACGCCGAACA CGTTCCCGAC
TCTGAAGAAG ACGAAGCGAG AATGTACGGC GTGCGCGAGA ACGCGCATCA TGGATGGCCG
AGCTGGTTGC TGATGTTAGC CGCGCTCACC GCGATATTCG CCGTCTCGTC TCCACGACAA
CAGACAAAAT CAACGTCAAC GATTCGAATC GCCTTAAGCG CCGTGTGCGG TGGGAGCGTT
GGTTTATATA TGGCGCTGGA ATTCTTCGTC CAGCAAGTGG CGCTGACGGC ACTACTATTT
GTCGCGTGTG CGCTGGTGGG AGTGTTCTTG TCTTTCACGT ACAGCCCTTC GCCGAAGTCT
TCGCGCTGGT TGCCTTACGT GTATTTATCG TTCGTCTCCG TTCTCGGCTT GGCGTACGTC
ACACAAATGG GTGGCTCAGA CGAGACTGTG GACGACCATC AGGCGAGAAT GGAAGGAAAA
TTCGGTGTCG TCGGCGTTTT CGCCGGGACT TCGCTGCAAA TCGCGTTTGC TCTGAAGTTG
AGAATCAAAA CGAGCCTGGA GAGCGTGCAA CATCGACGAC GTCAAGGCGG TACTTCACCG
TTCCTTCCCG CCACTGGTCG CAGTCGTCCG GAATATTTCC GCGGTGTCGC GAGCAGAAAC
GAGCACAGAG AACTCAAGGC AAAGGCGATC GCTTGGATGC CAATCATCGG GAACATCGCC
ACGCTCACGT CCTTTCTCGC GTGCGTGGTG CTGAGCGATG AGTTGGCCGA TGGCTCCGCG
TTTTCGGTCT TCGTCCTCGC GCCCATTCTG CTTCTTCTTC ACCAAGACTC GGTGATATTT
CCTATACTGG AAGACAGCCA ACGATACGCT CCACCGCTCG CGATGATCGT GGGTAAGATG
TGCTGGGACG CCGTCGCCGC CATCCTCGCC GGCCCGAACC GAGTTCACGT TCTCGCCGCG
ACCGCCTCCA AGTTGCCGTG GATGACGCTC AACGCGTTGA GTCTGCTCCT CGCTTCGGTG
AATAGCATCA ATTTGGTGCA CTACCTCGCC ACGAGCGTTC GCACGGACGG GATGACGCTC
ATCTTGACCG CCCCGCTCGC CGTCGTGGCG CCGTTTCTTT CAAAAATTCC CTCCGTGCGC
GCGCTCGCCT TCACCAGTCT CATCGCCGTC GTCACCCAGC ACACCCTCCA GCGTCGAGCG
AAGATCGTCG GGCTGAAGTA TTTATAG
 
Protein sequence
MGNGEGALGA IWTTATAALG TLLTRGSWRT SATVGTSCAL LAVNGVNLFL FCSWCTLQFR 
WIHESHAGAA ATCERLIFAL CPPTTTTILT WAFASASAGA ERAAFYGSVV SLITHRMFLF
PCASACAAVT KTNGPPSDKR SALTESDAEY ATVATLGLPV ALYLWTNLDT LFKSLDHVFA
CGALVTVPVL YLIAAGTEKS LWWRARGVDV ASKRMETAVL LFALTGFAVS VEGGIIFSEF
AEYIEIMAPL NYIMVTVSVH SALAVFAACY ANAVGDGVPT GAVKATLALS TSTAICALGA
PLWMIPIPIA GSSSFVKYYY EDREPKDYGV FAASCVGCFS WFLSKNFWSL DVRVGAFDVK
QLCVAILLLA VAALALPAVL NTKSARAPTV GVLVVCYVSA LATIEQILSQ ATHDDDSLIY
PPYLVIVTSI SGFLASRGLV ISGRISREFG WVMQSVCGAK LSMLFVRGLK EMFSVLVVVL
AITAPHAMSR RMTQLSPGAS VGYCVALVFS LVFARFAMFD VIFELSGHRP TDATLFGGLL
LITGASLASV VTRQSYGDDM FSKRLMMLLS FCGVFLITFR PPMPWKGEVG MWYDAEHVPD
SEEDEARMYG VRENAHHGWP SWLLMLAALT AIFAVSSPRQ QTKSTSTIRI ALSAVCGGSV
GLYMALEFFV QQVALTALLF VACALVGVFL SFTYSPSPKS SRWLPYVYLS FVSVLGLAYV
TQMGGSDETV DDHQARMEGK FGVVGVFAGT SLQIAFALKL RIKTSLESVQ HRRRQGGTSP
FLPATGRSRP EYFRGVASRN EHRELKAKAI AWMPIIGNIA TLTSFLACVV LSDELADGSA
FSVFVLAPIL LLLHQDSVIF PILEDSQRYA PPLAMIVGKM CWDAVAAILA GPNRVHVLAA
TASKLPWMTL NALSLLLASV NSINLVHYLA TSVRTDGMTL ILTAPLAVVA PFLSKIPSVR
ALAFTSLIAV VTQHTLQRRA KIVGLKYL