Gene OSTLU_95034 details


Gene Information

Locus tag: OSTLU_95034
Symbol:
ID: 5004730
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 498440
End bp: 500101
Gene Length: 1662 bp
Protein Length: 505 aa
Translation table:
GC content: 60%
IMG OID: 640420151
Product: predicted protein
Protein accession: XP_001420711
Protein GI: 145352772
COG category: [R] General function prediction only
COG ID: [COG0679] Predicted permeases
TIGRFAM ID:
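
The coordinates and lengths above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 498440
end_bp = 500101
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: three bases per amino acid plus one stop codon.
coding_length_bp = protein_length_aa * 3 + 3

# Bases in the gene span that are not accounted for by the coding sequence.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 1662
print(coding_length_bp)  # 1518
print(non_coding_bp)     # 144
```

Under these assumptions the span matches the listed gene length of 1662 bp, and the 144 bp left over is consistent with the gene being marked as spliced.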


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.234943
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGATA GCGCAGTCAT TCTCCATCGC GTCGCTCTGA ACGCGGCGTT CATCGCCGTC 
GGCTACGCGC TGCGCGCGCT GAAAATACTT ACCCTCGAAG ATGGCAAAAC CGTCTTTCGC
TTTGCCACGA ACGTGACGCT TCCAGCGCTG TTACTGTACG TCATGACGCG CGCGAGCGCG
GTGGGCGCGT CCAGTGGATT GAGTGCGACC GTATCCACGG TGAGCGCGAT CATTCCCGCG
TGCTCGTTGC TCGTGGGCGT GGGATGTTCG TTCGGGGCGT ACCTCGCGTA TCGCAAGTCG
CCGGCGCGCG CGAGAGGGTT AGCCGTCGGG AGCGCGACGG GGGTGAATTT AGGAATGTTT
GCGTACCCGT TCGTGGAAGC GATATGGGGG GTGCCTGGTC TGGCGCTATG CGCGATGTGG
GACGCTCCGA ACGCGGTGGT GGTGTTCGGC GCGGCGAAGG CTATTTTCGC CGCCGAGCAA
AAGCACGGCG ACGCGTCTCG AGCCGTGCAC GACGACGGCG GGATTTACGA CGGGGAGTGG
TTAGATAAGA AAAAGCACGG GTACGGGTGT TACAAGTACC CGAGCGGGGC GACGTACGAA
GGGCAGTGGA AGAATAACGT CAAGGATGGC TTGGGGGTGT ACACGTACGG CAAGGGCGGT
TCGTACGCCG GCGAGTTCAA GCGCGGTCGG TTCGACGGGA CGGGGATTCG CGTGCTGCGC
ACGGGCGCTG TCAAGGCGGG ATTATGGGAA GACAACGAGT TTGTCGAGGC TACGACGGTA
AAGGATTGCG AAGGGACGAT TGCGGCGACG AACGCGGCGG TATCGACGGC TCGCAAAGCC
GCCGAGGCGA GCAACCTCAC GATGAAGGAT TTATTTTGGA AGGTGGCGAA ATTTCCACCG
GTGATCGCGG TGACATTGGC CAGTATGATG AACTTCACTG GTATTGCACT CCCTCAGACT
GCTTCGCAGC TCGTCGTGCC GCTGGCGAAC GCGAATAACC CGATCGTGTT GCTCACGCTC
GGCGTCCTTT TCAAGCCAGC GATGGACCGA ATGCAAGTGC AAGCGGTGGC TAAATTTATC
GGCGTGAAAT ACGGTCTTGG GTTGCTATCG GCGGCGGTAT GTACATTATT CATTCCACAA
AGTTTCGCGC TCGCGCGAGG CGTCATAGCC GCGCTGTGCG TGATGCCTGT GCCGTCCATC
GTCATGCAAT ACTCGGCCGA GCACGAAAAC GACGGCCAAC TTGCGGCGGC GATTGTCTTG
AGCTCGCAAG CCATGACGCT CGTTTTGATC TGTTGTTTCG CCGTCGTCGC GCCGTACATC
GTGAGTATTG ATAAATTCGT GTTTTCGGGC GCCCTTCTTG CTGGCGCCGT TGCCGTGGGC
GTAGCGAGCG CCGTCGGCGT CCTGGCGTTG AAGCCGTCTC GGGTAGACAA GGCGAAGAGT
CCGGGCGTGG CGGTGGCACC GACGGCGAGC ATGCGATCGA CATCCCCACG AAACATCGCT
CATCGGCGAC AACGGCGCGA TGTTACGGTA AACATCGCCG TTAATGCCCC TCTTCGTGCG
CTGACTGCTC GAGGTTCTTC CTTTACGTCG GCGAAGCGGT CGGCGCCGCA AGCGCCTTTG
CGAGCGGCTC TGAGCGGCGG TGTAAAATTA GTAGGATTGT GA
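
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence laid out like the one above. This is a minimal sketch: the helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its own percentage was derived or rounded. Only the first row of the gene sequence is used as sample input; pasting in all rows gives the value for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied verbatim (sample input only).
first_row = "ATGACGGATA GCGCAGTCAT TCTCCATCGC GTCGCTCTGA ACGCGGCGTT CATCGCCGTC"

print(len(clean_sequence(first_row)))  # 60 bases per row
print(round(gc_content(first_row), 1))
```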
 
Protein sequence
MTDSAVILHR VALNAAFIAV GYALRALKIL TLEDGKTVFR FATNVTLPAL LLYVMTRASA 
VGASSGLSAT VSTVSAIIPA CSLLVGVGCS FGAYLAYRKS PARARGLAVG SATGVNLGMF
AYPFVEAIWG VPGLALCAMW DAPNAVVVFG AAKAIFAAEQ KHGDASRAVH DDGGIYDGEW
LDKKKHGYGC YKYPSGATYE GQWKNNVKDG LGVYTYGKGG SYAGEFKRGR FDGTGIRVLR
TGAVKAGLWE DNEFVEATTV KDCEGTIAAT NAAVSTARKA AEASNLTMKD LFWKVAKFPP
VIAVTLASMM NFTGIALPQT ASQLVVPLAN ANNPIVLLTL GVLFKPAMDR MQVQAVAKFI
GVKYGLGLLS AAVCTLFIPQ SFALARGVIA ALCVMPVPSI VMQYSAEHEN DGQLAAAIVL
SSQAMTLVLI CCFAVVAPYI VSIDKFVFSG ALLAGAVAVG VASAVGVLAL KPSRVDKAKS
PGVARSAPQA PLRAALSGGV KLVGL