Gene OSTLU_28050 details

Gene Information

Locus tag: OSTLU_28050
Symbol:
ID: 5006031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 162056
End bp: 165130
Gene Length: 3075 bp
Protein Length: 1024 aa
Translation table:
GC content: 54%
IMG OID: 640421452
Product: predicted protein
Protein accession: XP_001421858
Protein GI: 145355209
COG category: [S] Function unknown
COG ID: [COG5594] Uncharacterized integral membrane protein
TIGRFAM ID:
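
The coordinate and length fields above agree with one another, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the record states neither convention explicitly, and the function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(162056, 165130)
print(gene_len, protein_len)  # 3075 1024
```

Under these assumptions the result matches the reported Gene Length (3075 bp) and Protein Length (1024 aa).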


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0618861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0695266
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGT CGACGACGAC GTTCACGCCG CCCGCGCCGT CGACGCCGAC GACGCTGTGC 
GATTCGCAAA ACGTCGCGTG CGTGAACACG GATGTCGCGG ATGCGGAAGT TTTGAGCGCG
TTCGCCGCGT ACGTGTGCGC CGCGCTCGCG GTGTTGACCG CGTTCGGGAT CGCGAGGAAA
TACGTGCCGA TTTATACCGG TCGAGAACAC CTGCGGTCGC TGAAGACGAG CGGGTGCGCA
CCGCCGCGAT TCGACGCGAG CGCGAATCGC GGCGGCGCGC GAGAGGCGTG CTCGACCACG
TACGGATGGA TCGCGCACGT GTTGACGGTG GCAGATTCTG ACATCGTGCA CACGGCAGGA
TTGGATGCAT TAGTATTCTT AAGAATTGCG CAGTTCGGGA CGCAGTTGTT CGCGCCTTTA
GCGTTGGTTG GAGTGCTGGC GCTCGCGCCG ACACACCTGT CGAGATCGTA TTACGAGACG
ACGACGACGA GCGAGTCGTC TGCGGCGCGC GAGAGCCACG TGTTGATGCG AATGACGATC
GCAAACGTGG AACCGACCAG TTCGTTGATG TGGATGCACG TAGTGATGTT TTGGGCGTTC
ACGGCGTATG CGCTGTGGTT GCTGACGGCC CACTATCGCT CTTACGAGTT TTTACGTCAA
GTGTACGGAA CGACGACGGG CGAGTCGAAT CCTTGGCGCG CGGTGCACAT CCCGCAAACC
GTGCTACAGA AGTTTTTACA GCAAGGAATA AACACAAACA GAGAGTTCAT GACGGAGACT
ATTGAGGAAG AGGAAAGAGA GGGCGGACAG ACTCCTGCGG CGAGAACCAT GCCGACGACG
ATGCTTCTCG AAGCGTTGCT TGGGCCGAAA CGCTCGAACG GTCAGATGTC GACCGAGCAG
AAGCTCAGAC GTTTTCCTAG GGCATCGTTC ATTGAAAGCA CTCGTGGACC ATCGATGGCG
ACGCCATCAA GAGGCTATCA CTCCCACTTC GGTCCCCTTC GAGTGACGGA GACGCCGCGT
GAAGAATCGT CTCGATCGGT GTCGGAGATT TCCATGGCGT CCATGAGTGA CTTTGACGAA
ATGTCCAAAG AACATCACTT AGATAAATTG ACGAGCGATA GCGAGGATGT CGCGATAAGA
CACAATTGGT GGGAAGGGTT AGACATCGCT GAAGAGGTAT GGAGCGACCA ATTGAGGAGT
GGGAGCGACG GATTTGGATC GACCGATGCG CTCTCTGCTC CGCGGCAAAT CAAAGTTGAT
ATCGAGGGGC GATTTCCTTG CAACGACGCA TCAACGTGCG ATATTAACCC AGTACCATCG
ATAGATGATA GACGGTACGT CTCAGCCGTC GCGGACGAAG TCAGCGAAGA CGGCAGCGAA
AAGGAAGTGG TGGTCAGCGT GTTGGTGCAA AACTATTGCG TCTTGATGAC AGACGTCGGT
GGTAATCTTC CCGAGGGGGC CGCGGATCCG TGGGAAGGTG TGCGAGCGGT GGAGACATTT
TTCGGAGGCC TGTTTCCAGA CGACTTCCTA ATGGTGATTC CTTTACAGGA CTACCGCCCT
GTGGACGACT TACTCATCGA GCGCGACAAG CTCAAGAATG AAATCGAGAA ACAATCGATG
TTGCAATCAA AACGGCATGG ACACCGTCGT ATGCGTAGAG GGAGCGGTTT TCGGGATGAA
ATCACGGGTT TACGAGACAG AGTAGCAATT TTAGACCACT TGGTTGTTCA GGAGCGCACC
AGAATTCTTC AAACCGAGCC CGGGTCGAGT TGTATCGTTG CTTTCAAAAG CCAGTATGCG
GCGGCGTGCG CGGCGCAGTG CCGTATCACA TCGCGTCAGC GTGATCTTTT TGCGATCGAA
CCCGCGCCGG GACCCGACAA TCTCAATTGG CAATCGGTAT TACTTCGAAG ACGTCAGCGT
GAGATCCGAT CGATGGTGAT TTTCCCGCTC ATTCTCACCA TCATACTCAT TCCGACGGGA
ATGTTCACTG GCGTGATGTC GTCGCTATGC GTAGCAAATC AATTCGGTGC AAATCACAAC
GACGGCTTGA AGTGGTACTG CTCGAGCGAT TCCGCGCGGT ATCTACGAAT TCTAGTGCAA
GGTATTTTAC CACCCATTCT GCTGACACTC TGGGAAACGT TTGTCGTTTC GTTCGGAATG
ATGTATCTCG TTCAGGCACA GAGCAAGTAT TCTAGTCTGA GTAAAACAGA CGAGTCGTTT
GCGGAGTACT ACTTTCTGTG GGCGTTTCTG AATGTGTTTT TCGGCACTGT ATCTGGTTAC
GCCATTCAAC GATATTTGAA CGCGCTCAAC ACGAAAGGTC CGGATGCCAT GCTGCAACTT
CTCGGTACGT CGCTGCCGCT CACAAGTAAT TTCTTCCTAC TTTGGATCGT ATTCAGAGGG
GTATACCTCC CCACTCAGCG GTTGATTTTC CCTCATCCCG GAGTGCTATG CATGATCGTC
AATCGCTGGC TGTGCTGTTT GGGATGCAAC GTGACCGCTC GAGATAGAAC GATCAAATAC
AGCCCGAGAT CGGTTCGCCT TGGTCGCGAA GTCGGTGTGT TCGCCATGGT GATGATGATT
GGTCTCGTCT TTTCCACAGT CGCACCTTTG ATCACATTAC TCTGCACCGT ATTTTTCGTC
TTTAATTTTG TCATATGGCG TTATCACGTC CTATATGTGT ACGAACGCTC GTACGAAGCC
GGCGGGGCGA TGTGGACAAC GTTTTGCAAC TTGACGATTT ACGCGCTGGT CATCGCGCAG
AGCTTTTTGT CGTTTGTCCT CTTGTCCAAG CAAGCGTACG CCGGAGCACT CATTCTCTGG
ATCACTGTCT TACCGGTTCT AAGCAAAGCC AGTCACAGAT TTCGATCGAT CGCGAGCGAG
CTTCGCTGGT CCGTGCCCCT ACCACAGGCG TCCATCGCGC CTCGCGCCGA GTTCAACGCC
GAGACTTACA TGCATCCAGC GCTCAAGCGC AACTCCATGG GATGGCACCC AGAAATCGGC
AAGGTCTGGC GAGGGTACCC TAACGTCACC GTGAAAGAGA CTCGGATATT CAGAAGACGT
CAACGACATA GATGA
 
Protein sequence
MTTSTTTFTP PAPSTPTTLC DSQNVACVNT DVADAEVLSA FAAYVCAALA VLTAFGIARK 
YVPIYTGREH LRSLKTSGCA PPRFDASANR GGAREACSTT YGWIAHVLTV ADSDIVHTAG
LDALVFLRIA QFGTQLFAPL ALVGVLALAP THLSRSYYET TTTSESSAAR ESHVLMRMTI
ANVEPTSSLM WMHVVMFWAF TAYALWLLTA HYRSYEFLRQ VYGTTTGESN PWRAVHIPQT
VLQKFLQQGI NTNREFMTET IEEEEREGGQ TPAARTMPTT MLLEALLGPK RSNGQMSTEQ
KLRRFPRASF IESTRGPSMA TPSRGYHSHF GPLRVTETPR EESSRSVSEI SMASMSDFDE
MSKEHHLDKL TSDSEDVAIR HNWWEGLDIA EEVWSDQLRS GSDGFGSTDA LSAPRQIKVD
IEGRFPCNDA STCDINPVPS IDDRRYVSAV ADEVSEDGSE KEVVVSVLVQ NYCVLMTDVG
GNLPEGAADP WEGVRAVETF FGGLFPDDFL MVIPLQDYRP VDDLLIERDK LKNEIEKQSM
LQSKRHGHRR MRRGSGFRDE ITGLRDRVAI LDHLVVQERT RILQTEPGSS CIVAFKSQYA
AACAAQCRIT SRQRDLFAIE PAPGPDNLNW QSVLLRRRQR EIRSMVIFPL ILTIILIPTG
MFTGVMSSLC VANQFGANHN DGLKWYCSSD SARYLRILVQ GILPPILLTL WETFVVSFGM
MYLVQAQSKY SSLSKTDESF AEYYFLWAFL NVFFGTVSGY AIQRYLNALN TKGPDAMLQL
LGTSLPLTSN FFLLWIVFRG VYLPTQRLIF PHPGVLCMIV NRWLCCLGCN VTARDRTIKY
SPRSVRLGRE VGVFAMVMMI GLVFSTVAPL ITLLCTVFFV FNFVIWRYHV LYVYERSYEA
GGAMWTTFCN LTIYALVIAQ SFLSFVLLSK QAYAGALILW ITVLPVLSKA SHRFRSIASE
LRWSVPLPQA SIAPRAEFNA ETYMHPALKR NSMGWHPEIG KVWRGYPNVT VKETRIFRRR
QRHR
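
Both sequences are printed in space-separated blocks of 10, so they need to be joined before any analysis. The sketch below is one hedged way to do that and to spot-check the gene sequence against the protein sequence. It assumes the standard genetic code (the Translation table field in this record is blank), and the helper names `clean`, `gc_fraction`, and `translate` are illustrative, not part of any database tool.

```python
# Standard genetic code (assumed), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACGACGT CGACGACGAC GTTCACGCCG CCCGCGCCGT CGACGCCGAC GACGCTGTGC"
last_line = "CAACGACATA GATGA"

print(translate(first_line))  # MTTSTTTFTPPAPSTPTTLC
print(translate(last_line))   # QRHR*
```

The two translated fragments match the start (MTTSTTTFTP PAPSTPTTLC) and end (QRHR) of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare with the reported GC content.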