Gene OSTLU_18125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18125 
Symbol 
ID5005336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp425731 
End bp427800 
Gene Length2070 bp 
Protein Length689 aa 
Translation table 
GC content51% 
IMG OID640420757 
Productpredicted protein 
Protein accessionXP_001421289 
Protein GI145354009 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0142809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.181943 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCACC GATCGTACCG TGTATGCAAT GGACGCACGC TCGCACTCAT CGCGCTCGTC 
GTGCTTGTCG CACGCACGCA GCGCGTCTTC GCTTCGCCGA CCGACGAGCG AAGATTCTGG
ACGCCCACTG TCGTCGCCGT GGACGACGGC GTCGGTTCGA AGATCCGACG CATGGGAAAC
GATGTGGAGA AATCGAAGTC TTTCGTGTTT GAGAACACGG TAGCGGAATC GCTGACAAGA
GATGGCGTCG TCGCGGTGCG AGTGCGAGGC TTGAACGAAG CAAAGACACA GAGCGCAAAA
CTCTTCATGG ATTGCGCGCG GGTTACGGAT AATGTTGCAG TAGAAGAGTA CGACGATGGA
TCCAGGCGCG TGACGCTCGC CACGACGACG AAGCAGAGGG GGAGTGGTTT CGGGGAACCC
ACGTCCAAAG CATGTAAGAA GTTCGAGGTC ACGACGAAAA TGCTTCGCGC GAAAGTTTCT
GTGGCCGAAC AAGCAATCAC GGACGTGTTT CAGAGCAAGT TTTCAGATCT GAGTGAATTT
GCGAACGGGG TACTGATGCA GGGAGAGAAC GGCGAGACCT ATAAAACTTT CTCGGAAGTA
TTGAAAGACT GCGAGCACCT TGAACATTTT CACGGCTACA CGATTTCACC ACAACAAAAA
TTGAAGAATA GTAAAGTGCG GACGATCGAA GAGCACACGG ATCAAGGTCT ACTCATCGCA
TTCGTCCCGG CCATAATTGT CGATGCAGTG ACGGGAAGGC GCGATAAGTT TTCCTCAACC
GGCGATTTTT ATATGACGCT TCCTGGGCAT CGCAAGGTGC TTCTCCATTT TGACACGCTT
CCAGACGATG TCGTTATTTT TATGCTGGGT CAAGCAATCG AGCAACAAAT CACTCCCAAA
CTTGCAAACG GCCTTGAGCT TCACGCTGTA CCACACGCAA TGGATATGCC CAATCTTAAA
TCTAATCAGT TCAGATTTTG GTATGGGCGA ATGTTCCTCC CACCTGAGAA TGCCTTGGAC
GAAAGCCATG GTCTCTCTTT TGGGACAGTG CGCGCTAACA TCATCGAAGA TTTCGCACGT
GGGCTGAAGA CATCCGTGGG ATGTGGTCGT GAGCTCCTCT CCGTGGCAGG GAGCGGTTCA
TGCGCTAGCA ACCAGATATG GTGCTGGATG CGATGCATGA ATCACTCAGC GCCGGCGAGC
AATCCACTCA CATGCAGGAC GGGTCATAGT GTCCAATGTT TGAGCCAAAG ACTGGAAGTC
TGGACGGATG CCGACAGTCA TGGTGATTAT AATCCTGGAT GCACCGACGA GACCAACGAG
ACAAAGCCCG TCACCGATCC ACCGACCATC CCAGCTCGAG CAGCCTCTTG TACAGATGGA
AACGCCTGGG AAACTTTTTT GAACCACTCC GAATATGAAA ACAGCTTGGA GCTGTTACAA
GATAAGTTAT ATTTACTCTG GACTGTTCAA AATGGTAAGT TAAAAGCCCG AGTTGCCTTC
AATGGCAAGA TTGGTTGGTT TGCTCTCGGC ATCAAAAACC TAGGAGGGAA ACACAACGGG
ATGAACGGGG CGAACATCGT CATGGGCGTC CACGACCCTC ATCCAATGGG TCACGACAGC
AATTTCGGTT CTCCTTATGT TGGAAAGAGC AGTGCGAAAC AATACAAAAT CCACGATCAT
TCTTCTGCTT TTCGACATTG GAATGACACT GCTGACGTTG CCTCACCCTT CGTGAGTTCG
ACGACGTTCA ATTCATGTTT CAGTTGTATG CGTTTCGAGA CGGCCTCGAT ACGGGGCGAG
GCTCTGAATC TCACCTCTGG TATTAATGAA CTCATCTGGG CTGCTCATGA TGGCACTTAC
CTCAAAGGCT ACCATGAAGT ACAAGGATTC GATCGGAGAG ATGCACGCGG GCATCTCACG
CTTGACCTGA CGCGAAATTA CGGTCGTGTA ACGTGCGGCT TCTCAAGAGG CTGGGCTGCG
GCGAGTCAAA CATGTGCTTT TTCGGCAGCA AGCACTCCTT CAGCGATAGT TTCTATATCC
ATTGCTGTCA TTGCAATCCT CGTTATGTGA
 
Protein sequence
MRHRSYRVCN GRTLALIALV VLVARTQRVF ASPTDERRFW TPTVVAVDDG VGSKIRRMGN 
DVEKSKSFVF ENTVAESLTR DGVVAVRVRG LNEAKTQSAK LFMDCARVTD NVAVEEYDDG
SRRVTLATTT KQRGSGFGEP TSKACKKFEV TTKMLRAKVS VAEQAITDVF QSKFSDLSEF
ANGVLMQGEN GETYKTFSEV LKDCEHLEHF HGYTISPQQK LKNSKVRTIE EHTDQGLLIA
FVPAIIVDAV TGRRDKFSST GDFYMTLPGH RKVLLHFDTL PDDVVIFMLG QAIEQQITPK
LANGLELHAV PHAMDMPNLK SNQFRFWYGR MFLPPENALD ESHGLSFGTV RANIIEDFAR
GLKTSVGCGR ELLSVAGSGS CASNQIWCWM RCMNHSAPAS NPLTCRTGHS VQCLSQRLEV
WTDADSHGDY NPGCTDETNE TKPVTDPPTI PARAASCTDG NAWETFLNHS EYENSLELLQ
DKLYLLWTVQ NGKLKARVAF NGKIGWFALG IKNLGGKHNG MNGANIVMGV HDPHPMGHDS
NFGSPYVGKS SAKQYKIHDH SSAFRHWNDT ADVASPFVSS TTFNSCFSCM RFETASIRGE
ALNLTSGINE LIWAAHDGTY LKGYHEVQGF DRRDARGHLT LDLTRNYGRV TCGFSRGWAA
ASQTCAFSAA STPSAIVSIS IAVIAILVM