Gene OSTLU_42820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42820 
Symbol 
ID5003131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp602313 
End bp603611 
Gene Length1299 bp 
Protein Length432 aa 
Translation table 
GC content56% 
IMG OID640418552 
Productpredicted protein 
Protein accessionXP_001419216 
Protein GI145349598 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.246102 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGATG AGGCGAGCAA GACGGCGAAG ATTGAGAGCG CCGCCGAGTC GAAAATCGAC 
TTGTATAGGG ATCGGTTCTT GTTGTTGCAG CAGCGCTTGT CGCGGTCAAG GCAGTTCGCG
AAGCCGACGT TGCAGACGTC CTCGACCGAG GGCATCGCGG AGTTGACGTC GATTCAGTCG
TTGTTAGGGG TGAGCAAGGA GACGAAGTTT ATCATGGGAT GTTTGAGTCA ATTGGAGGAC
GAACGGTTTT ACATGGAGGA TTTGACGGGG ACGGTGCGAG TTGATTTGAC GGCGTGCGAG
CGCAGCGCGG GGTTGTTCAC GGAGAATTGC ATCGTGATCG CGCAGGGTGA GGTGCGACCG
GACGGGGTGT TTGAGGTCAT GGCGTTGACG TTTCCCCCGG CGGAGACGCG AGCGGCGACG
AGAAACGCGA CGAACGCTTT GGATTTCTTC GGCGCGGGGC ACATCTTGCG ACCGAACGAG
CTGGAGGAGC TCGAAGAGAA GGAACTTGAA CGCGTCGGTG AGAGGTTTAT CGTGTTGTCG
GACGTTTGGC TCGACCAACC ACGCACTTTT GATAGATTGG CAAAAATGTT TGACGCGTTT
GACTCGCAAG AGGAAGACGT GCCGGGATTG ATTGTCTTCA TGGGAGATTT CACATCGAAA
CCGTTCGGCC CGACGCACTA CGACTTTCGC GCGTATACCG AAGGCTTTGA CAAACTCGCG
GAGTTGCTGG AGGAATATCC GCGCTTGCGA CAGGAAAGTC GGTTCGTCTT CATCCCTGGT
CCGGGCGATC CCGGTTTGAA CGCCGCGCTT CCGCGCCCGG GATTGCAATC ATCCGTCATC
GGTTCTCTGC TGGAGAAGGT TCCGCGCGCG CAATTCGCGA GTAACCCGGC AAAAATTAGA
TACTTTTCGC AAGATCTCGT GTTCTTTCGC GACGACTTGC AGGCGAAGAT GCGCAGAAAC
TGTTTGATGC CGCCCGACGA CGATAAACTG CCGGAAATCG CGCCCGGCGA CGAGTGGGCG
AACCGCCCGG TGTTCAAGCA TCTCGCGGCT ACCATGGTGC AGCAGGCGCA CTTATGCCCG
TTACCGATCA CACAAAGCCC GATTTATTGG GAATACGACC ACTCGTTGTG GTTGTATCCG
GCGCCAAACT GTATTTTCTT AGGCGATCGA ACCGAGCAAC AATCGCTGGC CAACTTTGAG
GAGACTTCGC TCGCGAATCC CGGATGCTTT TCCGACGACG GGTCGTTCTT GCTGTACATC
CCCGCCACGG GTGAGTGTTC GTTCTCAGCC GTGCCGTGA
 
Protein sequence
MFDEASKTAK IESAAESKID LYRDRFLLLQ QRLSRSRQFA KPTLQTSSTE GIAELTSIQS 
LLGVSKETKF IMGCLSQLED ERFYMEDLTG TVRVDLTACE RSAGLFTENC IVIAQGEVRP
DGVFEVMALT FPPAETRAAT RNATNALDFF GAGHILRPNE LEELEEKELE RVGERFIVLS
DVWLDQPRTF DRLAKMFDAF DSQEEDVPGL IVFMGDFTSK PFGPTHYDFR AYTEGFDKLA
ELLEEYPRLR QESRFVFIPG PGDPGLNAAL PRPGLQSSVI GSLLEKVPRA QFASNPAKIR
YFSQDLVFFR DDLQAKMRRN CLMPPDDDKL PEIAPGDEWA NRPVFKHLAA TMVQQAHLCP
LPITQSPIYW EYDHSLWLYP APNCIFLGDR TEQQSLANFE ETSLANPGCF SDDGSFLLYI
PATGECSFSA VP