Gene OSTLU_93862 details


Gene Information

Locus tag: OSTLU_93862
Symbol:
ID: 5005901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 197017
End bp: 198738
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table:
GC content: 60%
IMG OID: 640421322
Product: predicted protein
Protein accession: XP_001422002
Protein GI: 145355506
COG category:
COG ID:
TIGRFAM ID:
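
The length fields above follow from the coordinates. The sketch below is a consistency check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 197017
end_bp = 198738

# Inclusive coordinates span (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1722
print(protein_length)  # 573
```

Both values match the Gene Length and Protein Length fields listed in the record.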


Plasmid Coverage information

Num covering plasmid clones: 56
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.213448
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACC GCGAGCTCCA TCTCGCGCTC GAAAGAGTCG GCCCGCGGTT TACGCTCGGT 
GACTTACTCC TCTCCGAAGG CCCCGCGCTG GATGATTCCG CGGACGCTCT GCGCCAACGC
GTCATCTTTG CCGTCCATCT GAGCGGTGGT CATCTCGATG TCGACGAAGA AACCCGAGAA
GTCATCTACG TAGTCCCCGT GCGCGTGCGC GCGGCCATCT TAGCCCGTGA TGTGAGCGAA
CGCGCGCGCC GAGCGCGTCG GGCGGCGTGG CGAACGACGA TGCGCGCGGT GAGGGCCGCG
TTTGGGTCTT TCCTCGTCGT CTCGGCGGCG TTGACGGCGC TCGCAGTCAT CGCGTTGGTC
GTCATCGCGC TTTCTCGAGG CCACCAGGGA CGAGGAGGAG GTGGAGGGTC GACGCCGGTG
TTACCCACGT ACGTCGGGCA CGGACGAGGG GTGAACGCGG ATTTCTGGTA CTATCTGTGG
ATGCGCGATT TGATTGAACT AGCGTATTGG AACGACGTTA TGCGATTCGA ACGAGCGCGC
GCGTTTGATC GCGCGCATGG CGTCGCGGAG GGCGTTCCAG TGAGTAAACC GCGCGTTGGC
GGCAGCGGAC ACGGTGGCGG TGGCGGTGGC GGTGACGATG GAGGAAGTGG GGATCCCGCA
GGGCGAGCGA ACGCGCCGCC ACCGACAAAC GTCGGCCCGC GAAGAGGCGG CGACGGCGTT
GATGGCGATG AAGAAGAGGA AGAAGACGAT TGGCTCGACC GCGATCGTGA GTTGTCATTT
TTTGAAAGCA TATTCGCGTT TGTTTTCGGA AGAGGTGATC CGAACGATAA TTTGGAAACC
AGGCGATGGC GCGCTGTCGG GGCGCTTTTG CGAGTGAACA AAGGCTGCGT GTTCGCCGAG
CAAGTGGCGC CGTTTTTAGA CACGTATCTC CTCACGAAAG AGGATCATAG CGAAGTTCGA
AACGGGTTAT TTGCCGTGGT TTTCGACCTT GTCGCGCACG CGCGACGACT TTTCAGAAGG
AAGGCGGATG CCGAGCGAGA CGTTCGAAGG ATGCACGAGG GCTACATGCT CGAGGTCTTG
ACGCGTTTCG GTGGATTTGC CGAAGCTTCG GATGCTGGAG AGCTCATATA CGTGTTCCCA
TCTTTGCAAG TGACCGCTCG CGCGGTCGAA CCCTCTTCGT CGCGACTGAT GCCGTCGCGC
AGCGTTCAGG CTCCGACGCC GCCGCCAATT TACGAGCGAG TCCGACCTCT GTGGGAGAGC
GGTGCGAAGA TGCCGCTTGT TGTCGCCCTG GGATTTTTGA ACGTAGCACT TATTTTCATC
TTCCGTGCCG CCGGTGGTAT GGACTTCAAG CCTCCACGCC AAAGTCAACT TCCACGAAGA
GCCGAACAAA CGATGGGACG TCGTGCGGGA AGATTCCGTG ACGCCGCGCC GACGACGAGC
ACCGTCCCGA TCGATGACTA CGGCGAACCA CCGTTGGTGA TTCTTATCCT CGAGCTGTTC
CCCAAACTGC TCAAACTCCT CATGCCGCTC TTGCTCGTGT ACGCGGGCAT TTTCTTCCTC
GTGCCGACGT CACGAGCGCT GTACATCGCC GTCGAGAATC GCCAAATCAA GCGACGAAAC
GACGTGAGAA AGAGGCGTGC GCAGGAAATA TTATCAACTA GCGTCCAAAT GATCGACAAG
CAGTCGCGCG CAAGGGGCAA GCAAGCTTTA GAAGTGGTGT AA
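
The GC content reported for this gene can be recomputed from the sequence text. The sketch below assumes GC content means the fraction of G and C bases over all bases, and that the spaces and line breaks in the listing are only layout. The function name gc_content is illustrative, not taken from the database.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in the
    listing) is ignored, and letter case does not matter.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence above, used as a small sample.
first_line = "ATGAGCGACC GCGAGCTCCA TCTCGCGCTC GAAAGAGTCG GCCCGCGGTT TACGCTCGGT"
print(f"{gc_content(first_line):.0%}")
```

Passing the whole gene sequence as one string gives the figure to compare with the GC content field of the record.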
 
Protein sequence
MSDRELHLAL ERVGPRFTLG DLLLSEGPAL DDSADALRQR VIFAVHLSGG HLDVDEETRE 
VIYVVPVRVR AAILARDVSE RARRARRAAW RTTMRAVRAA FGSFLVVSAA LTALAVIALV
VIALSRGHQG RGGGGGSTPV LPTYVGHGRG VNADFWYYLW MRDLIELAYW NDVMRFERAR
AFDRAHGVAE GVPVSKPRVG GSGHGGGGGG GDDGGSGDPA GRANAPPPTN VGPRRGGDGV
DGDEEEEEDD WLDRDRELSF FESIFAFVFG RGDPNDNLET RRWRAVGALL RVNKGCVFAE
QVAPFLDTYL LTKEDHSEVR NGLFAVVFDL VAHARRLFRR KADAERDVRR MHEGYMLEVL
TRFGGFAEAS DAGELIYVFP SLQVTARAVE PSSSRLMPSR SVQAPTPPPI YERVRPLWES
GAKMPLVVAL GFLNVALIFI FRAAGGMDFK PPRQSQLPRR AEQTMGRRAG RFRDAAPTTS
TVPIDDYGEP PLVILILELF PKLLKLLMPL LLVYAGIFFL VPTSRALYIA VENRQIKRRN
DVRKRRAQEI LSTSVQMIDK QSRARGKQAL EVV
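
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Translation table field of this record is empty, so the sketch below assumes the standard genetic code and reads the sequence in frame from its first base. CODON_TABLE and translate are illustrative names.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves the
# translation table blank). Codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace in the input is ignored. Codons containing letters other
    than A, C, G, T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
print(translate("ATGAGCGACC GCGAGCTCCA TCTCGCGCTC"))  # MSDRELHLAL
print(translate("GAAGTGGTGT AA"))                     # EVV
```

The two outputs agree with the start (MSDRELHLAL) and end (EVV) of the protein sequence listed above.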