Gene OSTLU_93558 details


Gene Information

Locus tag: OSTLU_93558
Symbol:
ID: 5004711
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 441193
End bp: 442293
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table:
GC content: 66%
IMG OID: 640420132
Product: predicted protein
Protein accession: XP_001420692
Protein GI: 145352733
COG category:
COG ID:
TIGRFAM ID:
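
The coordinate and length fields above are consistent with one another. The following minimal sketch shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 441193
end_bp = 442293

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1101
print(protein_length)  # 366
```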


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00340176
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGTCGA GGCTCCTTCT CTGCGCCGTC TTATTCGCCT TCGTCCCTCG ATCGCGCGCC 
GTCGACGTCG TTCGCGCGAA CGATCGCGCG CCGTCGAATC CGATATTCCC CAACGGCCTG
GACGTCTTCG GCAGGGACGC GCGGTGCCGC GCGTGTCACG CCCTGGTGAA CGCGCTCAAT
GAAAATCTCA TCCCATCCAT CGCCGCCGAG CGCGCGAAAC CGGCGTCTCG AGCGACGTAC
GGCGCGCTGG ACGCCCTCAT CGAGGCCGCG CTGGCGCCGG CGTGCCGATT GAGCGCGACC
TGGCGCGACG CGACGACGAG GAAGGCGTGC GAGAGGCTGA TGGAGACGCG AGAGGACGAC
GTCGCGGCGG CGTATCATCG ATGGATCAAA CGCGGCGGCG GATCGGCGCG GGACGGCGGT
GGGACGCGCG GAGACGGGTC GAGGGTGACG GTGGCGGAGG CGAGATCGGG GGCGTACGAT
CCGGTGGGAT GGAATTGGAA TTACGAGGTG TGCGGACGCG CGACGGGCGC GTGCAGGGAA
CAGTTGGCGA TGCACGAACT CGCGGAGTTT GACGACGACG GCGCGGGCGA CGGAGAGGCG
AGGAAGTATC GATCGGAGCA GAGACCGGCG GACGGGGAGA CGGTGGATGG GATGCTGAAG
GTGACGGCGG GAACGTTTCA CGAGGCGGTG GTGCGCCGAG ACGCGGACGT CGTGGCGTAC
GTGGGATTTC CAAAGTTGGA CAAGTGGGGG CACTTTTACG CGGCGGCCGC GTTGGGGAGC
GTGCGCGAGA TGTTCGCGTC GAACGAGACC GCGCGCGAGG GGTTTGAGAT CGCGTTCGTG
GATGGCACGC ACAACGACGT GCCGCCGCCG TACGGGAGCG ACGCGCAGGC GCCGACGGTG
GCGATGTTCG CGGCGGGGAA TAAAAATTGG CCTCGGTACA TGACGGACAT GAACGACGGG
AAGTTGACCG CGTTTGAAGT CTTACAATTC ATCATGCGCA CGTCGGCGAA GCCGTCGACG
GTGCAACACG CGCATTGGCT CACGCAGTCG CTTTCGCAAA ACGCGCTTCA TCGTAGAATT
TGGGACGACG ACGAGTTGTG A
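
The GC content reported in the record can in principle be recomputed from the gene sequence. The sketch below is one way to do that, assuming the sequence is pasted in as a string with the 10-base blocks separated by whitespace. The function name `gc_content` is hypothetical, and the example call uses only the first block of the sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring any whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First 10-base block of the gene sequence above: 5 of 10 bases are G or C.
first_block = "ATGTCGTCGA"
print(gc_content(first_block))  # 0.5
```

Passing the full gene sequence to the same function gives the value for the whole gene, which can then be compared with the GC content field.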
 
Protein sequence
MSSRLLLCAV LFAFVPRSRA VDVVRANDRA PSNPIFPNGL DVFGRDARCR ACHALVNALN 
ENLIPSIAAE RAKPASRATY GALDALIEAA LAPACRLSAT WRDATTRKAC ERLMETREDD
VAAAYHRWIK RGGGSARDGG GTRGDGSRVT VAEARSGAYD PVGWNWNYEV CGRATGACRE
QLAMHELAEF DDDGAGDGEA RKYRSEQRPA DGETVDGMLK VTAGTFHEAV VRRDADVVAY
VGFPKLDKWG HFYAAAALGS VREMFASNET AREGFEIAFV DGTHNDVPPP YGSDAQAPTV
AMFAAGNKNW PRYMTDMNDG KLTAFEVLQF IMRTSAKPST VQHAHWLTQS LSQNALHRRI
WDDDEL
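
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Because the Translation table field of the record is empty, the sketch below assumes the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative. The two example calls use the first three and the last three blocks of the gene sequence.

```python
from itertools import product

# Assumption: standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame from its first base, stopping at a stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the gene: matches the first ten residues of the protein.
print(translate("ATGTCGTCGA GGCTCCTTCT CTGCGCCGTC"))  # MSSRLLLCAV
# End of the gene: the final TGA codon is a stop and is not emitted.
print(translate("TGGGACGACG ACGAGTTGTG A"))  # WDDDEL
```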