Gene OSTLU_15365 details


Gene Information

Locus tag: OSTLU_15365
Symbol:
ID: 5002123
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 251699
End bp: 253669
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table:
GC content: 59%
IMG OID: 640417544
Product: predicted protein
Protein accession: XP_001417717
Protein GI: 145346485
COG category:
COG ID:
TIGRFAM ID:
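
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 251699
end_bp = 253669

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1971 656
```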


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.138212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.727181
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACGA CGGAGGAGGG GTTCGACGCG AGGGCGTATC TCGTCGCGGC GCACGGCGAG 
CGAACGCGCG AAGAGCTGGC GAGAGGGGCG ACGAGGCTGG AGGCGGAGAT CGACGCGGTG
CGGGCGTCGA CGCGAATGTC GGCGGCGGAG GAGTTGCCGA CGGTGCTGGC GTGCTTGGAC
GCGATGGAGG ACGCGAGAGG GGTGCTGCGA AGAGGGCGAG AGGAGGCGGG CGAATTCGGC
GCGACGGCGG AGTTGGAGGC GCGACTGTCG CGAGCGTGGA AGAGCGCGAG GGAGAGCTTG
AGGGAGGTCT TTGCGATCGA GGAGAGACGG GAGAAGATTG CGCGCGCGCT CGAGGCGATG
GAGCGGCACG AGGACGTGTT CGGGATTCCG GGAGCGGTGC GAGAAGCGCT GTCGCGAGGA
GAGTACGCGC GCGCGGCGGA GACGTATCGT CGCGCGCGCG CGGCTTTCAG CGGCAAACGC
TCGCGCGTCT TAGATGCGGT CTTGGATGAA GTCGAAGAAA ACGTGAAATC CGCGGAGGAG
CGCATGTACG AACGCTTGTA CGTGGGAGAC CTCGACGACG CTCACGCGGA AAGAATCGTC
ACGGCGCTGC AGACGTTGAA ACTTTGCAGG CCCGCGTTGA CGTCGAGTCA GGGTGAAGTC
ACGGCTGCCG GTAATGCCGT GCATATTTAT TTAGATAGAT TAGTGGAGTA CGCGTGCGAG
GAACTGACGA ACACGGCCTC GAGCGATGAC TTTGACGTCG AGACGCTCAG TCGAGGATAT
CGCGCGCTTT TCGTTCGCGT CTGGCGCTTC GTGACTCTCA TGGACACGTG TGCGTCGTCA
TACGCGCGTG ATGCGTCGAC TAAGATCCAA TCGGTATACG TTGGTTTCAT GAAATCCAGG
TTTGACAACA GTCTCAACAA GCGGACAATT GAAACTGACG CCGAGCAAGC GAATCGTCGC
TTTGACGTCT TAATTGACAA GTGCGCGAAA ATGTCTTGCA TCGGCTTTTC GTTATCGTAT
TCATACGATA TTCTTGGCAC GCGATTAAGC TTACAACCGG ACTTGCTCGA GGCACTGCAG
CAACAATACA CGCGTTTTAG CGTCAGTTTG CGCGTGCACT TGGAACAAGC GCTCAAGTTG
GCGGCGCAAC CGCTCGCGCA AGACCAACGC CTCGAGACGA CCACACAGTC GTTCTTCCGT
GATGCACGCG TCGTATTTCA AATCACCGCG GAATACTGGC TCGATGAGCG GTTCACGCCT
TGGATGATCA ACGCCGGTTC ACGAGACGTG GGAAGTTTGA TTGACACCTT TTACGACGCT
GCTCGCTCGC TCGTGGCGCT GGCACGCGAA CTTCGGCGTG GTCCGTTGGC GTCGTTGGCC
GCTCTGAAAC AGATCGAAAA TTGGTGCGCG GTATTTTTCG ATGAGTTCAA TTTTACTGGG
ACCGGAATAG GTAACGAGGC GAATCGCGTG GCGTTCAGAG ACGACATCTC ACGAACCACG
CAAATGTTTC TGGACGAATT TGTCGGTGGC GAAATGAGCG CAATTATCGT CGCAGTGCGC
CGTTGGTTCG CAGCACCGGT GGAGAAAACG TTAGAATCAC GCCCGGAATG CGTCGATGTC
TTGCATCGGG TGCGTTCAAC GTACGAGTCT GCGACGTCCA CGGTGCCCGA ACTCGCCACG
GCCATCTCGC AAGACATCGC GGCGCGACTC GTCGACGCGT TACGCTCCGA GTTCACCTCG
AATCTCAGTC AGCTTAGACC GACGGCGAGT ACACTTCGCG TAGAGTTCGA GCTTTTGAAG
CTCGCGCTGG ACGCGCGCTC GACGAAACAG GCGCGAGACG GCGCGTCTCG TCTCGTCGAT
TTGGCGGCGC GCGTCGCCCC CGACGACGAC GCCATCGCGC GCGCGCGGGC AATCGCCAAC
GACGCTCAAA AACACAAACA CCTCCTTGTC GCGCTCGGGA GATTAGCCTA G
 
Protein sequence
MTTTEEGFDA RAYLVAAHGE RTREELARGA TRLEAEIDAV RASTRMSAAE ELPTVLACLD 
AMEDARGVLR RGREEAGEFG ATAELEARLS RAWKSARESL REVFAIEERR EKIARALEAM
ERHEDVFGIP GAVREALSRG EYARAAETYR RARAAFSGKR SRVLDAVLDE VEENVKSAEE
RMYERLYVGD LDDAHAERIV TALQTLKLCR PALTSSQGEV TAAGNAVHIY LDRLVEYACE
ELTNTASSDD FDVETLSRGY RALFVRVWRF VTLMDTCASS YARDASTKIQ SVYVGFMKSR
FDNSLNKRTI ETDAEQANRR FDVLIDKCAK MSCIGFSLSY SYDILGTRLS LQPDLLEALQ
QQYTRFSVSL RVHLEQALKL AAQPLAQDQR LETTTQSFFR DARVVFQITA EYWLDERFTP
WMINAGSRDV GSLIDTFYDA ARSLVALARE LRRGPLASLA ALKQIENWCA VFFDEFNFTG
TGIGNEANRV AFRDDISRTT QMFLDEFVGG EMSAIIVAVR RWFAAPVEKT LESRPECVDV
LHRVRSTYES ATSTVPELAT AISQDIAARL VDALRSEFTS NLSQLRPTAS TLRVEFELLK
LALDARSTKQ ARDGASRLVD LAARVAPDDD AIARARAIAN DAQKHKHLLV ALGRLA
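
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch only: the record leaves the translation table field blank, so the standard genetic code is an assumption here, and the helper names (`clean`, `gc_content`, `translate`) are hypothetical. The sequences in this record are printed in blocks of ten characters, so whitespace is stripped first.

```python
from itertools import product

# Assumption: standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_30 = "ATGACGACGA CGGAGGAGGG GTTCGACGCG"
print(translate(first_30))  # prints: MTTTEEGFDA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.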