Gene OSTLU_18717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18717 
Symbol 
ID5006194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp237704 
End bp239437 
Gene Length1734 bp 
Protein Length577 aa 
Translation table 
GC content66% 
IMG OID640421615 
Productpredicted protein 
Protein accessionXP_001422241 
Protein GI145356022 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.160009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.144375 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCTG GGGATCCGGT GAGACGCGCG GCGGCGACGC GGGCGCGGGC GAAGTGCGGG 
ATGGGAGGGA GAGACGACGA GGCACGGGGG GCGGGCGGGA CGCGCGAGGC GACGGCGGCG
ACGACGGGGC GGGGGATCAG GACGCGGTAT CAAGGGGATT GGTTGGACGT GGGAGGGAAA
GATGGGACGT GCGCGGTGGA GGAGTGCGTG GAGCGGTGGC GCGCGGCGCC GTTTGAGCCG
TACGCGGGGA GAGGGAAGGA TTGCGCGTAC GCGGTGTGCG CGGGGGAGAC GACGAGCGAA
CGCGCGAGAC GCGGCGCGGC GACGGTGATG CGAGAGATTT CTCGCGAGTA CGCGGCGCTG
GGGCTGGGGA CGCACGCATC GATGGTCGAA GGCGACGACG CGGGCTTGTT CGACGGCGCC
GAGGACGGTG CGAGCGCGTC GACGCTTTCG GCGTTGGAGC GGTATTCGAG AACGTTGAGC
GCGGCGGCGC AGCGCGCGGC TGAGCTCGGC GCGCTCGGTC CGCGAATGTG CGTGATTTAC
GTCGTCATTC CCGACGAAGT CGAAGATATC GACGCCTTGA CCGTGATCGC GCTCGCATCG
CACGTCATGA GCGCCGCGAC GGCGTCGGTG GCGCGACGCT TCTTGTCCGT CTCCGTGCAA
GCGATTCCGT CGTCGTGGTG CGAAGACGCG TACTCTTGGT CATCCACGAG CGTGCGAGCG
ATGGCGTTCA ACGTCTTCAC TAAACTCACC AGACCTACGG TGACGCGGGG ATTGTCGCTC
CCGAACGATC ATCTCAACGA TGCCCCGCGA AGTTTTCGCA CCTCCGAGAC GACGGACGGG
CGCGAGGGCG TCTCGGGCGC CGCCGTCGTC GGCAGGCGAA CGCCGCACCC GTTGTTTAGG
ATTCCAGAGA CGTCGAGCGA GCGCAATCTC CCACCGTTTC CCGCGTACGA GCCGCTATAC
ACGTTGACGC CCGAAATCGA CGACGACGAC TCGCGTGTTC GTGGCTTGCA CTGCGCGTAC
GTCGTGGCGG CGTCGCGATG GATCGTCGCC TCGTGGTCCG ACTCACACGG CGAATTCCTC
ACGCTCGAGG CCGAACCGTT CGCGGACGAG GCCGACTGCC TCGCGACTGG CTTGCGATGG
CTCATAGATC GAACGAGCGC ACTCGCCGAG CAGTTGGCGT TCGCGTACGG CGCGAAAGCG
AATGAGCGAT TAAAGTTTCA GCGCGCGGCG ATTTGTCGCC TGGGTGCGCC GTCATCGGCG
GAGCGCGCGG CGCTGGAAAC AGCGTGCAAA GCTGCGCCCG CACCGCTCGA TCGCGATTTC
TTGACGCGTC TGGAGATGAC ATGCTTTGAG CCCGACTCCG TGCCCGCGCG CATCGCCGCC
CTGGTGCCGA CCACGGCGCG CGATGTGTCA TTCGTCGCGG AGAGCGTCGA GGAGACCAAG
ACCGTCAAGA CGTACGCGGC GCCGCCGTGT GCGCAGAAAT CATTCCGCGT CGCATTCAAC
ACCAACGTAT CGAGCGCGCG CGTGCGAGCG CTCGACGCCA CGACAGATAT GAACACGCTG
AAGCATTTGG CCGCGCTATA CGCCACGCGT CTGTCGCAGC TCGGAATGAT GTGCTTGAGC
GAAAACATCG CGGATGAATT TGGAAGCATT CGCGCGCCGC TCCCGCTGCA CGCCGAGGTG
TGCGTGCGGT TCGCCAGCAC GCTCCAGACG CTCGAGGCGA ACGGGGAGCA GTAG
 
Protein sequence
MSAGDPVRRA AATRARAKCG MGGRDDEARG AGGTREATAA TTGRGIRTRY QGDWLDVGGK 
DGTCAVEECV ERWRAAPFEP YAGRGKDCAY AVCAGETTSE RARRGAATVM REISREYAAL
GLGTHASMVE GDDAGLFDGA EDGASASTLS ALERYSRTLS AAAQRAAELG ALGPRMCVIY
VVIPDEVEDI DALTVIALAS HVMSAATASV ARRFLSVSVQ AIPSSWCEDA YSWSSTSVRA
MAFNVFTKLT RPTVTRGLSL PNDHLNDAPR SFRTSETTDG REGVSGAAVV GRRTPHPLFR
IPETSSERNL PPFPAYEPLY TLTPEIDDDD SRVRGLHCAY VVAASRWIVA SWSDSHGEFL
TLEAEPFADE ADCLATGLRW LIDRTSALAE QLAFAYGAKA NERLKFQRAA ICRLGAPSSA
ERAALETACK AAPAPLDRDF LTRLEMTCFE PDSVPARIAA LVPTTARDVS FVAESVEETK
TVKTYAAPPC AQKSFRVAFN TNVSSARVRA LDATTDMNTL KHLAALYATR LSQLGMMCLS
ENIADEFGSI RAPLPLHAEV CVRFASTLQT LEANGEQ