Gene OSTLU_18664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18664 
Symbol 
ID5006156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp97918 
End bp100170 
Gene Length2253 bp 
Protein Length750 aa 
Translation table 
GC content56% 
IMG OID640421577 
Productpredicted protein 
Protein accessionXP_001422201 
Protein GI145355936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.025698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.204515 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGGAT TGCGACCGGC GCGCGCGCGC GATGGGACGA AGATGGTGGC GTCGGTGTCG 
ACGGCGGCGG AAGCGCGAGC GCGGGAGGGG ACGCGCGGGG CGGGGGCGAG CGCGGCGGCG
GGCGCGGGCG TCGCGGTCGC GGGAGGGGCC GCCGCGGTGG CGCTGGCCAG GGCGTCGAGA
GGGAAGAAGA AATCCGCGGA GGCGAAGGTT TTGGATCGTT ACGCGAAGAT TATTAACGAT
GCGTACGATA TTCCGTTCGT TCCGGAGTTC GCGGAGAGCG AGGTGTATCG CGAGGTTTGC
AAGGCGGCGT ACGCGTCGGC GATGAAGAAC TTACGACAGA ACCTCGTCGG AGCGAGCGTT
TTGGGGCATC CTCTGATCAT GAGCGATTAT CTGGACGCCA GACACGTTCC GGAAAAGAGT
GGTATCAAGG AGCAGGAGCT CGCAAAGTTT ATCGGCGACG TCGTCGGCGA AACGAGCGGG
ATTCCCGTCT TGTTACCCGG GCCGCTCGAA AGGAAGCTCT ACACGAACGG CGCGCTCACG
GGATGGACCG TGTTCGAAGA TACGTTAAAA ACGTTCAAGT TGCAACTTTT CGACCGGGAA
TTCGTCTTTG AAGTGTCGAG CGAAGAGGAT GGCATGGCGG AAGGGCGAAG GACTAAGCTC
GTCGCGAGCG GCGACGGCTA CTTAGACATC GTGCCGAATC TGTCGAATCA AACGCTCCGC
AAGATTGCGC GCGAAGAGCT CAAAGCCCCG CCACCGACAA ATCTGTTCCC GCATCTCCAA
GAAAACGTCG CCAAGGTTGC GGTGGGTATG TTATCCGAGG CGCTCACGAT GGAGGCAAAG
ATTAAAGGAT TCACCGTGGA GTTTGCGTTA GAGCCTTACG ACGACGATAC TCGAGAGGCG
TTTGCAAACA TCTCCTTTTC CGACGAGGAT CGTTTGGAGA CGCAGCGCGC GGTGAAGCAA
CTTATCGAAG TGTGCGTCGA TGAATATATG GACACTCGAG CGATGGCCAT CTCGTCAGTG
TTTCTTCCCC GACGAATCGA GCGCGACACG TACATCAACC TCCTCACCGG CTTGTTCGGT
GAGATAGGAA AAGAGCCCGT GTTGAGTAAC TTGGGTTTCA ACATTTCTAT GCGTGTGCTC
AGTCCGTCGA GCGGAGCAAA CGTGTCAGCG TCGTTCGACG ACGACGCCTT GACATTCGAC
GAAGGCGACG CCGCGAAGAA GACACCCAGC GCCCGCGACC TCCTGCAGCA AATGGGGAAC
GAAATTCGTA GAGGCGACTT TAGCAACATA GCCTCGGAGG CACGTCAAAT TTTGGGTGTG
AGCATCGACG GTGATGAAGA CCCGAGCAGA GCTGCGAGAA AAAAGGCACA AAAGCAGACT
CGAGAGGCGA TTTCGGAATT TGTCGACTAC CTCCTTCGCG ATCCCATGTA CAACGTCAAG
GCTATTCCCG ATAACATTGA GCGTTTGCTG TACATTAACT GTTTTGAACT CATAGTGGAT
ATTTTATCTA CAGTGTTGTC CGATTTTGAG CTCGACATGC TCGGGCGAAG AATCCGAATG
CTGGTGCGCC AAGCGCCGAA GCGCGACGTC AAGGACCTCA GCCGCTTCCG TCCGGACGCG
CGAGCGTTGA GGGAGATCAC CAAGGACTTT GCTGACATCC CTTCGGTTCA AGAGATCATG
GGCAACGTTT ACGCTTTTGT TCTTGCTTTC GTCGCTCAAG TAGCTTCTGA TTTTGAGGTG
ACCGTCGTCG GTCATCGACT CAACACGGGT TTGAGTAGAA GAGCCGAAGC GACTGTGATG
GAATCTGCAG GGCCGGCGGT TACGGATGCA TTGAACGAAT CACTCGTTTC AGCGATCGAA
GCTTTCGCGC AAGATGTGTT CGCCATCGGC TCTCGCACTG GTTTGAACGC TGGAGGTTCT
GCGGAGACAA GGTCGGCGGT TGCAATCGAT TTCGACAAAG AGGTCTTCGC GTTGTTTGAA
GCCAACGCGT CCAATCCGGA TGAAAAGTTT CCGTTCCCCT ACCTCAATCA AGAGCAGTTC
GCGAAGACGA TTGATATGTT CATCGAAACC CTCGTTCCGG GTGCCAAGTT GTGGGACTCT
CACAACGCGA CCGTTCAGAA AATTGCCAAA GCGGCTGATT TGAACGGCGA CGGCGTCATA
CAGTGGGCCG AGTGGTACTA CGCCGCGGGA GCCATTAACA GAGCGACCAA GATTGCGAAT
AAGAAGACTT TAGAAGAACT TAAAGGAGGA TGA
 
Protein sequence
MEGLRPARAR DGTKMVASVS TAAEARAREG TRGAGASAAA GAGVAVAGGA AAVALARASR 
GKKKSAEAKV LDRYAKIIND AYDIPFVPEF AESEVYREVC KAAYASAMKN LRQNLVGASV
LGHPLIMSDY LDARHVPEKS GIKEQELAKF IGDVVGETSG IPVLLPGPLE RKLYTNGALT
GWTVFEDTLK TFKLQLFDRE FVFEVSSEED GMAEGRRTKL VASGDGYLDI VPNLSNQTLR
KIAREELKAP PPTNLFPHLQ ENVAKVAVGM LSEALTMEAK IKGFTVEFAL EPYDDDTREA
FANISFSDED RLETQRAVKQ LIEVCVDEYM DTRAMAISSV FLPRRIERDT YINLLTGLFG
EIGKEPVLSN LGFNISMRVL SPSSGANVSA SFDDDALTFD EGDAAKKTPS ARDLLQQMGN
EIRRGDFSNI ASEARQILGV SIDGDEDPSR AARKKAQKQT REAISEFVDY LLRDPMYNVK
AIPDNIERLL YINCFELIVD ILSTVLSDFE LDMLGRRIRM LVRQAPKRDV KDLSRFRPDA
RALREITKDF ADIPSVQEIM GNVYAFVLAF VAQVASDFEV TVVGHRLNTG LSRRAEATVM
ESAGPAVTDA LNESLVSAIE AFAQDVFAIG SRTGLNAGGS AETRSAVAID FDKEVFALFE
ANASNPDEKF PFPYLNQEQF AKTIDMFIET LVPGAKLWDS HNATVQKIAK AADLNGDGVI
QWAEWYYAAG AINRATKIAN KKTLEELKGG