Gene OSTLU_18189 details


Gene Information

Locus tag: OSTLU_18189
Symbol:
ID: 5005420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 601765
End bp: 605049
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table:
GC content: 63%
IMG OID: 640420841
Product: predicted protein
Protein accession: XP_001421514
Protein GI: 145354485
COG category:
COG ID:
TIGRFAM ID:
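
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length includes the terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 601765
end_bp = 605049
gene_length_bp = 3285
protein_length_aa = 1094


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the listed values: the coordinates span 3285 bp, and 3285 bp corresponds to 1095 codons, that is, 1094 residues plus a stop codon.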


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000423092
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0423026
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGA GCGAGCGCGA GGAGTGTCGC GCGTTCGCGC TGCGATGGAC GCTGGACGCG 
GAGGGCGCGG CGAGGGCGGT GAGGAATCAG ATGACGGCGT GCTGCGCGAC GCTGGTGAAG
CGCGCGGCGG TGGACGCGGA CGACGCGACG AAGATGGCGA CGCTGGGGGC GTGCGAACGC
GAGGTGCGGG CGAAGATGGA ATCGTCGAAG GGGGAATCGG ACGCGGAGAC GATGGGACTG
GAGGTGTTCG CGGCGATCGT GAGCGAGTTC GCGCCGGGGA CGGCGAGCGA ACTGGGGACG
ACGTGGGAGC GGCACGAAGG ATGTCGAGCG AGCGCGGAGA AACATTTTTT GAAACCGTTC
TTCGCGCACG GGTGCGAAGC GGCGAGACGG TGCGTGGAGA CGGGGAGGGT GAGCGATGGG
AGCGATCGGG GGGCGTGCGC GGCGGCGCTG CGGTTGATGA ACGCGGTGTT GAGTTGGGAT
TTTAATAGAG ACGTGAGTTA CGGGTTTCGG GGACGGGCGT TTCCGAGCTC GGAGAGCGCG
GCGAACGCGT TCGTGAAGCT CACGCCGGGA ATGGAGTGGA GGGATGTGTT GTTGAATCCA
GGCGCGCTGG ATTGGTTGTT CGACTTGCAC GCTGGGGCGG AGAGTGCGGT GTTGGCGGGC
GGAGGCGTCG AGGCGAAGCG CGTCGCCGCG GCGAGTCGCA AGACGCTGAG CGCGCTCTGC
ACGCTCAGTG GATGCGTGTT TCCCCCGCGG GATGCGGACG ATAGCCTGAG ACAGGGCCAT
TTCGTGCGAT GCGCGCGCGC GATCGCCAAG TATCTCTTAC CGGCTGCGAC GAGCGTGCGC
GCGGCGTTGG AGGGGCACGG AGAAGACGCC CTCATCGACG GCTGCCGCTC CATGTCCGCG
TTGGCGCTCG TGCACGACGC CAACGATTTC GCGAGTTTAT CGCTCGGTCC GGAGTTGAAC
GAGCGTACCG CGCTAGACTT ACTCGGCGAG CTCACGTTAG AGTGCTTGAA TCAAGACGCG
CTGTCGGTGC AGTGCGAGGG CACGGTGACG GATGATTGTT TAAAGATGCT CCTCGAGGCG
TGGGCGTCGT TGGTGAACAA GGGCATGAGC GCGCCGGGAG GCGTGGAAAC GGCGGTTCCG
AGCGCGGTTT TAGAAGGAGC GGCAAATATT TCGCACGCTT ACGTCGTCGC TGGGTTGAAA
GCCGCGCGCG AGGGCGCGCA CGAAGAGGAC GATGGGCACG AAGAGGAGGG CCAAGCCGGG
GCGGCGGCGC TCGACGCGAG ACTCGAACTC GCCGCCCAAG TCTTGCGCGC GCATCCCACG
ACGACTTTAC CGACGCTGCA GCACGTTTTA GTTGAAAAGC GAAACTCGCT TCCCGCGTGC
ATGGCGAGCG GTCAAGATCC GAGTGAGTTG TTGGAGGAAT TATGGTGGCT GACGCGCCTG
GCTGCGCACG TGCTCGCGGA TGACGGCGAC GGCGAAACGC CGATTCCACC AGACTCCCTC
GCTGCGGCGT CCGCAGCCAC GGCACCAGGC GCACCAAACT GCGTAGTGGA ACTCGCGCGA
GCGTTGATCG ATTTCGGTTG CTTGGCGCTC GACGCCAACG CGCGCGGCGC GCTGTCGCCT
CGCCTGCTCG AAACCGTCGT GTGGGCGCTC GCGCGTTGGG CGGATACGTA TTTGATTCCA
GAGGATTCCG GCGGTAGCTT ACACGCTGCG GTGTACGCCG CCGCTGGAGG CGTGCGACGA
GGTGCTGACA TCGCGAACAA GCTCGCTGAG AACGGTGGGG GAATGTTCAG CGAAAGAAAC
GGTGGCGTGG AGGCGCTCGA CGCGCTCGTG CAAATCGGCG TCAAGGCGCT CAGCGATTGG
CCGGGGGAGA CGAGTTTACA AAAAACAATC GGATTCGTGT TGTTTCCCGT GCTCACGCGA
CGAAAGACGC TGTTGAAGCA CTTAGTAAAC ATGCCTTCGT GGGATGCGTT GCGTCAGGCG
TGCGCTGGGG CGCATCATGA GCGCGGCGTC GTCGCGTTCC CGCCCGAAGT CCATCGCGGT
TTGAGCGAGT GCGTCGGGCG CGTCGCCGCG AGCGTGATTG ATCCCGCGCA GTGCGAGGCG
TATGTGAACG CCCTCATCAC GCCCCCGGGC GAAGTCATCG CCGCGGTAAG CGTCGATCGT
GAGGGTTTAC ATCACCCCGA AGGCGAGGCT CGAGCGTGCG CCGCGCTCGA GGCGTTGCGA
GGCGTCGTGC GATCGACGAA TGGAAAGAGC CAACCGGCGG TTTTCAACTT TTTCGTCGCA
GCCGTCGATC ACTTGCTAAA TTTGCAAACG CTCGCGAAAG ATTTAGGACG AGTGATGAAG
CTATTGCTGC GCCTGACCGA GGAGTTCGTC GAGGCCAACT CGCCGTATCT CAACGCCCAA
CAAGTGGATT GGGTGTGTCG GTATTGCCTG CGCGTGGTGG AGACGTACGC GAAATCCGGC
CGCGGCGCCG TCAAGTCGGA AGCCGGCGCG CTATTGAGTC AAGAGGCGGT AAAAGAGGCG
TATAAAGAAG TTCGCGCGCT ATTGCGCATG CTCACGCACA TGTCGAGCGG AAACTTACAC
GACGCCATCA TCGAGAGCGC GCCGCCCGAC CAGGCGGCGG CGCTCGCGGA ACAAATCGAC
ATCGCTCGCG TCGTCTTCGC CGGTTTGAAC GCCGTCATCC CGCTCATCAC GGATGAATTG
CTCAAGTTCC CCAAGCTTTG CAGACAATAT TTCGAGCTTT TGGCGTACAT GCTCGAGGCC
TACCCCAAAA AAGTCGCGCA GCTAGCGCCC GACTTATTCG GCACCCTCAT GTCGACGCTC
GAATTCGGCT TGAAGCACGC CGACGAGACG GTGAGTAAGG AGAGCATGAC TGCGCTCGGC
GCCCTGGCCA CGTTCCAATG CAACAGTGCG AAGACACAAA CCATCGGTTT AGGCGCACAC
ATGGCCCCTA ACGCCGAGGG CGTGTCCATC CTCGCCCATC TCATGCGCCT CTTGTTCCAC
CGTCTCGTCT ACGAGGAAGC CGTCTTCAAT CTCGTCGACG AAGCCGCCGA CGCACTCCTC
CCCATCATCC TCCACGAGCG CCCGGCGTTC CAAAATCTCG CCTCTGCCTT CATCTCCGCC
GTCGCCGACG AGCCACGAAG CGTCGATTTA CTCCAAAACG CCTTCGTCGC CCTCACGAGC
GCCAACGGCC TCGCCGAGGG CGTCGACCGC GTCAACAAGC GTCGCTTCCG TCGCAACCTC
GCCGATTTCC TCACCGTCGC TCGCGGCGTC TTGCGCACGC GTTAG
 
Protein sequence
MRASEREECR AFALRWTLDA EGAARAVRNQ MTACCATLVK RAAVDADDAT KMATLGACER 
EVRAKMESSK GESDAETMGL EVFAAIVSEF APGTASELGT TWERHEGCRA SAEKHFLKPF
FAHGCEAARR CVETGRVSDG SDRGACAAAL RLMNAVLSWD FNRDVSYGFR GRAFPSSESA
ANAFVKLTPG MEWRDVLLNP GALDWLFDLH AGAESAVLAG GGVEAKRVAA ASRKTLSALC
TLSGCVFPPR DADDSLRQGH FVRCARAIAK YLLPAATSVR AALEGHGEDA LIDGCRSMSA
LALVHDANDF ASLSLGPELN ERTALDLLGE LTLECLNQDA LSVQCEGTVT DDCLKMLLEA
WASLVNKGMS APGGVETAVP SAVLEGAANI SHAYVVAGLK AAREGAHEED DGHEEEGQAG
AAALDARLEL AAQVLRAHPT TTLPTLQHVL VEKRNSLPAC MASGQDPSEL LEELWWLTRL
AAHVLADDGD GETPIPPDSL AAASAATAPG APNCVVELAR ALIDFGCLAL DANARGALSP
RLLETVVWAL ARWADTYLIP EDSGGSLHAA VYAAAGGVRR GADIANKLAE NGGGMFSERN
GGVEALDALV QIGVKALSDW PGETSLQKTI GFVLFPVLTR RKTLLKHLVN MPSWDALRQA
CAGAHHERGV VAFPPEVHRG LSECVGRVAA SVIDPAQCEA YVNALITPPG EVIAAVSVDR
EGLHHPEGEA RACAALEALR GVVRSTNGKS QPAVFNFFVA AVDHLLNLQT LAKDLGRVMK
LLLRLTEEFV EANSPYLNAQ QVDWVCRYCL RVVETYAKSG RGAVKSEAGA LLSQEAVKEA
YKEVRALLRM LTHMSSGNLH DAIIESAPPD QAAALAEQID IARVVFAGLN AVIPLITDEL
LKFPKLCRQY FELLAYMLEA YPKKVAQLAP DLFGTLMSTL EFGLKHADET VSKESMTALG
ALATFQCNSA KTQTIGLGAH MAPNAEGVSI LAHLMRLLFH RLVYEEAVFN LVDEAADALL
PIILHERPAF QNLASAFISA VADEPRSVDL LQNAFVALTS ANGLAEGVDR VNKRRFRRNL
ADFLTVARGV LRTR