Gene OSTLU_785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_785 
Symbol 
ID5005919 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp245610 
End bp248221 
Gene Length2612 bp 
Protein Length653 aa 
Translation table 
GC content55% 
IMG OID640421340 
Productpredicted protein 
Protein accessionXP_001422016 
Protein GI145355534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.357988 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000475729 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
CCGATGGACC TGCTGACGCT GCAGAGCGCG TGTAAGCGCG ACCCGGAGGG ATACGAGGAG 
GATTTCGCGC TGCAGCTGCG ACACTACGAG GCGCTGCGCG CGCTCTTCGC GATGAAACCG
TCGCGGGATC ATAAGGAATT CAGTGAGTTG GTGTCGTTCA TCGCGCACGT GAGCGTGGTG
TATAAGGAAC AGACGAAAGG GTTCTCGAAA GGGGTGGTGG AGGTGCTGGA GCGACATTAC
GCGATACTGG ACCCGCACTT GAGGAAGAAT TTGACGAGCG CGTTGATTTT ATTGAGGAAC
AAGGGCGTGA TCGGTGTGGA GGTGACGCTG CCGTTGTTTT TTAAACTGTT CCGGGTGAAG
GACAAAAATC TGCGGGTGCT GATGTTCAGA CACATCGTGA GCGACGCGAA GGCGGCGAAT
AAGAAGAGGA CGGATGACAA GTATAATCGC ACGGTGCAGT CGTTTTTGCA CACGGCGATA
AAGGATGAGA ACGAAGCCAC GGCGAAGAAG GCGCTCGCGG TGCTGACGGA GATGTACCGG
AGAAATATAT GGACCGACGC GAAAACGGTC AACTTGGTGG TGGAGGCGTG CAAGCATCCA
TCGCAAAAGA TCTTAATCGC CGCGTTGAAG TTTTTTGAAG GTCAAGACGA AGCCGCCGAG
GCGGCAGCCG AAGCTGGGGA TGGGAGCGAT TCCGACGACG ATCCGTCGAC TCGCGAATCG
GAGATTAAAT CGCGCACGCA GGTTTCCAAG GAAGACGTTT TCAAGGCGTA CAAGACGGTG
CGTGGTGCCT CACATTTTCA ACGCTCTTTT TTTTCAAACA CGACGACGTG CAAGCCGATG
ATGTTCAAAT ATATTTGCTA CTCTCGAGGC TCGACAGTGA CGACAATCAA TCACCAAGAT
CTCTGAGGCT TTTCTCGCGA CGTTCGTTTG CTCTTTTCTT TGCGCCGCGA GCGAGTGTTC
GAAATACTGA CGAAAATTGT CTCGCTCGTT TCTTCCGATC GCGCAGGGCG TCGCCTCGTC
TAAGAAGAAG AAGCAGAAGA AGCTAAAGCG GACGATTAAG ACTATGCAGC GCAAAGAACG
CAACGCGGAC AAAGCGATTG ATTCTCGGTT CGCCGCGATG CAGCTCATCA ACGATCCGCA
AGCGTTTGCT GAGTTGCTGT TCGGAAAGCT TCAAGTCGGG CACATGTCGT ACGACACGAA
GATGCTGTGC ATCCTCATGA TGTGTCGCAT CATCGGCATG CATCAACTCA TCATGCTCAA
TGTGTATCCA TTTTTGCAGC GCTATATTCA ACCGAGTCAG CTGGAGGTAC GCATTGATTC
ATCGAGCGCT CGACGACGCG CCGCCCGCTC GCGAATGACG ACACAACTCA AGGCCAAGGG
CATTCCAAAT ATGACCCATA CCAACGCCTC CGCTTAGTAC CTGACGCGAA CGCATCGAGC
GGTTACGTTT ACGCATAATC GAATGCGCGA CTGACGTTTG ATTTCCATCA TTTACGCAGG
TGACGAGACT TCTCGCCGCC GCCGCGACGG CGTGCCACGA ACTCGTGCCG CCGGACGCGC
TCGCGCCGAT GTTGCGTCAG CTCGTCAACC AGTTCATTCA CGATCGCGCG CGCCCTGAAA
TTGCCGCGGT TGGTTTGAAC GCGGTGCGTG AAATTTGTGC GCGCTGTCCT TTGGTGATGG
ATGAAGATTT ACTTCAAGAT TTGACGCAAT ACAAAAAATC GCGCGACAAG CCGGTGTCAA
ACGCAGCGCG AGGACTCATC GCTTTATTCC GAGAAATTGC GCCCGGCCTC CTCGACAAGA
AGGATCGTGG GAAAGCGGCG GATATGTCAA GAACGCTCAA GGGTTTCGGT GAAGCTGAAG
TGGTGGGTCG CATCGACGGC GTCGACTTGC TTCAACGCGA CATTTTGAAA CGTAAACGCG
AAGAGGAAGC CGTCGAAATG TCTTCAGAGG AAGAGTATTC CGACGAAGAC GAAGACGAAG
ACGAAGACAA CGAAGAAGAA GAGGAAGAGG ACGAAGAAGA AGAAGAAGAC GCGGACGAAG
AAGAAGAAGA CGAGGTAGAG CCGCCAGCGA TTGGCAAACG CGGACGCGAG AGTGACGAAG
AAGCGTCCAT CGATCCCGAC GCGCCGCCGC CAAAGATTCG TAAGAATGGC AAGTTGTCGC
TCGCCGAACT CAAGCGTCGC CACAAGGCAA TGATGCAGAG GCGTAAAGAA GAGGAAGAAG
CCGAGGTGCG CGCCGAGCAA GAGGCCGAGG AAGCCGAACT GGGTGGACCG GTGGAGCAAG
AGCGCATTCT CACCGACGAG GATTTCAAGC GCATCAAGGC GCTCAAAACG GAGCGGCAAC
TCAACGCCGC GCTCTCCAAG GCGGGCGCGA TGAAGGCTTC GAACGTCGCG ACCGATCACA
TTCGATTAAT GCTTCGCAAG GCGGATCGCG CGAGTGATCG TCGGGTGAAC CCTGATTCGC
TCGCCGCGAC GGGCTTGAAG AAGGCGCACG ACAAGGCGTC GCGTCTCGCC ACCGTCCTCG
CCGGTCGCGA GGACAACGAG TACGGCGCGT CGAGCGCGCG CAAACAAAAG AAGACTGGCG
GTTCGAGCAA CAAGGAGAAG GACAAGAAGA AA
 
Protein sequence
PMDLLTLQSA CKRDPEGYEE DFALQLRHYE ALRALFAMKP SRDHKEFSEL VSFIAHVSVV 
YKEQTKGFSK GVVEVLERHY AILDPHLRKN LTSALILLRN KGVIGVEVTL PLFFKLFRVK
DKNLRVLMFR HIVSDAKAAN KKRTDDKYNR TVQSFLHTAI KDENEATAKK ALAVLTEMYR
RNIWTDAKTV NLVVEACKHP SQKILIAALK FFEGQDEAAE AAAEAGDGSD SDDDPSTRES
EIKSRTQGVA SSKKKKQKKL KRTIKTMQRK ERNADKAIDS RFAAMQLIND PQAFAELLFG
KLQVGHMSYD TKMLCILMMC RIIGMHQLIM LNVYPFLQRY IQPSQLEVTR LLAAAATACH
ELVPPDALAP MLRQLVNQFI HDRARPEIAA VGLNAVREIC ARCPLVMDED LLQDLTQYKK
SRDKPVSNAA RGLIALFREI APGLLDKKDR GKAADMSRTL KGFGEAEVVG RIDGVDLLQR
DILKRKREEE AVEMSSEEEY SDEDEDEDED NEEEEEEDEE EEEDADEEEE DEEAELGGPV
EQERILTDED FKRIKALKTE RQLNAALSKA GAMKASNVAT DHIRLMLRKA DRASDRRVNP
DSLAATGLKK AHDKASRLAT VLAGREDNEY GASSARKQKK TGGSSNKEKD KKK