Gene OSTLU_27074 details

Gene Information

Locus tag: OSTLU_27074
Symbol:
ID: 5005030
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 164663
End bp: 166813
Gene Length: 2151 bp
Protein Length: 659 aa
Translation table:
GC content: 61%
IMG OID: 640420451
Product: predicted protein
Protein accession: XP_001420922
Protein GI: 145353228
COG category:
COG ID:
TIGRFAM ID:
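
The length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end coordinates are inclusive, and that the coding length equals the protein length plus one stop codon, times three. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 164663
end_bp = 166813
gene_length_bp = 2151
protein_length_aa = 659

# Assumption: coordinates are inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: coding length = (residues + 1 stop codon) * 3 bases.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: the remainder is non-coding sequence inside the gene span,
# which would be consistent with the record marking the gene as spliced.
noncoding_bp = gene_length_bp - coding_bp

print("span matches gene length:", span_bp == gene_length_bp)
print("coding bp:", coding_bp)
print("non-coding bp within gene:", noncoding_bp)
```

Under these assumptions the coordinate span agrees with the stated gene length, and the gene is longer than the coding sequence needed for a 659 aa protein, as expected for a spliced gene.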


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.00289082
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000904568
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker

Sequence

Gene sequence
ATGCCGCTCC GGTTGCAGAA AAAGGCGTCG CGCGTCGTGC GCGCGGTGGC GGAGGAGGTC 
CAGGTCGCGC GCGGCGTCGG CACGCGCACC GATAAGCGCG GGAACGCGAA GACGGCGCTG
TGGGAGCGCG GCAAGGTGCG ATCGAGCGGC GGATTGGCGC GTGTCTCGAT CGGCGCGACG
AGGATCGCGC GCGAACGCGC GACGCGAGGC GACGCGCGAC GCGCGCGCGC GGAGACGCGA
AGGGAAACCC GCGCGGGAGA TTCGGGGAGA CGCGCGAGAG ACTGACGGGA CGCGGACGCG
ACGCAGGCGA AACTCGCGGC GACGGCGGTG GAGAAGGAGT TCGCCGAGAC GCAGTTGGCG
TTTAGCAAGG CGACGAATTT CGATGACGTG CCGCCGAAGG AGAAGCACGT GCTGGCGCTG
GTGCGCACGT GCGGAGGCGC GGGCGGAGGG AGCTCGAGGG ATCGGGCGTT CGTGTTGGAG
ACGTTGGCGA GACAGGTGCG GAAGTGCGCG CCGTGGAGGA CGATGCTGAA GACGCACGTG
TTGCTGCACA GGTTGATGCG GGAGTGCGAG GGAGGGGGGT TTAAGGATGA CTTCTTCAGG
TTTTTGGAAT TTTTATCTCG GAAAACGTAC GGGCCGAAAG AACAGACGCT GTTTAACATT
CGCTACTGGA AGGACGAGAC CAACAAGGAC GCGTACGAGT TGTCGGGGTG GACGCGCGCG
TACGCGGCGT ACCTCGAAGA GCTGTGCGCT TTGAATGAGT TCATCCCGAG CCTCGTAGGA
AACGTGAGTG GCGCGGTGAC GACGACGACA AACGGCGAGG CGCGAGCGGT GGTGGCGAAT
CCGTTGAAAG ATTGTGATTT CGCGACGTTG ATCAAGGTTT TGCCCTTGGT GCAGACGCTC
GTGCGACGCA TCACGGATTG CGCGCCAACA TCTACGACGC TGCAGAAAAA TGCCGTCTCG
CGATACGCCG TCGGACTCGT CGCAAAGGAT AGTTTCTTGG TGTATCGCGT CATGAACGAG
GGCATCATAA ACCTGGTGGA CAAGTACTTT GAAACGAGCA AAGTCGAGGC GGAGAAAGGG
TTGGTGATTT TCAAAAAGTA CTTGACGCAA ATCGAAGACT TGCAACGATT TTACGACACG
TGCGAAGCGT GCGCGGCGGT GGAAAACGCA GTCGTCAAGC TTGAAGCACC CCCTGCGACG
TTTTTGAAGA GCATGGAAGA GTACTTCGAA TCGGCGCCTC GCGAAGGCTT GCCTCTTCGC
GAGCGGCGGT TGGGCGCGAC ATCTTCGACG ACGGCGAACA ATGCACGAGC GAATGCGGTG
GGGTCGACAA TGTTGGCGAT CGACGTCCCC GCCAACAACG CGGACTTTAT CAGTACCACT
GCTGCGCTAC CGCCGGTGGA GCCGTTGAAT GCGCTCGATG CGCTCAGTCA GCTTGATTTA
GGTACGCCGA GCCCAACGAG CAAAGACGAT GTTTTTAGCT CAAACGCGCT GCCCGCGCCG
ACGCAACCGC CGGCGTTAGC GCCGGTCGCG CCCGCAGCTT CGAGCAGTAC AAGCGCACTT
GATTCTTTCT CCGAGTCAAT CGCTCCGGCG GTGCCGACGG AACCCTCTGC GGTGGCGTAC
AACCCATTCG GAGCGAACCC ATACGGCGGC GCCCCGCAAA TGGTGCCGGC GGCTCCGCAA
ACGGCGCCGG CGCCCCAAGC AAAATCGCCG AGAAGCACGA ACCCATTCGG AAATAACCCG
TTCGGTACAC CACAGCCTCA AAGTTTGGAC AAGAGCGCAC TCAATGACTT ATACGCGCAA
GCTCCAGCGT CACCCAGGAG TGGCCATGGC ATGAGTTCCA TGGCACCGCC GCAACACATC
AATCCGAGCT TTATGCAAGC GCCGCCCAAC AGCGCCTTGC AGCGGCAGCA AGTTGGCGCA
CCTCAGCTAG CGCTACCGAT GGCTGGTGTG CAGTATCCAC AACAATATCC GCAGATGGCG
TTCCCCCAGC ACCATCAGCA GATGGGATAC CCGCAGCAGC ATCCGCAGAT GGGATATCCC
CAGCGCCCGG CGGCTGAACC ACCGTTGCAC CCGGCGTTCG CCAACCCTCA CCAAGGCGGC
AGTCCAAGTT CTAGCGCTCC ATCACCGCAG AACTCCGGCA GTTTGATTTG A
 
Protein sequence
MPLRLQKKAS RVVRAVAEEV QVARGVGTRT DKRGNAKTAL WERGKAKLAA TAVEKEFAET 
QLAFSKATNF DDVPPKEKHV LALVRTCGGA GGGSSRDRAF VLETLARQVR KCAPWRTMLK
THVLLHRLMR ECEGGGFKDD FFRFLEFLSR KTYGPKEQTL FNIRYWKDET NKDAYELSGW
TRAYAAYLEE LCALNEFIPS LVGNVSGAVT TTTNGEARAV VANPLKDCDF ATLIKVLPLV
QTLVRRITDC APTSTTLQKN AVSRYAVGLV AKDSFLVYRV MNEGIINLVD KYFETSKVEA
EKGLVIFKKY LTQIEDLQRF YDTCEACAAV ENAVVKLEAP PATFLKSMEE YFESAPREGL
PLRERRLGAT SSTTANNARA NAVGSTMLAI DVPANNADFI STTAALPPVE PLNALDALSQ
LDLGTPSPTS KDDVFSSNAL PAPTQPPALA PVAPAASSST SALDSFSESI APAVPTEPSA
VAYNPFGANP YGGAPQMVPA APQTAPAPQA KSPRSTNPFG NNPFGTPQPQ SLDKSALNDL
YAQAPASPRS GHGMSSMAPP QHINPSFMQA PPNSALQRQQ VGAPQLALPM AGVQYPQQYP
QMAFPQHHQQ MGYPQQHPQM GYPQRPAAEP PLHPAFANPH QGGSPSSSAP SPQNSGSLI