Gene OSTLU_27740 details

Gene Information

Locus tag: OSTLU_27740
Symbol:
ID: 5005610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 100163
End bp: 103378
Gene Length: 3216 bp
Protein Length: 1071 aa
Translation table:
GC content: 60%
IMG OID: 640421031
Product: predicted protein
Protein accession: XP_001421709
Protein GI: 145354893
COG category:
COG ID:
TIGRFAM ID:
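
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 100163
end_bp = 103378
gene_length_bp = 3216
protein_length_aa = 1071


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one assumed stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons print True when the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```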


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00683941
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0438763
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGGGT ACGCGCAGAC GGACGGCGGC GCGAGCGCGG GCGCGGAGGC GGCGGCGAGG 
GAAGAATTCG ACGACGGGGC GATGGAATTC ATCGCGGCGT CGAGCGCGCG AGGACGCGCG
CGCGGCGAGG AGGGCGACGC GGACGCGCGA GGACGCGCGT TGGCGTGCTC GGTGAGCGGA
TTCGGCGCGA GCGGGAGCGC GCGAGCGGCC AAGGTGGAAA ATTTTATCGC GGCGAGACGA
CGCGCGACGA AACGGTTGGC GCCGTGGTAC GAGGGATCGC GCGCGAGGAT CCACGCGATG
GCGCACTCGC GCGATGGGGA AAGCTTGTTG TGCTGCACGA GCGCGGGAGG GGTGTACGTG
GTGCCGATTT TGGAGTTGAC GCGTGACGCG GAGGCGTCGC CGGCGATGCG GGTGTTGGCG
TCGAGAGGGC CGAAGCCGGC GAGCGCGCTG TGGTGGTATC GCGCGCTCGA ACCCGAGCGC
GAGGATCCGG TGGTGGGGAT ATGCGTCGGC GTGGACGGGG AGGTGCGGGC GTGGGACGTG
AAGGAGGGGT CGCCTCTGGG CGCGTGCGTC GTCGGCGCCA AGTGCGCGAG CGCGGAGCTC
GCGCGCGGCG CGACGAAGCA ATTTCTCGTC ATCAACGGTG TTCAGGGCGA AGTTTGGACG
TTGATGCTGG AGAAAATGTC GCGCGTGGTG AAGACGTCGA CGAGCGAAAA AGGCAAGGTG
GAGACGACGA CCAAGGCTGA ATCGTTACCG GACGCCGCGG GTTCGCACGG ATTCGCCGCG
CACCTGCTCA AGGATGAGTA CGGCGTTCGC GAGGGGCGGC AGGTGACGCT GAGCGTGCAA
GAGACCGGCG AGAGCGATGG ATATTCGCTC ATCGCTGCGT TGATCGATCG CCGTACGTTG
GAGTTGTACG ACGTGGATCG CGCCACCGAG CCAAAATCCA CGCACGCGTT GCCGAATCAC
ACCGTCGCCG TGCACGTGAC TGAAGATTTA ATATTCGCCT TGGTGCGCGA ACCGGTTTTC
GGTGAGGAAG ATCTCGCAGA CTTAACCTCA TTCACCGCGT CCGTGCACGT GTTGGCGCGA
AGATTCGGGA CCGACGGTTC CAAGTCGTTT ACGCTGCAGA CGTTTCACGT GCCGCGCTCC
GCGGGCGTGC CAAAAAAGTT CCTCGCGGCT GAGCTCCCGG AAACGTACGT GCGAAGACGA
GAGACGCTGT GCGGATGCAT GTTGTGGACA TCGTACGGGG TGTACGAAAT TAAGTCAAAA
GTCGACATCG CGTCTACGCT GCGATCGTTC ATGTCGCCGA ACGCGGTCGT TGCCTCGACC
CGCCGCGACG CGTGGGCGCC GATCGAACGC GACGAGTTCG GTGACAACGC TTTAAGCGTA
GATCTCGACC AAGACGAACG CTTAAAACTC GTCGCGCACG TCTTGGACGA GGATCACATG
CCCGTGTTCG TCGAGGCTGC GCGCCAGGAG CTCAAGCGAC AAAATTTCTC TCGAGCGCGC
GATTTATTCG GGAAGACTGG AAAGCCACTC AAAGATTTCA TCTCGCTGAG CTTAGAGGCG
TGGGAGGCGT CGCAAGCACT GACGAACTTC CACGGAAAAT CAGTCGAGGC GTACGGAGGA
CAGGCGAACC TTTCGTGGTT GAAGACGGCC GCCGCCGCGC ACGCGCATTT GCACGCGTGG
TGCGAAGCGT CGAACGCAGT CGCGTACAAC GCCGCCGCGC ACGATTTGGG AGAATCGATT
CAAGCGAAGC TCCAAAAGTT GAACCTGAAA GAAGAACGCT CCCCTGAACA AGCGGTGAAG
GATCTTTTGC GCGTCATCGG TGAAGGTATC GAAGCCACCA AAGACGCGGG CGTGCAGCGC
GACGTGGCGT GCTCGGCGAG CGCCGCCGTC GTCGCGTTCG AGGCTGCGAA CGCGGTGAGC
ACATCCGTGA TCTCGTCGTG TGCGGCGAAC GAGTACTCAG ATCGAATCAA AACGTTATTC
ACCATGCTGT TGTCCTCATC ATCCACCGTT GAGAGCTTGC ACATGCTCGG TGGCGTCACA
GCGAATTTGA TGACGCAAGC AGCCGAGCAC AGCGGCGGCA CCGTGCGTAC TTGGGAGTCC
AAACCACTCG TGTTTTGGGA TCCGAATGTC ACGTACTACG TCATCGCTAC GCAGACGATG
GAAAATCTGT ACGACATCGT TCATGTGCTC GGTTTGGACA GCGACACGAC GGACACGTGC
GAGGCGTCGC CCTTGGAACT TGTGTACGAC ATGTTGACGA GAGATGAGCT GCGCTCGCTC
GCGGATATGG CGCGCGAGGC GAAAAATCTT GGCGTCACCG GTGCAGCTGA AATCGAGCTA
ACGATACTTT TGTACTTGGA CGACGAAAAC GCTCTTGCCG AGCGCATCGA AGTCATGTTG
GAAGACGATC CAAAGTTGTT TGGTTGGATC GCGTCCAAGT GTCTGGACAA GCGAAAGTTT
CGCATCACCG AGTTGGCAGC GTCACAGATG GAAGACTTTG CCACCGCCGC TATGTGCCAC
GTGGCCGCTG TTCAAAAGCT CGCCGCTGTG GGCGAGACGT CGCCGTCAAC CTTACAGGCT
GAGCTCGAGC TGGGCGTCGA GACGTACGTC GCGCGGGTGG TCGACACGCG CGCGCAAGTG
AAATCAATCG AAGACGTAGC ACGATGCTGG CAAAGGAATG GATTGCCTAT CGACGAACTC
GAACGTTTGT TTCTTGACGT ACTCGTCTCT CAAGGTCACG CAGAGGCTAT GCAAGTTGTA
CTGCAAAGTG ATCTCGGATT TCAATTCAGC GGTCAATTCA TTCTCGCCGT CGCGACGAAA
CGAGTAGCCG AGGACGAGTC AAAGTATTCT TCGAAGGATG GCGCTACAAT CGAAAGCGTT
TGGTTGAAAA TCAAGCAAGA TCTAGCATCT CGCCTGGATT CCCCCGAGTT TGTTCAAACG
CGAGCTTTCG ACGCGATGGA ACTGGACTCT CTGACTTCAG TCGACGGTGC GCAGTGCTGG
GCATTCACGT GCGGACACCG CTACGGCTCT GAAGAGTTGC AGCGCGAAGT CAACGACGCA
AAGGCGAGAT TGAAGATTCT GGATTTACCC CTGTCGTCCA TGTTACTCGA AAGCGACTAT
AAATTGCAAA AGTGCGCGGT CGCGTGTCCA AACTGTGTGT CTTACGCCGT CGAGCACTAC
GTCGAAGTGC GTCGCAACAC GAGGGGCGCC GCGTAG
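
The GC content reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the method the database itself uses. It assumes the sequence is pasted as text in the 10-base blocks shown here, so whitespace is stripped before counting. The function names are hypothetical, and the demo input is only the first two blocks of the sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demo on the first two 10-base blocks only; paste the full gene
# sequence in place of this string to compare with the reported value.
demo = "ATGTTCGGGT ACGCGCAGAC"
print(f"GC fraction of demo blocks: {gc_content(demo):.2f}")
```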
 
Protein sequence
MFGYAQTDGG ASAGAEAAAR EEFDDGAMEF IAASSARGRA RGEEGDADAR GRALACSVSG 
FGASGSARAA KVENFIAARR RATKRLAPWY EGSRARIHAM AHSRDGESLL CCTSAGGVYV
VPILELTRDA EASPAMRVLA SRGPKPASAL WWYRALEPER EDPVVGICVG VDGEVRAWDV
KEGSPLGACV VGAKCASAEL ARGATKQFLV INGVQGEVWT LMLEKMSRVV KTSTSEKGKV
ETTTKAESLP DAAGSHGFAA HLLKDEYGVR EGRQVTLSVQ ETGESDGYSL IAALIDRRTL
ELYDVDRATE PKSTHALPNH TVAVHVTEDL IFALVREPVF GEEDLADLTS FTASVHVLAR
RFGTDGSKSF TLQTFHVPRS AGVPKKFLAA ELPETYVRRR ETLCGCMLWT SYGVYEIKSK
VDIASTLRSF MSPNAVVAST RRDAWAPIER DEFGDNALSV DLDQDERLKL VAHVLDEDHM
PVFVEAARQE LKRQNFSRAR DLFGKTGKPL KDFISLSLEA WEASQALTNF HGKSVEAYGG
QANLSWLKTA AAAHAHLHAW CEASNAVAYN AAAHDLGESI QAKLQKLNLK EERSPEQAVK
DLLRVIGEGI EATKDAGVQR DVACSASAAV VAFEAANAVS TSVISSCAAN EYSDRIKTLF
TMLLSSSSTV ESLHMLGGVT ANLMTQAAEH SGGTVRTWES KPLVFWDPNV TYYVIATQTM
ENLYDIVHVL GLDSDTTDTC EASPLELVYD MLTRDELRSL ADMAREAKNL GVTGAAEIEL
TILLYLDDEN ALAERIEVML EDDPKLFGWI ASKCLDKRKF RITELAASQM EDFATAAMCH
VAAVQKLAAV GETSPSTLQA ELELGVETYV ARVVDTRAQV KSIEDVARCW QRNGLPIDEL
ERLFLDVLVS QGHAEAMQVV LQSDLGFQFS GQFILAVATK RVAEDESKYS SKDGATIESV
WLKIKQDLAS RLDSPEFVQT RAFDAMELDS LTSVDGAQCW AFTCGHRYGS EELQREVNDA
KARLKILDLP LSSMLLESDY KLQKCAVACP NCVSYAVEHY VEVRRNTRGA A