Gene OSTLU_37792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37792 
Symbol 
ID5005993 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp39719 
End bp42787 
Gene Length3069 bp 
Protein Length934 aa 
Translation table 
GC content58% 
IMG OID640421414 
Productpredicted protein 
Protein accessionXP_001421820 
Protein GI145355127 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.772788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.533349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCTCT TCGCGCCGTA TCGCGCGCTC GGCCTCGTGA ACGCGCAGCG AGGATGCGCG 
CTCGTCGTCA AGCGGCGCGG GACGGAGACG TTCGTCACGG TGAGCGCGGA GGACGCGTTC
GCGGTGTACG ACGCGCGAAA ACTCACATTG GTCTTTCGAA GCGCGCGGTT CGCGACGAGC
GACGCGCGAG GGATCGGGGC GATGGCGGTG AGGAAGGATT ACACGTTTTG CGCGATCGGA
CGGGAGATAA GGTGCGCGAA ACGATTGGCG GAGTGCTGCG AAGGACTGGG GCGGGACGAC
GGCTCGGGAC ACGACGCGCG GGTGACGACG CTGCACGCGT TCGGACGACA CTTGGCGAGC
GTGGACGAAA ATGGGGCGGT GAAGATGTGG GACATCGATG ATGACGCCAT GCGGGCGCGC
GAGCGGGCGT GGGCGGTGGG ACGAGGAGAA CCGACGGGGC CGGAGGGCGA GAGCGACTCG
GAGCTCGAGC TCGGCGCGCG AGCGATGGAG ATGCCTCGGA CGTTCGCGGC GACGACGGTG
TGTCATCCCG ATGGATACGT GGATAAGTTA CTGTTTGGCT CGAGCGATGG GCGATTGGCG
CTGATGAACG TGCGAGCGGG TAAATTGGTG CACGAATTTG CGGGCTGGGG GTCGGCGGTG
ACGGCGCTGG AGAACTCGCC GGCGACGGAT GTCGTCGCCG TCGGTTTGGC GGATGGGCGC
GTGCTTTTGG TGAACGTATT AGAAGATAAA GTTTTGTTTA CGCTCACGCC CGAGCGTGGG
GTAAAGGTGA CGGCGCTGGC GTTTAGAACG GATGATCAGG ACGACGTGCT GTGCGTCGGC
GACGAAACGG GACGCGTGAC GGTTTGGGAT TTGGAAAAGC GCTCGTTGCG CACGCTCATC
GTGCAATGTC ACGAAGGTCC GGTGGTTTCG TTGAAATTTC TCGATGGACA ACCGGTGATG
GTGTCGAGTG GGTTCGATAA CACGTTGAAG GAGTGGATTT TTGATAGAGA AGACGGCGAC
GCTCGACTCT TGCGCTTTCG CGCCGGACAC TCCAAGCCGC CGACGAGCGT GTCGTTCTAC
GGTGAAGGCA AGAAACTTTT AGCGGCTGGG AGCGATCGAA CGTTGCGATT TTTTCACGCG
TTTCGTGATC AGCAAAACAT CGAGTTGAGC CAAAAGAATG TCAGCAAGCG CGCGAAGAAA
ATCGGCGTCG CGGAGGAAGA GTTGAAACTT TCTCCGGTTA CGAAGATGGC GTGGGGCGAG
CTTCGCGAGC GCGATTGGGC AAACGTCGTC ACCGCGCACG AAGGTTCAAA CAAGGCGTAC
ACGTGGCGTA TTTCCAAAGG TGCGCTCGGT GAACACATTT TGCAATGTCC GAAGGACGAC
GGCAAGTGCG AAGTCAAGTC GGTTGCGATT AGCGCGTGCG GCAACTTTGC CTTTCTCGGT
GCCGCAAACG GGGCGATCCA TCGATTCAAC TTGCAATCGG GCGCGCATCG TGGCGCGTTC
GAGCGCGTCC TCGACGCTGA TGAGGTGATG CGCACAAAGA AGAAGAAGAA CGGAAACGAA
GGGTACAATT TCCCCGGCGG CAAGCGTTCG TTTTGGGCTC TCGCGAATCA AACCGGTGGA
AACGCAGAGG GAAAATTGCG CGTTGCCGCG CACGACGGTG AAGTGACGTG CATACAAGCA
CACTGCGCGA ATAGAAGCGT CGTCACCGCG GGTGTCGACG GGATGATTCG CGTGTGGAAA
TTTAGCGAGT TGAAGATTGA CTTGGAAATC GACGTCGGGT GCGGCGTGCG GTGCGGTCAC
TTGCACGAGG ACTCATTGCT CGTCGTTGGA TGCGCTGATA AGCACGTGCG CGTATACGAT
ACCATGACGG GTAAACGTGT TCGCACGTTC AAGCCACGCG GTCAGGAAAA TGAGGCTGGA
GATATCACGA GCGTGCAAAT CAGTGAAAAC GGTAAATGGA TCTTCGTCCT CGACACCACC
GGAACGATAC GAGTGTACGA TATTCCCGCC GCGAGATTGA TTCAACACAT GATTCTCGGT
GCGGATAAAG TCACCGCCAT GAGCTTTTCA CCGCGAATGG ACTTTTTAGC CACCGTGCAC
GAGAACAGGG TCGGGTTGTA TTTGTGGGTG AACATGCCGA TGTTTGATTA CGACACCAAA
TTGGCGTACG GGCGCAAAGT TTCCATCGCA CTTCCGAGAA AACACGCGGA ATCCGACGCA
GACGGCGGCG TGCGCGATGC GTCTGAGGAG ACGAACAAGG AGTCCAACGA TACCGACGTG
TACATTCACC CGTTCGAAGA AGACGAAGAA GAAGAGCATC GCAATTTGCA AGAGTTGGAA
GAATATTTCA AAGAAATGGC CACCGGCCCG AAACAAATAG CCCCTGGCAT GATTACTCTG
GCGATGATGC CTCAGACGCA AATCGAAATG CTACTGAATC TGGAGACGGT GCAAGCGAAG
AGTAAGGTGA AGACCGATGA CAAAGAACCG GAATTGGCGC CGTTCTTTCT GCCCACAGCC
GCCGCGAGCG ACGACGTGCG TCGATCTGTC TTCGATCCCG CGCGAGAATC CGAGCTCAAC
GCCAAAGACG ACACCGGCGA TGATGATTCC AAAGCACCGA AGAGTCGAAT TTTGCGTCAA
GGCGCGGATC TCGCCGGCGC GTCCGCGACG CCGCTCTTGA CGTTGATTCT TCGAGGAGAA
CGATCGGAGG ACTACACCGA AGCGCTCGAA TTCTTGAAGA ATGCGTCGAT ACACGTCGTC
GACGCCGAGC TGCGATCGCT CGGACCTTGG GATCACAAGT TCATGTCCGA GGACGACGTG
AAAACTTTGC GGAGCGCCAT CAAATTTTTC ACCGCCGCAA TCGCGAGCGG CATGTACTAC
GAGATGGTCA ACGCCCAACT CAACGTGTTT CTCAACGTGC ACACCACCGC GATCATGCAA
TCCAGCGCGC TCGTCGACGA GTGTCACGCC CTGCGAGAGG CCATGCACAA ATCCTACAGC
AGGATCGACG ATTTGTTCAA CGAAATTCGG TGCACGCTCA GTTTCCACAT CGGCGACGCC
GGCGTCTAG
 
Protein sequence
MALFAPYRAL GLVNAQRGCA LVVKRRGTET FVTVSAEDAF AVYDARKLTL VFRSARFATS 
DARGIGAMAV RKDYTFCAIG REIRCAKRLA ECCEGLGRDD GSGHDARVTT LHAFGRHLAS
VDENGAVKMW DIDDDAMRAR ERAWALGARA MEMPRTFAAT TVCHPDGYVD KLLFGSSDGR
LALMNVRAGK LVHEFAGWGS AVTALENSPA TDVVAVGLAD GRVLLVNVLE DKVLFTLTPE
RGVKVTALAF RTDDQDDVLC VGDETGRVTV WDLEKRSLRT LIVQCHEGPV VSLKFLDGQP
VMVSSGFDNT LKEWIFDRED GDARLLRFRA GHSKPPTSVS FYGEGKKLLA AGSDRTLRFF
HAFRDQQNIE LSQKNVSKRA KKIGVAEEEL KLSPVTKMAW GELRERDWAN VVTAHEGSNK
AYTWRISKGA LGEHILQCPK DDGKCEVKSV AISACGNFAF LGAANGAIHR FNLQSGAHRG
AFERGKLRVA AHDGEVTCIQ AHCANRSVVT AGVDGMIRVW KFSELKIDLE IDVGCGVRCG
HLHEDSLLVV GCADKHVRVY DTMTGKRVRT FKPRGQENEA GDITSVQISE NGKWIFVLDT
TGTIRVYDIP AARLIQHMIL GADKVTAMSF SPRMDFLATV HENRVGLYLW VNMPMFDYDT
KLAYGRKVSI ALPRKHAESD ADGGHRNLQE LEEYFKEMAT GPKQIAPGMI TLAMMPQTQI
EMLLNLETVQ AKSKVKTDDK EPELAPFFLP TAAASDDVRR SVFDPARESE LNAKDDTGDD
DSKAPKSRIL RQGADLAGAS ATPLLTLILR GERSEDYTEA LEFLKNASIH VVDAELRSLG
PWDHKFMSED DVKTLRSAIK FFTAAIASGM YYEMVNAQLN VFLNVHTTAI MQSSALVDEC
HALREAMHKS YSRIDDLFNE IRCTLSFHIG DAGV