Gene OSTLU_18782 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18782 
Symbol 
ID5006366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009372 
Strand
Start bp6042 
End bp8315 
Gene Length2274 bp 
Protein Length757 aa 
Translation table 
GC content54% 
IMG OID640421787 
Productpredicted protein 
Protein accessionXP_001422314 
Protein GI145356179 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.546086 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGCGT GCGATGCGCG GGCGACTCTG CCCGCGCATC GAACGCAGCG GCGCGCGCGA 
CGACGAGCGC TTCCGATAGT ACTCGTCGCG CTGTGCGCGT GCTGGCTGTG CAGAGAGCGC
ACGCGCGAGC GCGCCCGAGA AAGCGATGAT GTTGCACTCC AACGTAAAAA GTCGCGAGTG
GTCGTCGAAC CGTGCATCGG AGGTCGGCTT GAAGCGTCGA TAGCCGACGA ACCGGAATCG
ATTCGCTTCG CTACGTTTGC CTTTGCGCGG AACGCTGGCA AAAGATGGAG CGTGGAAAGC
ACGAAGATAG CTTTGCGGCT CATGTACTCG TCGTTGGCGA GATCGCAAGC AAGGAAACCG
CCGTGTTTAC ACGTGTACAC GGACACGCCG GGCGTGATTC CGCTCAAGAC CACGTTTGGC
ACAGAGGTTG ACGTAATCAC GCACGCGTGC GACTCGCAGT CTTTTCCGCC AAACGCTTAC
ACGAACGTCG GGCCGTGGGC GGCGCTTTCA CGAGCGAAAC TTGATGCTGT AGAACATTTA
ATGACGGTGT TCGGTGCGAG GGTGATATGG ATCGATCTCG ACACTCTCGT CTTCGTAGAC
TTGGGACAGA CGTTCAGGCA AAGCTCGTCG TGGGTCGTGG GTTACCAAAG AGGCGCGCAC
TGCGAGGGAA TAAGAGCATG TTTTCATTCC GCGCACTCTT CGGTTCGACC GGAATTCGAC
GCACTCGGAG ATCTCTGGTC GCTCGATCGC GAAACAATAG CAAAGGTCAG AGACTTTGAA
CGGAGGAGGA TTACGTCCAG CGCGAGGACG CCACCGAAAT ACGATTTACA AACGTATTAC
GGACAGATGC TCGAGGAAGA AATGTTATCG AGCGATGTGT TGCTGCACAA ACTTTTGTCG
AGTCATAACT TTGGTTTCTT TTGTTCCAAC TTCATGCATC CGACGGTTGA AAACTTGGAA
CTTTCAATTG ACGAGGAGGG GAATCTCGTC TGTCCTCGAC GGCCGGATGT CAAAATGGGC
GAACGCGTGG GCGCGATTTC GTTCACAGCC AAGACGTTTC AAAGTATGTT TTCCGCAGAC
GGCGATGTAT TCGAAAACAT AGCGCATCCC GGAGCGCGAC GATGGCTGCG CGACTGGTTT
TATGGTCCTA TCGCTAGAGG GAGCAACGGC TCGGCGCGCC CCGGCGTCGA CAAAAACGTC
GATTCAAAGC CGAAGACTGA TCACGAGCGA ATGGTTTTCA CGCCAATGAA GCCATCGAGC
GATAGGTACG ACGACACAGA TATCGTTTCA GTTAAGCGAT GGTACGGTCG GCTCGGCAAT
AGGGTACGGA TATTTGCGAT GATGCTCGAG GACGCGCTGG TCCGCGGTTG TCACGTGCGC
ATACTCGACG ACATACTACC GGGATGGAGA TCAGAAAATA CGTTTTTCGC AAACAAACAG
ATCGATGGTG TGCGGAATAA GCGTGCTGCG AAGAAGTGTC AGAAACTGAG CGGGCGTCGC
TGGTACGATA CGTACCTCAA ATCGAAAGAT GCGCTTCGTC AACAGAACGG TTTTGCAGAC
GACCGAATGT CTATACCAGA CGGGTCTGTC GTAACCAATG CTATCTCGCG TTACTTTGAA
ACGAACGTGA CGCACGCGTT TGGTCGTGCC TGCGAGAGTG TCGGCGATGA TGTGCTCGCG
GTGCACGTGC GCGCGGGAGA CATCGTCAGC GGCTCCTACA GTCGTTGGAC GGGTCATTTC
GTTGCGAAAG AGCCGTCAAA GCACACGACG TACGGACCTT TTCCGACGTC GTACTACGCG
AGCGTGCAGA GTTACGCTTC GGAATCGGCA TTACAAGTGC GAGTTTTCTG CGAAGACTTG
AACAATCCAA CGTGTATGTT TTTTCAACAG CTCGCCGCGG TACTTCCAAA CGTCACCATG
AGACTTGGAC GGGACTTGAT TTCTGACTTG GTGGAGTTTA ACTGCGCCGC TCGCGTAGCG
TTTTCGTACG GCTCGTTTCG AGATGCGCTA GTTCTAAGAA ATCGTCCTTT ACGAACGCAC
GACTTTTTCT TTGACAGAAA TGAAGCCGAG ACTACGTGCT CGCGCGCGTC GCGTGGATCG
AGCGCGCGAC ACCGCCGGTA TTTCTTCGCG TTGAAAAGCG ACGAAATAGA GTATCAAAAA
TGGATTCGCC GGAATAACTG GCGCAACACG GCGAGGCAGC GCCACTTGGT TGACAAACAC
TATCAAATTG GTTTCGTTGA ATGCGAGAAC CGTGGAAAGG ATTTCTTCGG GTAG
 
Protein sequence
MFACDARATL PAHRTQRRAR RRALPIVLVA LCACWLCRER TRERARESDD VALQRKKSRV 
VVEPCIGGRL EASIADEPES IRFATFAFAR NAGKRWSVES TKIALRLMYS SLARSQARKP
PCLHVYTDTP GVIPLKTTFG TEVDVITHAC DSQSFPPNAY TNVGPWAALS RAKLDAVEHL
MTVFGARVIW IDLDTLVFVD LGQTFRQSSS WVVGYQRGAH CEGIRACFHS AHSSVRPEFD
ALGDLWSLDR ETIAKVRDFE RRRITSSART PPKYDLQTYY GQMLEEEMLS SDVLLHKLLS
SHNFGFFCSN FMHPTVENLE LSIDEEGNLV CPRRPDVKMG ERVGAISFTA KTFQSMFSAD
GDVFENIAHP GARRWLRDWF YGPIARGSNG SARPGVDKNV DSKPKTDHER MVFTPMKPSS
DRYDDTDIVS VKRWYGRLGN RVRIFAMMLE DALVRGCHVR ILDDILPGWR SENTFFANKQ
IDGVRNKRAA KKCQKLSGRR WYDTYLKSKD ALRQQNGFAD DRMSIPDGSV VTNAISRYFE
TNVTHAFGRA CESVGDDVLA VHVRAGDIVS GSYSRWTGHF VAKEPSKHTT YGPFPTSYYA
SVQSYASESA LQVRVFCEDL NNPTCMFFQQ LAAVLPNVTM RLGRDLISDL VEFNCAARVA
FSYGSFRDAL VLRNRPLRTH DFFFDRNEAE TTCSRASRGS SARHRRYFFA LKSDEIEYQK
WIRRNNWRNT ARQRHLVDKH YQIGFVECEN RGKDFFG