Gene OSTLU_19010 details


Gene Information

Locus tag: OSTLU_19010
Symbol:
ID: 5006746
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009374
Strand:
Start bp: 225999
End bp: 228383
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table:
GC content: 66%
IMG OID: 640422167
Product: predicted protein
Protein accession: XP_001422527
Protein GI: 145356623
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
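
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS span that includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Residue count for an unspliced CDS; the stop codon (if included) codes for nothing."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(225999, 228383)
print(length)                     # 2385
print(protein_length_aa(length))  # 794
```

Under these assumptions the start and end positions reproduce the listed gene length of 2385 bp and protein length of 794 aa.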


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.386268
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCGCG CGCGCGAGCC CGCGGGCGAC GCGGGCGGTC GCGCGCCGTC GCTCGACGCG 
CTCGACGCGC CCGATCGCGC GCGCGACGAC GCGCGCGACG GCGCGAACGA CGTCGAATGG
CGAACGGTCA ACGGACGCAA AAATGCGAAA TCGTCGCGCG AGCGTCTCGC GCGGTCGTCG
TCGTCGCAGT CGCGCGCGGA GGAGATCGCG GACGCGAGCG TCGCGAGGGT GAACGCGTAC
GAGGCGATCG CGCCGCCGGG AACGCCGACG CGAAGCGAGA GGGAAGGGGA GGGCGAGGGA
ACGGAGGGAC GAGACGCGAG ACGACGCGCG AAGCGACGCG CGAGACGAGG CACGCGACGG
ACGAGCGCGG CGCTGCGCGT GCTGGGAGGC GCGAGCACGG GAGCGAGCGA GGATGAGTAC
TACGGAGAGG GCGCGGACGG CGGTTGGGGA ATCGGAGGGA TCGCGGCGCG AGCGGCGGCG
TTGGCGGCGG CGACGGCGAG GGCGAGATTC GATACCGCGC GCGAAGCGAC GACGACGCCG
GATTCGGACG CGGCGGGGCG AGGGAAGGAA GAGGGCGAGG TGGTTCCGTT GATGATGCGG
ACGTTTCCGC CGATCGGGGG GTCGAGACCG CCGTTGCCGG GACGAGCGAC GGTGAGGGCG
CCGATGTCGC CGCCGAGGCC GAGACCGCCG AAGACGCCGA AGACGCCGAA ATCGCCGACG
ACACTCGAGC GCACGAAAAG TGGGCGTCAC GGAGGGGACG GGGAGGAGAG AGAGCATCAA
AACGCGCTGG ATTGGTTCTT TCATCAGATG GATCTCGCCT TTGCGGGCGT CATGCTCGCC
GTGTACGCGT TTTACGACAC GGTGATGAAA TTCATAGGTA TCAAGGTTTT GCGCGTATCT
CAGAGTTCTC GGAGCGTGCG ACAGACGGAG GCTCGAGTGG CCGAAGAGCG CGCGGCCGTG
GAGGAGTTTG AGGAGATTTT GTCGGCGCGG CAAACCGCGA CCGAGGCGTC GTCGAGGTCG
CCGTCGGCGA GCGACGTTGA CGGCGCAAAC CGCACGAGTC GACCTCGCAC GCCTCCGAGC
GCGTCGACGA GCGCGACGAC GTCAAAACAC GCGACGATAC CTTCGGTTGA GCAGCGCGTG
GTGGAAGAAC GCGAGGAAGG TGAACTGACG CCGTCGCACG CTTTACGGCG ACGACTAAGC
TCGAGTCTCG GCGGCGCGTG GGGTGGCGGT GGCAACACGC CACCTTCGTT CCGAGTTGCG
CGCGTCGACG AAGACGCGAC GCACGCGCAC AACGAGCTCG TCTCGTGCGT GGGCTCGCGC
GGCGATGAAT ACATAACTGG AGGGTGGGAC GGCACGCTGC GGACGTGGAA GTGGGATCCG
ACGAAAGGGC TGTCCGGAGG CTTGCCCATG ACCGGACAGC ACAACGACAA CGTCGAGTTC
CTGAGCGTCG ACGCGAGAGA AGACCACGAG CGACTAGCGA TTTCGGGTGG GCGCGATTGT
ACGGTGCGCA TTTGGGACGT CGCGAAGCGA TCGCAGCGAA GTCGTATTTA CGCGTTTGAA
AACATCGCGA GCGGGTGCGT CGACTGGGAG TCGCAAACAG TCGCCGTGGG CTCACGAGGA
GGCGCGGTGA TGTTGTGGGA CGCCGAAAAG GGATCCAAAA AGTGCACGCT TCGCGGGCAC
GATGGTGAAG TCACATCCAT GTGCACGTAC GATTGGTCCG AAGGTGGCGC CACGCTTTAC
GTCTCCGGCG GCGCTGACGG CACGGTTCGC GTGTGGGACG CTCGTCAGCA TGTCGCCGTT
GCGACGATGA CGGAGCATCG TCGACGCGTG TACGCTGTGT GTCCGGGTCC AAAGGGTATC
ATCTTCGCCG GCGATTTTTC GTCGAACGTC AAGGTTCACT CTTTATCCAA CCCGGGCGCG
CTACCTCGCT TGCTGCCAAA CGTGCCGAGC ATGGACGGCT GCGAAGCCCC GATCGCGGGG
TTGCAATACG TGAAACTCGA CGGCATGAAC GGTGGCGGCC TGTTGCTCTC AACCGCTGCT
TACTTCCCGC TCAACGAAAA CGGCGAGGAA TCCGACGACG ACGACGCTCC GCAAGGCTGC
GTCCACGTTC GCGCCGTCGA CGCCACGGGC GCCGGCGTCG GCCCGGTCTC CGACCAAGAC
GGCGACGGTT ATATGTACAC CCTGAAAGGC ATCGAAGGGT TGCTCACGTG CGCGTCCCTC
ACCGCCACAT CCGACGGTCA TCGCATGCGT CTCGTCGTCG GCGCCGGATC TGGCGCGCTC
GGCGCGTACG CCGAGGGCGG CGCGCTCAGC GGCCAAACCG CCGACGACGC CTACGCGTCC
ACCATCGAAC GCGCCGACGA CTTGGGCGTG GAATCTTTCG ACTGA
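
A GC content figure like the one in the gene information can be recomputed from a sequence in this 10-base block layout. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases after whitespace is removed; the function name is hypothetical and the database's own method is not documented here.

```python
def gc_content(dna):
    """Fraction of G/C bases in a DNA string; spaces and newlines are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First 10-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_content("ATGGATCGCG"))  # 0.6
```

Passing the full gene sequence (all lines joined) gives the value to compare against the reported percentage.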
 
Protein sequence
MDRAREPAGD AGGRAPSLDA LDAPDRARDD ARDGANDVEW RTVNGRKNAK SSRERLARSS 
SSQSRAEEIA DASVARVNAY EAIAPPGTPT RSEREGEGEG TEGRDARRRA KRRARRGTRR
TSAALRVLGG ASTGASEDEY YGEGADGGWG IGGIAARAAA LAAATARARF DTAREATTTP
DSDAAGRGKE EGEVVPLMMR TFPPIGGSRP PLPGRATVRA PMSPPRPRPP KTPKTPKSPT
TLERTKSGRH GGDGEEREHQ NALDWFFHQM DLAFAGVMLA VYAFYDTVMK FIGIKVLRVS
QSSRSVRQTE ARVAEERAAV EEFEEILSAR QTATEASSRS PSASDVDGAN RTSRPRTPPS
ASTSATTSKH ATIPSVEQRV VEEREEGELT PSHALRRRLS SSLGGAWGGG GNTPPSFRVA
RVDEDATHAH NELVSCVGSR GDEYITGGWD GTLRTWKWDP TKGLSGGLPM TGQHNDNVEF
LSVDAREDHE RLAISGGRDC TVRIWDVAKR SQRSRIYAFE NIASGCVDWE SQTVAVGSRG
GAVMLWDAEK GSKKCTLRGH DGEVTSMCTY DWSEGGATLY VSGGADGTVR VWDARQHVAV
ATMTEHRRRV YAVCPGPKGI IFAGDFSSNV KVHSLSNPGA LPRLLPNVPS MDGCEAPIAG
LQYVKLDGMN GGGLLLSTAA YFPLNENGEE SDDDDAPQGC VHVRAVDATG AGVGPVSDQD
GDGYMYTLKG IEGLLTCASL TATSDGHRMR LVVGAGSGAL GAYAEGGALS GQTADDAYAS
TIERADDLGV ESFD
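
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below assumes the standard genetic code, because the Translation table field in this record is empty; the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence.
print(translate("ATGGATCGCG CGCGCGAGCC CGCGGGCGAC"))  # MDRAREPAGD
print(translate("GAATCTTTCG ACTGA"))                  # ESFD
```

Both fragments agree with the start (MDRAREPAGD) and end (ESFD) of the listed protein sequence, and the final TGA codon is read as a stop.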