Gene OSTLU_16903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16903 
Symbol 
ID5003520 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp604954 
End bp607908 
Gene Length2955 bp 
Protein Length785 aa 
Translation table 
GC content65% 
IMG OID640418941 
Productpredicted protein 
Protein accessionXP_001419852 
Protein GI145350944 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.500288 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00972481 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCCCGCCC TCGCGCTCGA CGACCGCGAT CTTCGCGGCT CCGTCGCGCT CAGCCCCGAC 
CGCACGCGCG CGCTCGTGCG CGATGGCGTC CATCTGACGC TCCGGCGCGC CGCGCGCGTC
GACGAACGCG TCGGGAAGGC GATACAGTTT GAAAGCACAA TCGCGTCTGT CGCTTGGCAC
GCGACGCGCG ACGTCTGCGC GGTGGTCCTC GACGACGGCA CCGCGTGCGT CGTCACCGTC
GACGACCGCG GGTTGACGCT CGAGCGCGTC GCGCGCGGCA TCGGGGCGTG CTCGCGGGCG
TTCCGCGCGG GCGACGGCGC GCTCGCGTTC GCCGGCGACG GCGACGCGTC GACGACGATG
CTCGACGACC GAGACGGATT CGACTTCGTT CGAATCGCGC ACGCGAGGAA GAGTAGCTCC
AGATTTAGCG CGCACGCGCG GTCGCGAGAT GGACGGTGGA TGGCGTTCGT GACGCGCGAC
GACGAAGCGC GCGAACGCGT GGAGGTGTTC TCGGCGACGG CGCCGTACGA AGGCGCGATC
TCGTTTCGCT TGCCTCCGGG CTTCGATGCG CAGATGATCG AGTTTGGCGG CGCAGGGGGG
CAGGAGATTC TAATTTATGA CGATTCGTGC GCGGTGGAGG CGCGACCGGC GCTGGAGATT
TTTTCGGCCG AAGGCCTCCG GCGAGCGACG ATTCGAGGCG CGTCGGCGCC GAGCGTTCGG
GCGAAGGATG GACGTCTCAT CGTCACGTGC GTGTCGAACG AGGCCCTTTT ATGGATCGAT
GAATTGACTT GGCGGATCAC GCGCGCGCTC ACGCACCCGC TGACGTGCGA CGCGACGGCA
TCGACGAGAA TTTTCCGCGA GTCTCAGCGC GGAGACGGAT ACGAAAAAGT CGCGCGGTTT
GAACAGAAAA CTTTGGCGTG CGACGCGCGA GGTATCGAAC GAGCGGGCGT GCGACTGGCG
CTGAGTTCGT GTAAGACGAT GCTCGCGACT CTGTGCGCGA GCCACAACGA CAAGGTTTTA
TTTCTATGGC CGGCGTCCGG GAACGACGCC GACGACGGAG CTCCGCTCGC CGTCTTCGTT
CACAGAAAAC CAATCATCGA CTTCAGGTGG TTCGAAGACG ACGACCTGGA CGGTAGATCA
CGCCTCGTGT TTCTTTGCGC AGACGAGCCG GCGCTTTATT CTTTCGTCCC GGGCATGCTC
GCGCCCGCGC GAGTCGCGCT CGAGCGCGCC GCCGCGTCGT TTTCGCCCCG CGAAATCGTC
GGCGCTTCGC TCGAGCGCGC CGACGTCTTC ATCGTGGCGA GCGCGTCGAA ATCATTCGTT
CAACAGGCCG TTCGCGCGTA GCCCATCGTT CGATCGTCGC GCGCGCGATC GATCGAAATC
CTCCGCGCGC CGCGCCGGGT CAGTCAGGGC TCCGCCAACC GTTGTAACAC CCCATTCCGA
CGACGCGTCA TTCATGGCAC CCCACTCCGC ACATTCCGAC GCGTCATGGC GCCCGCGCGC
GACGCGACGG AGTCCACGGC GCTGCTCCGC GCGGCGAACG AGCTCGACGT CGCGCGTCAC
CGCCGACGCC GCGCGACGAT CGCGTCCGTC GCCGTCGTCG CCGTCGCGGT CGCGTCGATC
GTCGCGCGCG GGCGCGCGTC TCGCGAGGGC GCCGGCGCGC CGTTCGAGGT GCAGTTTCAG
GTCGACGTCG GGTGCATCGA GCTCGACGTC GTCGATCGCT CGCCGATAGA GGACTTCTAC
GCGTCCGACA TCGCGAGCGT GCGGTTGCTG CGACGAGGGG ACGATCCGAT TGAAGAGTTC
GGCGGCGCGA GCCGACGCGC GGGCGGGGTC GAGCTGACGC AGAGCGGATG GTCGACGATA
TACGTCGGCC GAGCGCGCGT GCGCGCGGGA GAGGAGATCG GGTTCGGGCT GGTGAACGCG
CGAGGCGAGA CCATCTTCGA ACTGGGGCAC AACAAGCGGT TTCCGGCGAC GGGCGAACTC
GCGAACGCGA CGTGCCTGGC AAACGTCCCC GCGGGCGGGG GCGCGTACAG AAATCGCGTG
ATTCCGAAAA GGTCGAGCAT GCGGCTCGTC GACGGCGTGC GTCGATTCGA GACGACGTGG
GCTGGATGTC TCGAGCGGTG CCCGCTGACG GTGAAGCTCA TCGCGTGCAC CGGTCCGGCG
ACCGGCGACG CGGACAACGT CTGGGGAATC GAAGAATCGG TGGCGAAAAC TGATGTCTGG
CGTAACTACA AAGGTCACTT TACCTCGGTG AGTTCGGGAG AGTTCAACAC GTGGGGTTTA
AATACCGTGA CTGGAACTCT CGCCTGGACG CGTAACGCCG ACCTCAATCA GTTCTCTCGC
GATAACTGGA ATACGGCAAC CAACAATCCC GTAGCCAACG GCGTCGCCCA AGGCACCATG
GTTGACTTTG ACGCGGGCTA CGATGCGCTC ATCGGAGTCA CGGACACGAG TATACACACC
TCCGGCGGAT ACATGTGGAG TCGTCCCGTC GACGGCTCGG GCGATTGGGG TTTCGCCGAT
GACGGCGGCG GAAAAGGTGT CCAAGTGACC ATCGGTCGCA CGCACCACTT TCACATGAAC
AAGCAACACA ATATGTGGAG CGCGGAACTG CCTAATGGCG GATGGGTTCG TCAACACGTA
AAAACCGTCA AGCAGGTCGA AGTCGGCGAT TCGGACGCGT TCGTCGTATA CCAAGATGGA
AGGACGCTGA AACGAAAAGC CTCGGACATG ACCGGCGACT GGTCCACAAT CTCCATTCCG
AGTGCGCTCT CGACCGCAAC GATCAGTCAA ATCACCGTCG GCGCCACCGC GCTCTGGCTT
CTCGACGGCA ACGGCAATCT CTGGGGCTGT GACCTTCCTT GCAGAGACAG CGGTGGCTTC
GTTCGCGCCG CCAACGCGCC CGCGAACATC ATCTCCATCG ACGCCGGCAA GGTGATCCAC
AACGTCCCGA ACTAG
 
Protein sequence
MPALALDDRD LRGSVALSPD RTRALVRDGV HLTLRRAARV DERVGKAIQF ESTIASVAWH 
ATRDVCAVVL DDGTACVVTV DDRGLTLERV ARGIGACSRA FRAGDGALAF AGDGDASTTM
LDDRDGFDFV RIAHARKSSS RFSAHARSRD GRWMAFVTRD DEARERVEVF SATAPYEGAI
SFRLPPGFDA QMIEFGGAGG QEILIYDDSC AVEARPALEI FSAEGLRRAT IRGASAPSVR
AKDGRLIVTC VSNEALLWID ELTWRITRAL THPLTCDATA STRIFRESQR GDGYEKVARF
EQKTLACDAR GIERAGVRLA LSSCKTMLAT LCASHNDKVL FLWPASGNDA DDGAPLAVFV
HRKPIIDFRW FEDDDLDEDF YASDIASVRL LRRGDDPIEE FGGASRRAGG VELTQSGWST
IYVGRARVRA GEEIGFGLVN ARGETIFELG HNKRFPATGE LANATCLANV PAGGGAYRNR
VIPKRSSMRL VDGVRRFETT WAGCLERCPL TVKLIACTGP ATGDADNVWG IEESVAKTDV
WRNYKGHFTS VSSGEFNTWG LNTVTGTLAW TRNADLNQFS RDNWNTATNN PVANGVAQGT
MVDFDAGYDA LIGVTDTSIH TSGGYMWSRP VDGSGDWGFA DDGGGKGVQV TIGRTHHFHM
NKQHNMWSAE LPNGGWVRQH VKTVKQVEVG DSDAFVVYQD GRTLKRKASD MTGDWSTISI
PSALSTATIS QITVGATALW LLDGNGNLWG CDLPCRDSGG FVRAANAPAN IISIDAGKVI
HNVPN