Gene OSTLU_16966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16966 
Symbol 
ID5004127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp70701 
End bp73310 
Gene Length2610 bp 
Protein Length870 aa 
Translation table 
GC content58% 
IMG OID640419548 
Productpredicted protein 
Protein accessionXP_001419892 
Protein GI145351034 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGA AGAAACGCGC GCGGGGCGGG CGGCGGGTGA CGGTGGACGC GGAGGTGGAG 
ACGTCGCAGC GGGAGCGGGT GACGTACGGG GCGCCGAGCG GGACGACGAC GACGACGACG
ACGGCGACGG CGACGACGAC GACGGAGGCG AATGGGAGCG CGCGGCACGG TGATAGCGCG
GCGATGGAAC GATTGTTCGC GTTAGGCGCG GGAACGGTCG CGGGGGGGGG CGAAAACGAT
GCTGATGATG GATTAGGGGT GTGGTTGAGC GAGAGCGAAT TCGGCGGCGG CGGCGGCGGC
GCGAGGGGGA AGTCGAACGC GGTGGGCGCG AACGGCGAGC GTGGGAGGCT TTCCGTGCCC
GTGGCGCCGG CGTGGGAAGA ATCATCGGCG GAGAAGTCAC CGGGGGGGCG GAGACGACGG
GCGAGCCGCG ATGGCGAGGA GCGAAGACGA TCGGAGGATT TGTTTGAGGA ATTGTTGACA
AACGTCGAGC GCGGCGCGCC GGCGCGAGAC TCCGCGCCGG CGACGATGGA AGTCGCGACG
ACGACGACGG CGTCCGCGGT GAATACACGC AAAGAAGGTG ACCACACGCT TGCGCTGGCG
TCGATGGATC CCGATTGGAA CGACGAGGAT GAGGATAGTC AGGCGCTCAT CGACGCCGTG
ACGCTCGCGG AGAAGCGCGC CAAGATGGCA AAGTCGGCTT CTGCGACGAC GACGCAAGCG
AGTAACGCCG TGCCTTCGCT GAGCGATATA ACCCTAGACG ATGGCGATGA CGACGACGAC
TTATTCGCCG CCGCGGTGGA TATGGCTGAA AAAGCGCGCG ACAGTCGCGT TGAGTCCGGA
ACGCCGTCCA CGGCGACTTC CCCTAGGAAA TCGCCTAGGA AATCCGCCGC GGTCGCGGCT
TGCGCGCGCA CGCAATCGGC GACTGCGCGA GTTAAAATTG AAGATGGCTT TATGGAAGAT
TGCAAGGCGG AGTTGCAGAA AGCGTTGGAA TTCGCGAGAG AAGAAGCGGA GGCAGAGACT
GAAATTCTAG GCACCTGGTT CGTCGAAAGC GTCGCGTCGC CGCGGATTGG CTGCGTCGAA
TCCCTCGTCC TGACGAAGGC TAGCGGTGCA AAGAAGACTG TCGTGCTGAA GGGTTATTGG
CGAGAAACCC TGGTTTCAGT GGGAGATTCT CTGATCTTGT ACTGTTTGCC CGGCGTGGCG
CGCAAAGAAG ACGTCGCGAA GGATATTGTC GAAGTCACTG ACGATGGAGG GGTGATGATT
ATTCTTTTCC CGGCGTACTT GATAAGTGCG ACGACGATAG GGAATGGTAT GGACTGCCCT
CGACAATCAG TGTTGCAGCA ACAACTGCAG ACGACGTGGG GTGACGCGAA CGAGGCGGCG
ACGATCGGTA CTATTATGCA CGATCTCGTT GAGCACGCGC TCCTCGTCGG ATCGAGCCGA
GCACTCGTGC CGATGGCTTC CAAGGTGAAG AGCGTGATCG AAAGCAACGT GGACAATATA
TTCGCCATCG ATTCAACCGA AAAGAAGCTG CAGCAAAGAA TAGATGCGAC CATTCCAGGT
GTGGAATCAT GGGCTAACAA GCTCACGGCG GCGTCTCGTG TGCCAAAGCT TGGTCGCGCG
AAGCCCGGCG GCAACGCCCA GGTGCGAATG AACAGAAAAG CTGCGCACGA TAAGCTTCGA
AGAAAGTGCA CTGTGGGCGT CGAGGCAAGC TTGGATTACA ATACGTCTGC AACGTTGCAA
GTTGACGACA TGATTGACAT AGAAGAACTC ATTTGGGCTC CAAAACTTGG ATTAAAAGGC
ATCTTGGACG GCGTCGCAAA CGCCGTCATA CGTCAATCGA AGGAAGATTT ACCATTGCCG
AGCGTCGTTC CGATCGAGTT GAAGACGGGT AAGTGGAAGT CTGTCGGTCA CGACGCACAA
GTCTTGTTTT ATAATCTTAT GATCGGCGAA CGCTACGGCA AAGTGTCTCC TTTCGGCGTA
TTGCACTACA CCACGGACGA TAATGACGAC GAAGGCACGA GTAAAATTTT CACGAATAAG
CCGGCTAACA TCAGCGCTCT GATGCAGCGT CGGAATCACC TTGCGGCAAT GTTAAGACCG
ACGCCCGAAG AAGCGATGCA ACACAACGCT CCAGTCCCGC ACGGCAAGGC CCGTCTCGGG
AACGGTAAGC TGCCGAAGAT GCAGCCACAA AGCTGGTGTG AGCGTTGCTT CTCACAGCAC
GAGTGTTTCA GCTTGCATCG CGCGCTCGAG GGAGGAGACG GCGAAACAAG CGAGCTCGGC
AATTTGTTCA TCGGCGCCAC CTCGCATCTG AACCAGACTC ACGAAACAGC AATCCGTCAC
TGGATTCATC TGATTGATTT GGAATCTTCC GAGTCTTTGC GCAAGCGTGC AACTCCGTGG
CTTCCGGTTG AACTCGTCAA TCGTCGCGCG AGCGACACGT ACGCGATCGA CGACCTCGCT
TTTGTTCGTG AAAGCACGCA AGAGAAGTCT GCATTCGTCG GTGATAATCA CTATTACATA
TTCAAACCCT CAAGTTCTGT CGCAGAAAGC GTTCTGAAGA GAGTTGGGAT CGCCGATCGC
GTCGTCTTGA GTCGCGACAA AGGCTTGACG
 
Protein sequence
MSGKKRARGG RRVTVDAEVE TSQRERVTYG APSGTTTTTT TATATTTTEA NGSARHGDSA 
AMERLFALGA GTVAGGGEND ADDGLGVWLS ESEFGGGGGG ARGKSNAVGA NGERGRLSVP
VAPAWEESSA EKSPGGRRRR ASRDGEERRR SEDLFEELLT NVERGAPARD SAPATMEVAT
TTTASAVNTR KEGDHTLALA SMDPDWNDED EDSQALIDAV TLAEKRAKMA KSASATTTQA
SNAVPSLSDI TLDDGDDDDD LFAAAVDMAE KARDSRVESG TPSTATSPRK SPRKSAAVAA
CARTQSATAR VKIEDGFMED CKAELQKALE FAREEAEAET EILGTWFVES VASPRIGCVE
SLVLTKASGA KKTVVLKGYW RETLVSVGDS LILYCLPGVA RKEDVAKDIV EVTDDGGVMI
ILFPAYLISA TTIGNGMDCP RQSVLQQQLQ TTWGDANEAA TIGTIMHDLV EHALLVGSSR
ALVPMASKVK SVIESNVDNI FAIDSTEKKL QQRIDATIPG VESWANKLTA ASRVPKLGRA
KPGGNAQVRM NRKAAHDKLR RKCTVGVEAS LDYNTSATLQ VDDMIDIEEL IWAPKLGLKG
ILDGVANAVI RQSKEDLPLP SVVPIELKTG KWKSVGHDAQ VLFYNLMIGE RYGKVSPFGV
LHYTTDDNDD EGTSKIFTNK PANISALMQR RNHLAAMLRP TPEEAMQHNA PVPHGKARLG
NGKLPKMQPQ SWCERCFSQH ECFSLHRALE GGDGETSELG NLFIGATSHL NQTHETAIRH
WIHLIDLESS ESLRKRATPW LPVELVNRRA SDTYAIDDLA FVRESTQEKS AFVGDNHYYI
FKPSSSVAES VLKRVGIADR VVLSRDKGLT