Gene OSTLU_19906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19906 
Symbol 
ID5006723 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp146764 
End bp148631 
Gene Length1868 bp 
Protein Length559 aa 
Translation table 
GC content61% 
IMG OID640422144 
Productpredicted protein 
Protein accessionXP_001422502 
Protein GI145356572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.722622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.566407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGCGA CCGATGGGAC GAGCGCGCGC GAGGCGCGTC GAAAGGCGCG CGAGTTGGAG 
GAGGGTCGAA AGGTGCGCGC GCGCTCGCGC GCCGAGCGCG TCGACGCGTC GCGCGAGCGA
CGCGCGGACG ACGCGCGAGG ACGATTTCGT CGCGCGCCGC CCGCCATTCC TCGCGACATT
CGTTTCGTCG CGCCTCGACG CGACGCGACG CGCGCGACGG CGGTGGACTG ACGGCGACGA
CGCGTTCGCG CGCGCTCCAG GCTGGGTTGA TCCCGCACGA GATCGATGAG GACGGGAACG
CGATCAACCC GCACATTCCT CAATTCATGG CGGCGGCGCC GTGGTACCTG AAGCAAGACG
GGCCGGGATT GAAACATCAA AAGGCGCCGA AAAAAGCCGA AGAGAGCGCG GAGTGGTACA
AGAGAGGCGT GACGACGACG AAGGCGACGA AATTTCGTAA GGGGGCGTGC GAGAATTGCG
GAGCGATGAC GCACAAGAAG AAGGATTGCA TGGAGCGGCC GCGCGCGAGA GGCGCGAGCA
AGACGCAAAA GGACATCGCG GCGGACGAGT ACGTGCAACC GGAGTTGAAG CTTGGGTTTG
AGAGTAAGCG CGATCGGTAT AACGGATTCG ATGTGGATGA TTACGTCAAG GTGGTGGAGC
GATACGAGGC GGCGGACGCG ATGAAGCAAA AATTGGCCAA GCAAAAAGAG TTGGAACGCG
CGTTTCGGCG GGCGAATAAA AAGGAGGACG ACGCGGCGAG CGACTCGGAT TCGGACGATA
CGAGCTCCGA CGACGACGAC GACGACGACG CGAAGGTTGC GGATAAGGCG GCGACGGGGT
TTGCAAACAT CAAACGCGCG GTGCGCGCGC CCGGAGGGGG CGCTTCCGGC ACGGTGCGTA
ACTTGCGTCT TCGCGAAGAC ACGGCGAAAT ATTTGCGCAA CCTGGATGTG GATTCGGCGT
ACTACGACCC AAAGACGCGC TCGATGCGCG AGAATCCGAC GCCGAACGCC GATCCCAAAG
ACAACTTCTT CCGCGGTGAT AACGCGGCGC GAAATGACGG GCAAGTGGTG GAGTTTGAGC
GTTTGAATCG TCACGCATGG GAGCAGGCGG AAGCCGGCGG CGCGAGCGCC ATTCACATGC
AAGGCGCGCC GTCGCAAGCC GAGGCGCTGT ACAAGCAATT CAAAGAAAAG AAGGAAAAGC
TCGCGGGAAT GAATAAAAAG AACATCATGG AAAAGTACGG CGACGCGAGC GCGGGCAAAG
AGCTTCCCGA CGGTTTGGCG CTCGGTCAAA CGGAGCAATA CGTCGAGTAC GACCGCGCGG
GCCGTCTCAT CAAGGGAACC GAAAAAGCCA CGGTGAAGAG TTGTTACGAG GAGGATGTCC
TTTTGCAAAA TCACACCAAG GTTTGGGGCT CGTACTGGAA CGCCGGTCAG TGGGGTTACG
CGTGTTGTCA AAGCATGGTG AAGAACTCGT ATTGCACGGG CGAGCGCGGC GTCGAAGCCG
CGCTCGCGAG CGAGCAACTC ATGGTGGACA ACATGGAGAA CAAGCGCGCG ATGGACGAGG
CGAACGAAGC GCGAGCGAAG TCGCAGCTCA ACGCGACGAC GAAACCGAGC GATCTGTGGG
GTGGTGATGT CAAGGATGAC GTCGAGATCG ATCCTCAAAA GCTCCTCGAA GCCTTGAAGC
GGCAAGACGA ACGCGAGGAA GCGCTCAAGC GCGGCGGCGA CGGGAAGAAC AAGCGCGGGT
ACAACGTCAC GCACGATTCG CAAGTCACGG CGGAAGACAT GGAGGCGTAT AGGATGAAAA
AGCGCGCATT CGAGGACCCG ATGAAAAAAG CGTCGGGCGC GGGGACCGAT GGTTACGATC
TAGTGTAG
 
Protein sequence
MGATDGTSAR EARRKARELE EGRKAGLIPH EIDEDGNAIN PHIPQFMAAA PWYLKQDGPG 
LKHQKAPKKA EESAEWYKRG VTTTKATKFR KGACENCGAM THKKKDCMER PRARGASKTQ
KDIAADEYVQ PELKLGFESK RDRYNGFDVD DYVKVVERYE AADAMKQKLA KQKELERAFR
RANKKEDDAA SDSDSDDTSS DDDDDDDAKV ADKAATGFAN IKRAVRAPGG GASGTVRNLR
LREDTAKYLR NLDVDSAYYD PKTRSMRENP TPNADPKDNF FRGDNAARND GQVVEFERLN
RHAWEQAEAG GASAIHMQGA PSQAEALYKQ FKEKKEKLAG MNKKNIMEKY GDASAGKELP
DGLALGQTEQ YVEYDRAGRL IKGTEKATVK SCYEEDVLLQ NHTKVWGSYW NAGQWGYACC
QSMVKNSYCT GERGVEAALA SEQLMVDNME NKRAMDEANE ARAKSQLNAT TKPSDLWGGD
VKDDVEIDPQ KLLEALKRQD EREEALKRGG DGKNKRGYNV THDSQVTAED MEAYRMKKRA
FEDPMKKASG AGTDGYDLV