Gene OSTLU_33567 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33567 
Symbol 
ID5003808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp428426 
End bp429616 
Gene Length1191 bp 
Protein Length396 aa 
Translation table 
GC content57% 
IMG OID640419229 
Productpredicted protein 
Protein accessionXP_001419594 
Protein GI145350398 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.416798 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0211717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCCG AGGTGGGGTT GATGGCGGCG TACACGCTGT ACGCGATGAT GACGCTCGAT 
GGGAATTGGG TGCTGTTGTT GGTTTTCACG CTGCTCTCGG CGCTCGTGGC GTACACGTGG
AGGGACGAGT TGAACTTGGT GGCGTCGATG ATTTCTGTGT CGACGATTAG CTTGTCGGAT
AACCCGCACC TGGTGACGGT GACGATCGGT TTGCAGTGTC TCGTGATGGC GTTCGTGGCG
CCGATGGCGT GGTTCGCCGT CCAGGCGTCG CAGCACGGCT CGGCGATTAT CAACCAATAC
GCCACCGAAT TCAGTAACGA CGCGTGCACT GGGTATTACG GTCAATCCGT CGATTGTTGC
AAGTGGAACA TCGACAGTTG GGTGGGGCCT TATTACGCGC TCGTCGTCAT AGCGTGCGTG
TGGTTCACGT CGTGCGCGCT CGAAGCGCGC ATGTACGTCA TCGGAGGCGT CGTCTCGCAG
TGGTACTTTG CCCCGGCCGG GACAAAGAGT TTCAAGGGCA CGACGAGAAC GTCCGTGAGT
AACGCGTACG GACCGTCGTT TGGGACGATT GCGTACGGCG GCTTCGTGAT CACCGTCGTC
GAAATAATTC GAAGCATGGC GAACAAGTCT CGCCGGGAAC GCAACAATTA CGGCAACCCG
CTTTGTTGCC TCTTTTACGC GATGCTGGAC TGCATCTTTG CCGTTATCGA GTACCTCAGT
CGATTTGCCA TGATTCAGGC TTCGATCACC GGCGAAGCGT TTTGCGATGC CGCGAGGAGC
ATCAACGATC TCCTCAAAAG AAACTTTCTC TTGGCGTACG GCGCGTACGC CTTTCCGAAA
CATATTTTGG GCTTCCTCGT CTTCGTCTTG GCCGCCCTTC TCGGCTACTG CGTCAACATT
TTGAGCAAGC ACGTCTTCGC CGCGAACTCC CTCGGCGCGA TCGTCAACGG AATCGGCTCC
TTCTTCATCG CTTACATCGT CCTCAGCTTC TTCGTCATGA TTTTGCTCAA CTGCGTCGAC
GCCGTCTTCG TCTGTTACGC CTTGGACAAG GATCGCGCCG CGGTGCACCA TCCAGACTTG
CACAAAGTCT TCGACGAGGT CACGCGCAAG CAGCGCGCGA TCGAAGAGTC CGATGCGGAG
GGTATGGAAG AGCCATTGAT CTCGGGCAAG CCCAAGTACG CGTCCATGTA G
 
Protein sequence
MIAEVGLMAA YTLYAMMTLD GNWVLLLVFT LLSALVAYTW RDELNLVASM ISVSTISLSD 
NPHLVTVTIG LQCLVMAFVA PMAWFAVQAS QHGSAIINQY ATEFSNDACT GYYGQSVDCC
KWNIDSWVGP YYALVVIACV WFTSCALEAR MYVIGGVVSQ WYFAPAGTKS FKGTTRTSVS
NAYGPSFGTI AYGGFVITVV EIIRSMANKS RRERNNYGNP LCCLFYAMLD CIFAVIEYLS
RFAMIQASIT GEAFCDAARS INDLLKRNFL LAYGAYAFPK HILGFLVFVL AALLGYCVNI
LSKHVFAANS LGAIVNGIGS FFIAYIVLSF FVMILLNCVD AVFVCYALDK DRAAVHHPDL
HKVFDEVTRK QRAIEESDAE GMEEPLISGK PKYASM