Gene OSTLU_36075 details


Gene Information

Locus tag: OSTLU_36075
Symbol:
ID: 5000144
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 318845
End bp: 320212
Gene Length: 1368 bp
Protein Length: 447 aa
Translation table:
GC content: 53%
IMG OID: 640415565
Product: predicted protein
Protein accession: XP_001416368
Protein GI: 145343519
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00709127
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGAGG TAGAAGTCGC GGTGAGCGCG TGCACGGTGA AGGAGGCCAA ATGCTCGACG 
TGTAAAACTG GTTACAGTGA CGTCGTGCTT TTACAAGCAT TCGATTGGTT GTCGACGAAG
CGTTCGTTAC ACAGCGACAA GTCTTACTAT CGCAGAATTG AGGATCGAGT GCCGATGATA
GCGGATTTCG GGTTTACGCA CGTTTGGTTG CCTCCACCGT CGTTATCGGT AGACGAGCAC
GGTTACATGC CATCGGAAAT TTACAACTTG GACGGTAGCG AGTACGGCGA TGAAGCGGAG
CTTAAATCGT TGGTACAGGC TCTGAAAAAA GCCGGAATAG TAGCCGTGTG CGACATCGTC
ATCAACCATC GTTGCGCTGA GTACGCTTCG GATGGCCGCT TCATCTCGTT TGCGGACGAA
GTAACGCCGA GCGGGAGACG AATAAATTGG GGAGCTTACG CCATCGTCGG CGACGATCCA
TTTTTTCGCG AAGGTCAAGG AGCCAACGAT AGTGGCGACT CGATCGAAAT CGCCCCTGAT
CTCGACCACA CAAACGCCGA GATTCGCGAA GCGATCATCG AGTGGTTGAA CTGGTTGAAA
GATGACATCG GTTTCAGCGG ATGGAGGTTC GATTTCGTCC AAGGCTACGC TCCGAATTTC
GTGAGAGAGT ATGTGGAGAA AACGGTTGGA TTTGAGCAAT TCTGCGTCGG CGAGAACTGG
GTCGGGATGA CGTGGTCGGG AAGCTTTCTC GAGTACAATC AAGACAAGCC GAGACGCGTG
CTCGTGGATT GGTTGAACGC CGCAGACGAA TGCGCGGCGT TGTTCGACTT CGTGACCAAG
GGAATTCTAC AAGAAGCAGT CAAGCGAGTA GAGTTTTGGC GGCTACGAGA CCAGCAAGGC
GGCATGCCTG GGCTTGCCGG CTGGGTACCG CAAAGTGCTG TGACATTTCT CGACAACCAC
GATACCGGAT ACCCGCAGAA TCACTGGCCG TTTCCACTCG ATCGTCTCGG TTTGGGTTAC
GCGTACACGC TTCTGCATCC CGGCATTCCC TGCGTGTTTG GCCCGCACAT TTGGTGCTGC
GACGAAAACT TGGGTTGGTC CTAATCGCTA ACATCAGAAA TTCGAGCTTT GTTGAGCTGC
CGTAAGCTCG CCAACGTGTG CTGCGAGAGC AGAGTTGACA TCAAAATCGC CGAGAGCGAT
TTATACGTCG CGGTCATCGA TGACAAAATC ATCGTCAAGC TAGGGCCGAG ATACGACGTT
CCGGGTGAAA TACTCGCTCA AATCGCAGAG TTCGAGCTCG CAACGCACGG CGACGATTAC
GCGGTGTGGA TTCGAAAAGA GTTACTCGAA CAACCGTTCG AGGAGTGA
 
Protein sequence
MDEVEVAVSA CTVKEAKCST CKTGYSDVVL LQAFDWLSTK RSLHSDKSYY RRIEDRVPMI 
ADFGFTHVWL PPPSLSVDEH GYMPSEIYNL DGSEYGDEAE LKSLVQALKK AGIVAVCDIV
INHRCAEYAS DGRFISFADE VTPSGRRINW GAYAIVGDDP FFREGQGAND SGDSIEIAPD
LDHTNAEIRE AIIEWLNWLK DDIGFSGWRF DFVQGYAPNF VREYVEKTVG FEQFCVGENW
VGMTWSGSFL EYNQDKPRRV LVDWLNAADE CAALFDFVTK GILQEAVKRV EFWRLRDQQG
GMPGLAGWVP QSAVTFLDNH DTGYPQNHWP FPLDRLGLGY AYTLLHPGIP CVFGPHIWCC
DENLEIRALL SCRKLANVCC ESRVDIKIAE SDLYVAVIDD KIIVKLGPRY DVPGEILAQI
AEFELATHGD DYAVWIRKEL LEQPFEE
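
The length and GC content fields can be cross-checked against the coordinates and the sequence block. The following is a minimal sketch, not part of the original record. The helper names (clean_sequence, span_length, gc_fraction) are hypothetical, and it assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported gene length of 1368 bp.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in space-separated groups into one string."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates copied from the Gene Information fields.
gene_length = span_length(318845, 320212)

# First line of the gene sequence block, used here only as a short sample.
first_line = clean_sequence(
    "ATGGATGAGG TAGAAGTCGC GGTGAGCGCG TGCACGGTGA AGGAGGCCAA ATGCTCGACG"
)

print("gene length from coordinates:", gene_length)
print("bases in sample line:", len(first_line))
print("GC fraction of sample line:", round(gc_fraction(first_line), 3))
```

Passing the full gene sequence block to clean_sequence and then to gc_fraction gives a value to compare with the GC content field. Because the record marks the gene as spliced, the 1368 bp genomic span is not expected to equal three times the 447 aa protein length plus a stop codon (1344 bp).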