Gene OSTLU_18701 details

Gene Information

Locus tag: OSTLU_18701
Symbol:
ID: 5006184
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 203570
End bp: 204939
Gene Length: 1370 bp
Protein Length: 376 aa
Translation table:
GC content: 60%
IMG OID: 640421605
Product: predicted protein
Protein accession: XP_001422230
Protein GI: 145355999
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
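
The reported gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state explicitly; the function name is hypothetical. It also compares the gene span with the minimum coding length implied by the protein length (one codon per residue plus a stop codon), which is shorter than the span, as would be expected for a gene the record marks as spliced.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
span_bp = gene_length(203570, 204939)
print(span_bp)  # 1370, matching the Gene Length field

# Coding length implied by a 376 aa protein: 376 codons plus one stop codon.
coding_bp = (376 + 1) * 3
print(coding_bp)            # 1131
print(span_bp - coding_bp)  # 239 bp of the span is not accounted for by codons
```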


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00776601
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCGGCCG CCGGTGCGTG ACGACCTCGA GAACGAGGCG TGGGAATTTT TCGCGCCGCC 
CGCGGATCGC GCGGCGTCGC GCGCGGCGTC GCGCGACGCG CGACGCGCGC GAGTTATCCG
ATGACGTGGG CGAGTGCGGA ACCCGGAAAC GGCGCGCGTG GCGGCGCGGT GCGGCGCGCG
CGACGACGCG CGCGGGGCGG GAGGCGATGA TTTGGAGGGA AAATAATACT GACGATGTTT
CGCCGCGCGC AGGGTACGAA TTGCGCGGGA TTTACGGCAC GGAGGAGTAC TGGCCGGAGA
AGAAGCTGAA GATTTGCGTG ACGGGCGCGG GAGGTTTCAT CGGGTCGCAT CTCGCGAAAC
GATTGAAAGA GGAGGGACAT CACGTCGTGG CGTGCGATTG GAAGCGCAAT GAACACATGG
AAGAGGCGAT GTTCTGCGAT GAGTTCATCT TGGCTGATTT GAGGCTGTAC GAAAACTGTA
AAAAGGTTCT CGAGGGGTGC GACCACTGCT TCAACCTCGC GGCGGACATG GGAGGGATGG
GATTCATTCA GTCCAACCAC TCCGTCATCT TCTACAACAA CGTGATGATT TCCTTCAATA
TGATGGAAGC GATGCGGGTG CAGGGCGTGA CGCGATGCTT TTACGCGTCG AGCGCGTGCA
TCTACCCGGA GGGCACGCAG TTGAGCACGG AGATGCAAGA CGGGTTGAAG GAAGCGAGCG
CGTGGCCGGC GCAGCCGCAA GACGCGTATG GTCTCGAAAA GCTCGCGAGC GAGGAAGTGT
ACAAGCACTA CCAGCAAGAT TTTGGTATTC AGACGCGCAT CGGTCGATTC CACAACATTT
ACGGTCCGTA CGGCACGTGG AAGGGCGGTC GCGAAAAGGC GCCGGCGGCG TTCTGCCGTA
AGGCTGCGAC GGCTGAAAGC GAAGTCGAAA TGTGGGGTGA CGGTAAGCAA ACGCGCTCTT
TCACCTACAT CGACGATTGC GTCGAGGGCA TCTTGCGTCT CACCAAGAGC GACTTCGCCG
AGCCGGTGAA CATCGGTTCC GACGAAATGA TCTCCATGAA CGATATGCAA GCCATGACGT
TGAAGTTCGC GGGCAAGGAC TTGCCAATCA AGCATATTCC GGGTCCGGAA GGTGTGCGCG
GTCGCAACTC CAACAACGAA CTCATCAAGG AAAAGCTCGG TTGGGCGCCG TCTGTCAAGC
TCGCGGACGG CTTGAAGGTT ACGTTTGAGT GGATCTCGAG CAAGATTGCC GAAGAGAAGG
CCAAGGGTGT TGACACCGCC GCCGCTTTCG GTAAGTCCAC CATCTGTGGC ACGCAAGCGC
CGACCGAACT CGGTCAGTTG CGCGCTGCGG ACGGCGACGA AAAGCTGTAA
 
Protein sequence
MAAAGYELRG IYGTEEYWPE KKLKICVTGA GGFIGSHLAK RLKEEGHHVV ACDWKRNEHM 
EEAMFCDEFI LADLRLYENC KKVLEGCDHC FNLAADMGGM GFIQSNHSVI FYNNVMISFN
MMEAMRVQGV TRCFYASSAC IYPEGTQLST EMQDGLKEAS AWPAQPQDAY GLEKLASEEV
YKHYQQDFGI QTRIGRFHNI YGPYGTWKGG REKAPAAFCR KAATAESEVE MWGDGKQTRS
FTYIDDCVEG ILRLTKSDFA EPVNIGSDEM ISMNDMQAMT LKFAGKDLPI KHIPGPEGVR
GRNSNNELIK EKLGWAPSVK LADGLKVTFE WISSKIAEEK AKGVDTAAAF GKSTICGTQA
PTELGQLRAA DGDEKL
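
The sequences above are printed in groups of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and the input shown is only the first line of the gene sequence, used for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a sequence printed in grouped blocks."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in the sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# Illustrative input only: the first line of the gene sequence above.
first_line = "ATGGCGGCCG CCGGTGCGTG ACGACCTCGA GAACGAGGCG TGGGAATTTT TCGCGCCGCC "
dna = clean_sequence(first_line)
print(len(dna))          # 60 bases on a full line
print(gc_fraction(dna))  # GC fraction of this line only, not of the whole gene
```

Passing the full gene sequence block to `clean_sequence` should give a string whose length equals the reported gene length, which is a quick way to confirm that no lines were lost when copying the record.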