Gene OSTLU_49124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49124 
Symbol 
ID5000736 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp796419 
End bp797930 
Gene Length1512 bp 
Protein Length444 aa 
Translation table 
GC content61% 
IMG OID640416157 
Productpredicted protein 
Protein accessionXP_001417060 
Protein GI145345099 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.469563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.18765 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGCG ACTTTTTCCT GCCGTCGATC GGCGGCGTCG AGCTCCACAT CTACGCCCTC 
GCCGCGAGAT TGCGCGCGCG CGGACACAAG GTGGTGGTGT ACACGCACGC GCATCCCGGA
CGCGTGGGCG TGCGGTGGAT CACGCGCGGG ATCAAGGTGT ATCACGTGCC GAGGGTGGTC
ATGTACGACA ACTGTACGTT TCCAAATTTT CTCGGCGGGT TCAAGTTATT TAGAAAGGTG
CGACGACGAC GACGACGACG ACGACGACGG CGACGGCGAC GCGCGCGCGC GAAGTTTCTT
TCGTTTCGTT CGCTCGTTCG TTTCGTTCGC GTCGACTGAC GCGCGAATTC TGATTCGAAT
TAGACGTGCG TTCGCGAAGG CGTGACGCTG GTGCACGCGC ACCAAGGGTG CACGATGTCG
CACGAGGGCA TACTGTACGC GCGAACGATG GGAATGAAGT GCGTGTTCAC CGATCACTCG
TTGTTTGGTT TCGCGGACGT CGGGGCGATT CACACGAACA AGTTATTGGA CATGACGTTG
GCGGACACGC AGCACGCGAT TTGCGTCAGT CACACGGCGA AGGAGAATAC GGTTTTGCGG
AGCGGGTACT TGCTCGGGGG CGAGCCAGGT TTGGCGCCGG AGCGCGTGAG CGTCATCCCG
AACGCCGTGG ACTCCGTGAG ATTCACGCCG GATGTGACGA AGCGGAAGAA AGGACGCAGG
ACGGTGGTGG TGACGTCGAG ATTGATGTAT CGCAAGGGCG TGCATCTGTT GGCGGGGGTG
ATTCCGCTCG CGTGCGCCGA GCACGACGAT TTAGATTTCC TAATCGCGGG CGATGGATCG
ATGCGGAAGC ATTTGGAGAA GGCGATCGAA GATGCGGGTT TGACCGAGCG CGTCACCATT
CTCGGGAGCG TGTCGCACGA CAAGGTCCCA GAAGTGCTTC GTCGAGGCGA CGTGTTCCTC
AACGCCTCGC TCACGGAGTC GTTTTGCATC GCCGTCCTCG AAGCGGCGTC GTGCGGATGT
CTCGTCGTCG CCACCGCGGT GGGAGGCGTT CCAGAGGTAT TACCAGAAGA TATCATGTTC
TTAGCGAAGC CGGACGTGCA GTCCATCCTG GACGCTCTCG ACGAGTGTCT CGAAGCGCTC
CCGCGCGCCG ATCCGTGGCG GATTCACGAG CGCGTCGAGG CGTTGTACAA CTGGGACGAC
GTCGCCCATC GCGTCGAGCT CGCGTACGAC CGCGCGTACG ACACGTGGGA CACGTTCATG
GGGCGTCTTT ACAGGCTGTA CCGCCGCGGC GTCGTGTTCG GAAAGATGTT GTGGTGCGTC
GCGGCGGTGA CGTACCTGTG GTGGCGCGCT CTCGAGTTTT TCGAACCCGC GGCGAGCATC
GAGCCCGCGC TCGCGCTCGA CGACGAGCGC TTCGACGTCG AGCGCTTCGA CGACGAGCGC
GCGCTCGCGC GCGAGGAGTA ACGAACGATT CATCCTCCCT CTCCTAGATC TAGCCAGTCC
ATCTATCCAT CC
 
Protein sequence
MLSDFFLPSI GGVELHIYAL AARLRARGHK VVVYTHAHPG RVGVRWITRG IKVYHVPRVV 
MYDNCTFPNF LGGFKLFRKT CVREGVTLVH AHQGCTMSHE GILYARTMGM KCVFTDHSLF
GFADVGAIHT NKLLDMTLAD TQHAICVSHT AKENTVLRSG YLLGGEPGLA PERVSVIPNA
VDSVRFTPDV TKRKKGRRTV VVTSRLMYRK GVHLLAGVIP LACAEHDDLD FLIAGDGSMR
KHLEKAIEDA GLTERVTILG SVSHDKVPEV LRRGDVFLNA SLTESFCIAV LEAASCGCLV
VATAVGGVPE VLPEDIMFLA KPDVQSILDA LDECLEALPR ADPWRIHERV EALYNWDDVA
HRVELAYDRA YDTWDTFMGR LYRLYRRGVV FGKMLWCVAA VTYLWWRALE FFEPAASIEP
ALALDDERFD VERFDDERAL AREE