Gene OSTLU_18368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18368 
Symbol 
ID5005730 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp283463 
End bp284785 
Gene Length1323 bp 
Protein Length419 aa 
Translation table 
GC content62% 
IMG OID640421151 
Productpredicted protein 
Protein accessionXP_001421765 
Protein GI145355010 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.370386 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.858147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAAC GACGCACCGC GCTCGTGGTT TTGGGCGACT TTGGTCGTTC GCCTCGAATG 
CAATACCACG CGCTGTCGCT CGCGCGCGAC GCGGATCGCG CCGTCGACGT CGTGTGTTAC
TCGGGCACGC CTCCGATCGA TGCGCTGTCT CGCGAAGACG CGGTGACGAT GCGTTACGTC
GTGGGATGTC GGTGGCGGTG GTTGACGCGC GTCCCGCTCG CGCTCGCGCT CGGGACGCGC
GTCGCGGCGC AGTGCGCGCA CTTGTTTTGG ATCCTGATGA CGATGCAGCG GTGCGAAGAG
ATGCTGATAC AGAACCCGCC GTGCGTGCCG ACGTTTTTGG TGTGCGGAAT CGTGTGTCGC
GCGCGACGGA CGCGGTTGGT GGTGGACTGG CATAATTTCG CGTACACGCT GTTCGGGATG
AAGCGCGGCG ACGCGAGCGC GACGACGCGA ATGTTGAAAT GGTACGAACG GACGCAGGGA
AAGATGTGGG GAGACGCGCA CGTGTGCGTG ACGAAGGCGA TGGGAAACTT TTTGGAGAAA
GAATGGAAGA TTGAGGGCGC GCGCGTCGTG GAAGACCGCG CGGCGGAGCG ATTTCGAGAG
GCGGCGCGCG AGGCGACGAC GCCGTTGGAA TTTTGGAGAA GCGAACCCGC GCGCTCGGCG
CTGGAGGCTT CGCCCGTCGC GCGGAGTGAG GACGCGCTCG ATCGGTTTTT GCGGGGCACG
CACGAGAATA TGACGAAGAA TAAGCCGAGG TTCATCGTGA GTTCGACGTC GTGGACGCCG
GATGAAGACT TTGGCGTTTT GCTTGACGCC GCCGTCGCGT ACGACGCGCG CAAGCGCGCG
AAGGGCGATC ATGCGTCAAA GTCGTACCCT GACATCGTCA TAATTATCAC CGGTCAAGGC
CCACGAAAGA CGATGTACGA GAAGAAGATT AACGAACTCG CGCTCGAGCA CGTGGCGTTT
CGAACCGTCT GGCTCGACGC CGCTGACTAT CCGCGCGCGC TCGCGAACGC GCACCTGGGC
GTCTCCCTGC ACACCTCGAG CAGCGGTTTA GATTTACCGA TGAAAATTGT GGATATGTTT
GGGGCATCGT TACCCGTCGC CGCGATGCGG TACGCTGTCA TCGGAGAGCT CGTGCAAGAG
GGCGTCAACG GCGTGCTCTT TGCCGACGCC ACCGAACTCG CGGCGATGTT CGCGAAACTT
CTCCGTGGCG ACGAACGCCT CACGCTCAGA GCGTTGAAAC ACGGCGCGGC GAAATGGGGA
GAGCAAACGT GGGACGATCA TTGGAAGCGC TGTGCGTTAC CTGTGTTCGC CGACGCGGCG
TGA
 
Protein sequence
MTKRRTALVV LGDFGRSPRM QYHALSLARD ADRAVDVVCY SGTPPIDALS REDAVTMRYV 
VGCRWRWLTR VPLALALGTR VAAQCAHLFW ILMTMQRCEE MLIQNPPCVP TFLVCGIVCR
ARRTRLVVDW HNFAYTLFGM KRGDASATTR MLKWYERTQG KMWGDAHVCV TKAMGNFLEK
EWKIEGARVV EDRAAERFRE AAREATTPED ALDRFLRGTH ENMTKNKPRF IVSSTSWTPD
EDFGVLLDAA VAYDARKRAK GDHASKSYPD IVIIITGQGP RKTMYEKKIN ELALEHVAFR
TVWLDAADYP RALANAHLGV SLHTSSSGLD LPMKIVDMFG ASLPVAAMRY AVIGELVQEG
VNGVLFADAT ELAAMFAKLL RGDERLTLRA LKHGAAKWGE QTWDDHWKRC ALPVFADAA