Gene OSTLU_18863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18863 
Symbol 
ID5006431 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009373 
Strand
Start bp49834 
End bp50920 
Gene Length1087 bp 
Protein Length303 aa 
Translation table 
GC content64% 
IMG OID640421852 
Productpredicted protein 
Protein accessionXP_001422423 
Protein GI145356407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value0.00172466 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0000000976033 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCGACG ACGATCTCGC GCGCGGCGAC GCGCTCTGCG CCGCGGCGGC CAAGAAACTC 
AGGGTGCGAC GCGCGCGCGC GAGCGCGCGA TCGGACGCGC TCGAGCGGCG CGAACGCGCG
CGCTCGACGC GAACGATCGA CGCGAATGAA TGAATGAATG AATGCGAACG AACGAACGAA
CGAACGAAGG AACGAAGGAC GACTGACGAC GGCGCGACCG ACTGAACGAC GACGACAGAG
CGTTGGATTC TTCAGCGCGA TCACGGGCGC GAACAGGCAC GAGGAGGCGG CGGAGCTGTA
CGAGCGCGCG GCGACGTCGT TTAAGCTGGC GAAGAGCTGG CGAAGAGCGG CGGACGCGTA
CGAGGCGCTG GCGACGTGCA GAGCGACGAC GAAGGAGACG CACGACGCGG CGTCGGCGCA
CGTGGACTGC GCGCAGATGC TGAAGAAGTG CGGGGCGAAC GAGGAGGCGA TCGGACACTA
CAGGGAAGCG TCGAACGCGT ACGCGCGATT GGGACGACTG GCGCAGGCGG CGAAACATTT
GAAGGAGATC GGCGAGACGT ACGAGAGCCT GGGGACGGCG GAGGGGGATG AACGCGCGGT
GGAGGCGTTC TCGAGCGCGG CGGATCTGTA CGACGGCGAG GGCGACTCGG GACGGACGAC
GGGGAATAAT TGTAAACTGA AGGCGGCGAC GCTGCTGGCG AGCAAGCTCG ATCGATTCGA
AGAGGCGACG GAGATTTTTG AGGACGTCGG ACGCGCGTCG TTGAATAACA ATTTACTGCG
GTTCTCGGTG AAGGGGTACT TTTTACAGGC GGGGATCTGT CGATTGTGCT GGAACGACGC
CGTCGGGGTG CTGAACGCGT GCGAGCGATA CGAGGAGAGC GACCCGGCGT TCGCGTCGTC
GCGCGAGCGC GATTTGTTGG TGAATTGCGC CAAGGCGTTC GAGGCGGGCG ATCAAGACGC
GTTTTCGAGC GCGGTGGCGG AATTCGACTC CATGTCCAGG CTCGACGGTT GGAAGACGAC
GATGCTATTA AAGGCTAAGA AGCGCATCGT CGCCGCCGTG GAAGCCGAGG AGGACGATCT
CACGTGA
 
Protein sequence
MGDDDLARGD ALCAAAAKKL RSVGFFSAIT GANRHEEAAE LYERAATSFK LAKSWRRAAD 
AYEALATCRA TTKETHDAAS AHVDCAQMLK KCGANEEAIG HYREASNAYA RLGRLAQAAK
HLKEIGETYE SLGTAEGDER AVEAFSSAAD LYDGEGDSGR TTGNNCKLKA ATLLASKLDR
FEEATEIFED VGRASLNNNL LRFSVKGYFL QAGICRLCWN DAVGVLNACE RYEESDPAFA
SSRERDLLVN CAKAFEAGDQ DAFSSAVAEF DSMSRLDGWK TTMLLKAKKR IVAAVEAEED
DLT