Gene OSTLU_16330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16330 
Symbol 
ID5003387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp44085 
End bp45305 
Gene Length1221 bp 
Protein Length406 aa 
Translation table 
GC content64% 
IMG OID640418808 
Productpredicted protein 
Protein accessionXP_001419053 
Protein GI145349256 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0411346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAAC TGGAGACGGC GAAGACGGCG CTGCGCGGCG CGAGCGGGCC GGACGACGGC 
GATCGCGCGA TCGAACTCGC GGGCGAGGCG CTGGAAAAGT TTATGGAGAT CGCGAACGAC
GCGGACGCGG ACGAGGCGAG CGTGCGACGG GAGACGGCGC GCGCGCACTT TGTGTACGGA
GAGGCGCTGT TTCGAGGCGC GCAGGCGCAA AACACGGTGT TCGGGGAACA GGTGCGGGCG
AACGCGGAGG CGAGCGGGAC GAGGCTGGAG GACGCGCCCG AGGACGAAGA CGTGGGCGAG
GAAGACGAGG AAGAAGGAGC GGGCGCGACG GAGGGCGAGA AGAACGGTAA GGAAGCCGCG
GATGACGAGG ACGAGGGCGA GGAGGATGAG GAAGAGTCTG ATATGGAGTT GGCGTGGAAG
ATGTTGGAGA CGGCGCGCGT GATGTTTGAA GAGGACGCGA ACGCGGCGTT GGAATTGGCG
GATGTTTTAG AGACGATCGG GGAGTTGAAC ATGGAACAGT CGCAATTTGA TACGGCGTTG
TCGGATTACA AGTCGGCGTT GAAGCTCTTG GAAGAAAACT TGGAAGCGAC GGATAGGCGT
TTGGCGAGCG CGCTGTATTC GATTTCCATC GCTAATCAAA TGATGGAGGC GAATGACGAC
GCGCTCGCGG CGAACACGCG CGCGATCGAA ATCTGTGACG CGCGAATCGC GGAGCTCAAA
GCTGGGACGG CGCGCGTGAG CAAGGGTGCG CGCGAGAACG CGGATGAAGT CGTCTCGCCC
GAGGCTGCCA TCGCCGAGTT GGAGCAAATA ATGGGCGTGG CGTCCGATTT GAAGGAGCGC
CAACTCGAGC TGAAAGAGCT CGTCAGTGCG GACAACTCCA CGCGCGAAGC TCTCCGACAG
GCGTTCAAGG CGATCGGCGG TGCAGCGCCC CCGGGTGCGT CGGAGCCGGA GGAGAGCGCC
GGTTTCGCCG CTCCGACGCT TACGTCGAGC GTTCCCGTGC AAGCGGCGCC TGTTCGCCGC
GTATTACCCG CGCCTGTTCG CCGCGTCGAG GTCGCACCGC TTCAAGAAGC GCCGGCGAAG
CGCGTGGAGC CGCAGCAAAC GTCCGCGCCC GCTGCCGCTC CAGAGGCGAA GAAGATGAAA
CCGACGCCCG TGGACAAAGC CGCGTTGATT GGTGCGACTG CTCCCAAGGA CGCCGAACCA
AACGGATGCC CGCAGCAGTA G
 
Protein sequence
MAKLETAKTA LRGASGPDDG DRAIELAGEA LEKFMEIAND ADADEASVRR ETARAHFVYG 
EALFRGAQAQ NTVFGEQVRA NAEASGTRLE DAPEDEDVGE EDEEEGAGAT EGEKNGKEAA
DDEDEGEEDE EESDMELAWK MLETARVMFE EDANAALELA DVLETIGELN MEQSQFDTAL
SDYKSALKLL EENLEATDRR LASALYSISI ANQMMEANDD ALAANTRAIE ICDARIAELK
AGTARVSKGA RENADEVVSP EAAIAELEQI MGVASDLKER QLELKELVSA DNSTREALRQ
AFKAIGGAAP PGASEPEESA GFAAPTLTSS VPVQAAPVRR VLPAPVRRVE VAPLQEAPAK
RVEPQQTSAP AAAPEAKKMK PTPVDKAALI GATAPKDAEP NGCPQQ