Gene OSTLU_29635 details

Gene Information

Locus tag: OSTLU_29635
Symbol:
ID: 5006842
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 11597
End bp: 12730
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table:
GC content: 58%
IMG OID: 640422263
Product: predicted protein
Protein accession: XP_001422877
Protein GI: 145357339
COG category:
COG ID:
TIGRFAM ID:
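
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS includes its stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(11597, 12730)
print(length)                     # 1134
print(protein_length_aa(length))  # 377
```

Both results match the Gene Length (1134 bp) and Protein Length (377 aa) fields.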


Plasmid Coverage information

Num covering plasmid clones: 64
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTACGACA TGTATCACAG ACATCGAAGG CTTAATGACG ATGTGGTGAT GTCGTTCGAT 
GCAGACTCGA TTCTGCAATC CAGTGCGACG GACACGAACG CGATGAGCGA CTTCGACCGC
GCGGCGCAAT GCGCTTCGAC GGAGGATGCG CTTCGGACGC TTGGCATGTT AGCTATGCCA
AGCGAGTTTC GTCAATCGCG GCTCGAACGA CAGCGCGAGG CCGCGAAGAG GAAAGAAAGC
GTCGCACGTG TGTCGAGTGG GCCCAAAACG TCTTCCGCGG TTGTCGACAT TGATCCGTTT
GCTGAATACT TGCAGCACGA TCGTGGGTTT GCGCCGGCGC GCGTTTCGTC AAAGCCCGCC
GGCGTAAGTA GTCCACCGCC GTCGATGAGG ACGGCGAAGC CCGCGCGAAG ACCAACTGGA
GGTTCTTACA CGAATGCTCG GAAGCACGGT GCTTCGACGT CGGTGTCGAG GCTCGCACGT
GTTCCATCGT CCGAGTCCGA TGACGACATT GTGCCGGCGC GCAGGCGCCG AGACTTCGAC
GTTGGTTTCA GGCGCACGGC GAGCGGAAAG TTTACATTTG GCGGAGAGGA TCGAACGCAG
ACGCGCGTCG CATTTGAGCC GCAAGTCCAC GTAGCGCAAA GTCGGAATCA CGAGGACGAC
TTTCGCGCTG ACATCGAGCG TTGGCCACAA CTACAGCCGG AGACAAACGA CAACGCAGAA
GATGAGCAAA TCGCCGAGGC GATACGCTTA TCGAAGCTCG AGTTTAAAAA GCAATCGCGC
GAGCACAACT CGGCGCGGGC CATGCACATC GAGTGCGATG AGCTCTTCGG CGACATGACG
GAGGAAGAAA TCGTCGCCCT CGTCGTTCGC ATGTCGCAGG AAGAGACGAC GAGCGAGGCT
GCTTTGCCAA TGCCGACAAA GACGGAGTGG GTGAACGCGC GTCTCGGCGA AATATTTTTC
CCCGACATCG AGCGCGCGAG ACAGGTTGCG GAAATATTCA GAGCGACTGA AACCGACCGA
AATGCGACTA TCGATTTGCT CATTGGTGCC GGCGCAGCGG AGAGCGACGC TAAAGCGTTT
TGGGAATTGT TTGACGCCGT CACGCTGACG GATAACGATA GCCACGTAGA TTAG
 
Protein sequence
MYDMYHRHRR LNDDVVMSFD ADSILQSSAT DTNAMSDFDR AAQCASTEDA LRTLGMLAMP 
SEFRQSRLER QREAAKRKES VARVSSGPKT SSAVVDIDPF AEYLQHDRGF APARVSSKPA
GVSSPPPSMR TAKPARRPTG GSYTNARKHG ASTSVSRLAR VPSSESDDDI VPARRRRDFD
VGFRRTASGK FTFGGEDRTQ TRVAFEPQVH VAQSRNHEDD FRADIERWPQ LQPETNDNAE
DEQIAEAIRL SKLEFKKQSR EHNSARAMHI ECDELFGDMT EEEIVALVVR MSQEETTSEA
ALPMPTKTEW VNARLGEIFF PDIERARQVA EIFRATETDR NATIDLLIGA GAAESDAKAF
WELFDAVTLT DNDSHVD