Gene OSTLU_19107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19107 
Symbol 
ID5006812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp448339 
End bp449547 
Gene Length1209 bp 
Protein Length402 aa 
Translation table 
GC content57% 
IMG OID640422233 
Productpredicted protein 
Protein accessionXP_001422593 
Protein GI145356759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.258179 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTT GCGATAAGGC ATTCTCGTGC GCGTGGCTCG CGGATGATCT GGCGTTGATC 
GGGACGAAGG ATAATCGGTT GTTTGAGTTG GAGATCGATC GACGAGGGCG TCGAGACGCG
TGGCGCGAGA TCGAACTCGG AAGGGAGGAC GCGTACACGA GGAACCGGGC GATGGAGATC
GCCGCGCAAT GGGTGAGCCG ACACACGCCG CAGGCGTTAC GAGCGATCAA CGTTTCTCCG
CCGTTCTTGG CCGGTCCCTA TGGGCACGGC GCGGTTCGCG CCGCCGCTTC GAGACGGGGC
GGCGCCCACG CGTGTTCGGT GAGCTCGACT CGAGAGCACA TCGTGTGTAG CGGTGGTCCA
TCACACAACA TCGTTGGGTT CCGTCGAGAC GAGGAGACGG AATGTCTCGT GCCTAGAATG
GCATTTTCTG GGCACGACGA CGTCGTGTTC GACGTTGGCT TCATCGGACG AGACGCGATG
GCGTCGGCGT CGCGCGACTG CACGGTGAAA GTCTGGCAGC TTCCTAAGAG TCCTAGCTAC
GACGAGATAA GAATCACTCC AAGTGGTTCG GTGCACCCGA TCGGAGAGTG CACGCAGAAC
GAACGCGTGC GTGGCGTCAA GGTTGTTGAT CGATGTCCCG CTCGTCATCT TGCGACGTGC
ACGTCGAGCG GACACGTACT TCAGCTCGAC GCAGAGACGC TTTCGCTCGT ACACAGTGGA
TATCAGTGTC GAGGATATCT CGAAACTTGC TGCCTCGCCA CCGATGGCCA AATCGTCGCC
GTCGGTTCTC GCACGCACAT TGGTTTCGTG GACTTTCGAT CAAAAAACTT TTACGCCTCC
GTAGCGCTAC CATACGGCGA TACCAACAGC ACGCGCAGTC TTAGTTTCCA TGAGGGCGGC
AATCTACTCA CAATTGGCGG CGGTCGAGGA TTGATTTCGT TTTACGACGT TCGCATGCGA
AAATATCTCG TCGATAACGG TCGAGGTCGG GTGCGCCAGC TCTTCAACAA CCAATATTGC
GTCCCCTTTG CAGACAACGG GATCTTCGAG GATGAGCACG ATGACGACTT TTACGACATT
GAGATTCGCG ATTACTGTTT GCCAGCAATC TTTGCGCATC AGTGGGACCC GAGTGGAACA
CGTTTGCTCT GCGCCGGTGG GCCGCTTCAG TCGATGCTAC ACGGCTTCTT TGTGGGCGTG
TGGAGTTAG
 
Protein sequence
MSFCDKAFSC AWLADDLALI GTKDNRLFEL EIDRRGRRDA WREIELGRED AYTRNRAMEI 
AAQWVSRHTP QALRAINVSP PFLAGPYGHG AVRAAASRRG GAHACSVSST REHIVCSGGP
SHNIVGFRRD EETECLVPRM AFSGHDDVVF DVGFIGRDAM ASASRDCTVK VWQLPKSPSY
DEIRITPSGS VHPIGECTQN ERVRGVKVVD RCPARHLATC TSSGHVLQLD AETLSLVHSG
YQCRGYLETC CLATDGQIVA VGSRTHIGFV DFRSKNFYAS VALPYGDTNS TRSLSFHEGG
NLLTIGGGRG LISFYDVRMR KYLVDNGRGR VRQLFNNQYC VPFADNGIFE DEHDDDFYDI
EIRDYCLPAI FAHQWDPSGT RLLCAGGPLQ SMLHGFFVGV WS