Gene OSTLU_18783 details


Gene Information

Locus tag: OSTLU_18783
Symbol:
ID: 5006347
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009372
Strand:
Start bp: 9491
End bp: 10681
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table:
GC content: 51%
IMG OID: 640421768
Product: predicted protein
Protein accession: XP_001422315
Protein GI: 145356181
COG category:
COG ID:
TIGRFAM ID:
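
The Start bp, End bp, Gene Length and Protein Length fields are consistent with one another, which a short sketch can confirm. It assumes inclusive 1-based coordinates and a single terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(9491, 10681)
print(length_bp)                  # 1191
print(protein_length(length_bp))  # 396
```

Both results match the values listed in the record (1191 bp and 396 aa).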


Plasmid Coverage information

Num covering plasmid clones: 46
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACCT CCGCACGTCG GCTCGGTTTG CGCGTGCTGT TGGCTTTTTG GTTGTTGAAG 
TTTGTATTGT CTTTGCAGTT CATAACACGC GGATGCGAGC GCTCGGTGAA GTATTTGAGC
GAGATGTGGA TGCGAGAAGA TGCTTGGACG CACTGTCGGA ACTGCTGCGG CGACGTGAGG
GAACGAATGC TGAAGTTCAG TTCATTTGGC TTGGGTAAAG ACGTCGCGTG TTTCACCATT
CATCCGGAGG CGACTTCGCC GTCGTCGTAC ACCTTCGTCG CACGCAGGCG TGAACCGCTG
AATCGTAGCT ACTTTAAAAG TGCCCTGCGG TTTCTCCGAG ACGTCGCTGT GGAACGGCGA
CAGAACCAAG GAGAACGAAG CGACGTGGCT CTACGACTCC TTTGGGTGAT TGACGACGAT
GCCTCTATGC CGAGGGAGCT GAAACGGGAA TTGAAATCCT TGAACACTTT CATTCTGGCG
CATTCTATCA ATGTGGATTG TTTCACCGAC GAATCAGTCG TGCTCGCGCC AAACTTCCAC
TTCATCAAGC GCGACGGGTT CAAACCGTTG CTTCGGAATC TGCGTGAACG TGAGATTCCG
TTTGATGAAC GCAAATCCGA CGTATTCTGG AGAGGCTCGA CGTCTGGTAT GTCAACAAAA
TGCGAAATAG AAGAACCCGC TCGCGTCGAC GTGAACGAAA GGGTTACCGC GTGCGTTGAA
CTTCCGCGCG TGCGAGCGGT GCAATCATCC ATAAATGTTC CTTGGCTCGA CGTCGAGATC
ACGCGAAGGG TACAATCGTG TAAAGGACAA ACAAATGTAC GCATTAGCCC GCACGTATCG
GAACAGCATT GGATCACGCA CAAAGGTATC CTCGAAATCG ATGGAAACGT CGACGCGTGG
GGAAACCGTT GGCGCATGGA AAGCGGGAGC GTCCTGTTCC TCGTCAAGTC AAATTTCAAG
CACTATTACA GCGACAAGCT GGTAGATGGA GTACACTACA TCGGAATATC TGGAGATTTG
CACGATTTAG TGGAAAAAAC GAAGATTGTG GCGAACACCG ATGGCGAATC GCTAACGAAG
TTGCGTGATA TCACAGCAAA TGCTCGCGCG TTGATGCAAG AATTTACGTA CGAGCGCGTC
GTCAAGGGCG TTTCCCATCG TCTGAATGAG CTAGCCTTAG GTATGGTATA G
 
Protein sequence
MKTSARRLGL RVLLAFWLLK FVLSLQFITR GCERSVKYLS EMWMREDAWT HCRNCCGDVR 
ERMLKFSSFG LGKDVACFTI HPEATSPSSY TFVARRREPL NRSYFKSALR FLRDVAVERR
QNQGERSDVA LRLLWVIDDD ASMPRELKRE LKSLNTFILA HSINVDCFTD ESVVLAPNFH
FIKRDGFKPL LRNLREREIP FDERKSDVFW RGSTSGMSTK CEIEEPARVD VNERVTACVE
LPRVRAVQSS INVPWLDVEI TRRVQSCKGQ TNVRISPHVS EQHWITHKGI LEIDGNVDAW
GNRWRMESGS VLFLVKSNFK HYYSDKLVDG VHYIGISGDL HDLVEKTKIV ANTDGESLTK
LRDITANARA LMQEFTYERV VKGVSHRLNE LALGMV
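
The relationship between the two sequences above can be spot-checked by translating parts of the gene sequence. The sketch below is a minimal example that assumes the standard genetic code (NCBI translation table 1), since the record's Translation table field is empty. The `translate` helper is hypothetical, and only the first and last lines of the gene sequence are used.

```python
from itertools import product

# Standard genetic code (NCBI table 1); an assumption, because the record
# leaves "Translation table" blank. Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (bases 1-60).
first_block = "ATGAAAACCT CCGCACGTCG GCTCGGTTTG CGCGTGCTGT TGGCTTTTTG GTTGTTGAAG"
# Last line (bases 1141-1191); in frame because 1140 is a multiple of 3.
last_block = "GTCAAGGGCG TTTCCCATCG TCTGAATGAG CTAGCCTTAG GTATGGTATA G"

print(translate(first_block))  # MKTSARRLGLRVLLAFWLLK
print(translate(last_block))   # VKGVSHRLNELALGMV
```

The two outputs correspond to the first 20 and last 16 residues of the listed protein sequence. Running `translate` on the full gene sequence would be the natural next step.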