Gene OSTLU_18554 details


Gene Information

Locus tag: OSTLU_18554
Symbol:
ID: 5006063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 264676
End bp: 266331
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table:
GC content: 70%
IMG OID: 640421484
Product: predicted protein
Protein accession: XP_001421895
Protein GI: 145355286
COG category:
COG ID:
TIGRFAM ID:
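
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(264676, 266331)
print(length)                  # 1656
print(protein_length(length))  # 551
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.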


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.0184372
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0131926
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGC GCCTGCAGCC GTGGGCGAGC GTCCGGGAGT GGCGCGACGT CCGCGACGCG 
CTCGCGCGCG CGACGGAGGA CGAGGCGCGC GGCGACGGCG CGCGACGCGG CGCCGCCGGG
GACGTCGAAA TCGACGCCGC GCTCGGCGTC GTGCGCGCGT GGCGCGCGCG CGGGCGCGCG
CCGCTCGCCG CGGACGTCAC CGCGATGCTC GTGCGCGCGT CGCGCGCGCG CGAGGGCGTC
GATCGAGGCG ACGCGGACGC GGCGCGGTCG GCGACGGCGA TGACGCTCGC GAGACTGGTG
AACGGCGTCG TCGACCCGAA GCAAAAGGGA CGGTACGCCG CGCCGATCGC GACGCTGGCG
CGAGAGGTGG GACTGCCGCG ATTTTTAGTG GATCTGCGAC ACGAGTGCGC GCACGGGACG
ATGCCGAGCG CGGGAGCGCT GCGGAGGGGC GCGCGGCGAG CGTTGGCGTG GTGTCGACGA
TGGTATTGGG ACGAACAGGC GCGGGCGTTC GACGCGGCGT TCGAACGCGT GCGGGGATGC
GTGCGAGCGA TGTGCGCGTG CGAAAAAGAC GCGAGAGGGC TGCGGGCGCG GGAGGGACGG
GGCGCGGTGG AGTCGTCGAG CGAGGAGATG GACGAGGACG GGGCGAGCGA GGGCGAAAAC
GCGGGACGGG AGAGCGAGGG CGGGACGTCG TTTAAGGACG TGCGAGAACG ACGACGACGG
GCGATCGGAA CGTTGAGCAG CGTGTGTCCG AAGGGCGCGG CGCACGTCGT GGCGGAGGCG
TTGCTCGACG GGGGGTGGCT CCGCGTCGTG GAGGACGAGA CGGCGAGCGA CGTCGACGAC
GCCGACGAAG CGACGTTTCG CGCGAGCGCG GAAGATTGGC GGCCGACGCT CGAGCGACTG
TGTCAGAAAT GGACCGGTTT GTTCGCGTAT CTGTTCGACG CCGCGATTCG AGGCGAGAAA
CCGGGAAACG ATGTGGGGTT TCAAACTTTG TTGAGCGTCG CGGCGAGCGG CGACTTCGCC
GCGAACGGCG ATCAGCGCGT CGCCGCGTTC CACGCGTGCA AACGCGCGCT CGCGAGCGTG
CACGAAGACG ATTGGAGCGA TGACCCGGCG GTGGCGAAGA GAACGATCCG GACGCTGCAA
AAAATTGCCG GCGTGTCAAA GGATGAAATC AAATCCGCGT CGCGTCGCGG CGCGGCGCCC
GCGGACGCGC TCGCGAGCGC GAGAGCCGAC ATCGAGGCGC TTCGCGCGAC GCTTCAGTCC
GGTCGTAAAC GCAAGCGCGA TTCGCGCTGG GAACGGGCGG AAGATTGGAC GCCGTCACCC
ATAGGCGTCG TCGCGGGCGT CTCGGCGCGC GCGTTGGTCG ACGTCGCGCC GTCGTCGCGA
ACGATTCGCG TCACATCCGG TGTCAGCGCG TCGAGCGACG CGGGGTATCG AAGCGCCACG
AAATCGGCGA CGTATCCACG CGGCGACGAC GGTGACGACG ACGACGCCGC CGACGACGAC
GAGGACGAAA ACGACGACGA GGACGAAAAC GACGACGGCG GCGAACCGTC CGAACGCGTC
GGCGTCGCCG CCGCGCTCAA CGTCGCCGGC GGTCGCGTGG AGCTCTCAAA ATCGCAAGCC
GCCGCCGTGG CGGCGTCTGT GGCGTGTCTG CTTTAG
 
Protein sequence
MSARLQPWAS VREWRDVRDA LARATEDEAR GDGARRGAAG DVEIDAALGV VRAWRARGRA 
PLAADVTAML VRASRAREGV DRGDADAARS ATAMTLARLV NGVVDPKQKG RYAAPIATLA
REVGLPRFLV DLRHECAHGT MPSAGALRRG ARRALAWCRR WYWDEQARAF DAAFERVRGC
VRAMCACEKD ARGLRAREGR GAVESSSEEM DEDGASEGEN AGRESEGGTS FKDVRERRRR
AIGTLSSVCP KGAAHVVAEA LLDGGWLRVV EDETASDVDD ADEATFRASA EDWRPTLERL
CQKWTGLFAY LFDAAIRGEK PGNDVGFQTL LSVAASGDFA ANGDQRVAAF HACKRALASV
HEDDWSDDPA VAKRTIRTLQ KIAGVSKDEI KSASRRGAAP ADALASARAD IEALRATLQS
GRKRKRDSRW ERAEDWTPSP IGVVAGVSAR ALVDVAPSSR TIRVTSGVSA SSDAGYRSAT
KSATYPRGDD GDDDDAADDD EDENDDEDEN DDGGEPSERV GVAAALNVAG GRVELSKSQA
AAVAASVACL L
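
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. Because the Translation table field of this record is empty, the standard genetic code (NCBI translation table 1) is an assumption here. The helper names (`clean`, `gc_content`, `translate`) are hypothetical and written only for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# Using table 1 is an assumption; the record does not state a translation table.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-character block spacing and line breaks; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAGCGCGC GCCTGCAGCC GTGGGCGAGC"))  # MSARLQPWAS
# Last 6 bases: a leucine codon followed by the TAG stop codon.
print(translate("CTTTAG"))  # L
```

The first ten residues produced this way match the start of the listed protein sequence, and the final codon TAG is a stop under the assumed table, which is consistent with a 1656 bp CDS encoding 551 residues.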