Gene OSTLU_18354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18354 
Symbol 
ID5005696 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp249852 
End bp252651 
Gene Length2800 bp 
Protein Length855 aa 
Translation table 
GC content58% 
IMG OID640421117 
Productpredicted protein 
Protein accessionXP_001421608 
Protein GI145354684 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.683916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0555178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA TGAAGCGCGT CGGCGGCGTG GATTTAGTGG TGAATTTTGC ACAGCAAGTG 
GCGGCAGACG ACGAGAAATT TGATGCACTT ATTCGTGAGG GCGCGTTTGA GGTGTTCACG
CACGCGTTGC TCGCAGATGA GGACGAGGTC ACCGTTCGCG GGCTCATCGG ACTCGCTTGC
GCGCTGCCGA GACGACGGGC GCTGCGCGTC AAACTCGCTG AAGACGGCGA GTGCGTGCGA
CGGCTCGCGA CCCTGATGGG ATCATCCACG GATGAAAGTC TCAAGGGCTT CGCCGGGGGT
TTGTTCCGAG CCTTAGCGCT CGATCCCGAG ACGAAAGGTT TAGTCGAGAA GGCGCTGCGA
GAAGGCGGTG CCGGCGTGCT GCAAACAGCT TGATAAACAT CGATCCCGGC TCGCCGCCGC
CCGATACCGC TCGTATTTGA TGTAGTATTC ATTTGCCATT GTCTAGGATG TTTCCCGTCG
TCGCGCGCGC GTCATCGCCT CACTTCGTCT CTCTCTCGCG GCTTCACTTC ACTTCGCGCC
TCGCGCTCCG CGCTCGATGC CTCCTCGACG ATCGACCCGA TCGCGATCGC CCTCGAAAGA
GGCGCGAGAC GCGATCAACA ACGCCGCCGT CGCGTCATCC GCGCGCGCGC CCGCGGAAAT
GCGTCAAGAC GACGCGCAGA ATGAAAAAGA AGGCGGCGGC GCGGTGGAAT CCATGTCAAT
CGAAGCGACG AATGCGATGC GTTTAAAGCT CGGTTTAGCG CCCTTGCGCG AAGGAGCGTC
GAAGAAAACC CATGACGCGG ACGCTTTACG TCGTGATGAG GCGAAAGCGG CGGAAACGGC
GGCGCTGGCG GAGAAGATTG CTGCGAGGAA GCGTCAGAGG GAGATTGAAA AGTTGAACGC
GGCGACTACA AAGCTCGGCG ACGCGGACGA TGAGGAGGAA GACGCGGGAG CTTGGTTGGC
GAAGAGCAAA ACGAAGATGG CGACGAAAGC GCAGGAGTTG GAACGCGCGA AAGCGGCCAA
AGTTGCGGCG ATGTTCGCGG AGAGGGATGA GGACGCGGAG GCGAGCGCGT CGGAGGAGGA
GGACGAGGGG GCGAAGAAAT CAAAGTCGGC GGCGTACACG TCGAAGGATT TGCGCGGACT
CAAGGTGAGG CACACCGCGG ATGAAATTAA TGAAGGTCAA GAGGTGGTGT TGACGCTGAA
GGATACGAGC GTGTTGGATG ATGAGGATGA TGAATTAGAA AACGTCTTAA TCGCCGAGCG
CAAGTCGCGT AAAAAGGCGA GAAAGGAGTC AACGAAGAAG AGTGACGACC CTTTCGGGGA
GGGCAAGGAT GTGGAGGCCA AGAAAACGGT GTTAGGAAAG TACGACGCTC ATGATGAGGA
TGCGGCAATG GAACTCGACG GTGAGGGTGG CCTCGACGCG GCTGAGGAGA AGCGAAAGGC
AGAAATCAAG GCGCGACTCG CCGCCGAACT CTCGGGATTG AAGGGTAAAG CGGAAACCGC
GGAAGTTGTC AAGGGTGAGC AAGCTGATTT TCACACCCAA GAAGAGATGC AAGCTAAGTT
TGTGAAGCGC GAAAAGAAGA AGAAGATGCG CAAAAAATTG AGAAAGAAGC ATATCGATGC
TGCGGAGCTC GAGCAAGACG CGCTGGCGCC AGAGTCGAGC GATCTCGGTT CACGACGCTC
GCGCGGCGAG TCTAGTGCCG AAGCTAAGGC CACGACTAAT GAGAAAGACG CGAAGTTCGC
GAATGCGTTG CAAAAAGCTC GTGAAGTGAC CGATAAGAAG ATTCTCGCTG AACTCGCCGG
CGAAGCAGAG GAAGAGGAAG ACGACGAGCT CGCTCGTGCG CTCGCAAGGA GCCGGAAGAT
GGCAAATATT TCGAACAAAG CAGCGCGCTC GGCGCCGGCA GACGTGGTCG CGCAAGTGGC
GGCGCGTCGT CAAGCAGATG AGGCGAAGGC TCGAGAGAAC GCCGCGAATG TCGCCGACGA
ATCTCTAGTT TTTACAGACA TGTCTGAGTT TGTTCAAGGC ATCAACACTC AAGACGGCGC
GCTCGATGAG GTCTACGACG AAGACGCAGA CACTGAGGAA ATGCCAGACG TACCGCCGCC
GCCACCACCG GGCGGTGAAG ACGAGCCGAT GGACGATGAA ATGCCAGATG TGCCGCCACC
GCCACCGCAA GAGGAAGTCG ACGCGCACGT TAACATGCCC GTACTCGCGG AAAAGCACGT
CGTGCAAAAA GGGTTGGCGA GCACACTCGC TCTTCTCAAA GACAAGGCTC AACTCGATGA
TGCGCAAAAC ACGCGCTGGA GTGGGCGCGC CAACGACATG AAGGATCGAT TCGACAGACA
GCACGTGCTC GAGGCGCACG CCATAGAAGA AAAAGCCGTC GACGGCTACA AGTTCGGTTT
CAAGCTCGAT AAGTTGGACG AGTTCGGACG GAAGCTCACC CCGAAAGAGG CGTTTAGAGA
GCTTTGCCAT CGTTTCCACG GCATCGAGCC TGGCAAAATG AAGCGCGAAA AGCGCCTGCG
ACAATTCCAG GAGGAGCAAC AGCGTCTCAA GGCGTCAAGC GTCATGGACG ACCGCATCAA
GGACGTACAG CGCGATCAAG CGACGCCTTA CGTCGTACTT AGCGGTCACG TGAGGGCTGG
ACAGGCGAGA AACGCTGATC CGGTGGCGAC GATGAAGCGC GAACAGGAAT CCGCGCGCGC
CACCCCCGCC GCCGCGTCGC GCGGTCCATC ATCGGCATCC GGTCTGAAGG CTTCCAACGC
GACAAAAGTC TCGTTTGCGA TGAAACCATC GAAGAAGTAA
 
Protein sequence
MDFMKRVGGV DLVVNFAQQV AADDEKFDAL IREGAFEVFT HALLADEDEV TVRGLIGLAC 
ALPRRRALRV KLAEDGECVR RLATLMGSST DESLKGFAGG LFRALALDPE TKGLVEKALR
EGEARDAINN AAVASSARAP AEMRQDDAQN EKEGGGAVES MSIEATNAMR LKLGLAPLRE
GASKKTHDAD ALRRDEAKAA ETAALAEKIA ARKRQREIEK LNAATTKLGD ADDEEEDAGA
WLAKSKTKMA TKAQELERAK AAKVAAMFAE RDEDAEASAS EEEDEGAKKS KSAAYTSKDL
RGLKVRHTAD EINEGQEVVL TLKDTSVLDD EDDELENVLI AERKSRKKAR KESTKKSDDP
FGEGKDVEAK KTVLGKYDAH DEDAAMELDG EGGLDAAEEK RKAEIKARLA AELSGLKGKA
ETAEVVKGEQ ADFHTQEEMQ AKFVKREKKK KMRKKLRKKH IDAAELEQDA LAPESSDLGS
RRSRGESSAE AKATTNEKDA KFANALQKAR EVTDKKILAE LAGEAEEEED DELARALARS
RKMANISNKA ARSAPADVVA QVAARRQADE AKARENAANV ADESLVFTDM SEFVQGINTQ
DGALDEVYDE DADTEEMPDV PPPPPPGGED EPMDDEMPDV PPPPPQEEVD AHVNMPVLAE
KHVVQKGLAS TLALLKDKAQ LDDAQNTRWS GRANDMKDRF DRQHVLEAHA IEEKAVDGYK
FGFKLDKLDE FGRKLTPKEA FRELCHRFHG IEPGKMKREK RLRQFQEEQQ RLKASSVMDD
RIKDVQRDQA TPYVVLSGHV RAGQARNADP VATMKREQES ARATPAAASR GPSSASGLKA
SNATKVSFAM KPSKK