Gene OSTLU_18359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_18359 
Symbol 
ID5005700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp260190 
End bp261509 
Gene Length1320 bp 
Protein Length439 aa 
Translation table 
GC content62% 
IMG OID640421121 
Productpredicted protein 
Protein accessionXP_001421611 
Protein GI145354690 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.523729 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0221416 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGA CGACGACGAC GGCGGACGCG GCGCGCGCGG CGGCGGGCGC GGCGCGGCGA 
CTGCAGGAGA CGTACTTCGC GGCGACGTCG AAAGCGATCG CGCCCGAGCG GTGGTACCCG
TACTGGTGGG CGCTGCCGCT CGCGCCGTAC GGGAGCAAAG CGACGGCGCT CGCGGAGGCG
GTGCCGGGGG AGGTGTGGAC GTTCGATCAG TTGCAAGGAT TGCTCGACGT GCTCGTCAAC
GTGCGAATGA CGGTGGCGCG ACTGGAGGGG GGGGGGTTGT GGGTGCACAA CCCGGTGGCG
CCGACGAAGG AGCTGGTGGG GATGGTGAAG GAGCTGGAGA AACGATACGG CGCCGTGAAA
CACATCGTCG TCGGGAGCGC GGCGATCGAG CATAAGATTT ACAGCGGGCC GTTCAGCAAG
GCGTTTCCGA ACGCCGACGT GTGGTTGCCG CCGAAGAATT GGACGTTTCC GGTGGACGTG
CCTCTGGAGA CGTACGTGCC GTTTTACCCG CAAGGGTCGC CGAAGACGCT TCCGATGCAA
TCGATCGGAG GCGAACAAAA CGTGCCGTGG GCGAACGAGA TTGAACACGC CGTGCTGCAA
GTCGGCGGCT CGTCCCTGCG CGGATTCAAG GATCCTTGGT TCGTCGACAC CGCGTTTTAT
TTGAAGCGCA CGAAAACGGT CGTGCTCACG GATGTCATGG AAAAGGTGAG CCAACAAGCG
CCGCCGGTGT GTCAAATCAA CCCGCAGCCG CTCTTGGTGC GCGCGATGGA TGAGCCGGAC
AAGGTGCCGG CGAACACATC GCAGGCGAGA AGCGACGGGT GGGGGAAAAC CGTCTTGTTT
GGTTTGCTCT TCAACCCGAA CGCGGTGGAG TTTGAATTTA GCGGAGACAT CGCCAACGAT
TTGCTCGATG GGTTCAAGTG GGATCCGAGC TGGCGCGCCG ATTTCGACGC CCTCGTCGCC
AAGCCGATGT TTGTGCCGCC CATTTTAGCC GTCTTGGCGT TCCCGCGTCG CCGCGACGAA
GTCAAGCGTT GGTCGAACAT CGTCACCTCG TGGGATTTCA CGTCCATCAT CCCGAGCCAC
CTCGACGGTC CCTTCAACGC GACGCCGGAT GAATTCACCG CCGCCATGGA CTTTGCCTTG
ACGGCGACGC CGTACGAGCA ATTCGGCGCC AACGCGCAGA GCTTAATCGA CGTCGACAAA
TTGAGCGTCG ATCTCAAGTC TCTCGAAGTG CCCAAGCCGT TGAGTCCGGA GCTCATCACT
CCTCCCAAAC CAGCGCAGGA AGCGGTGGAA ATCGTCCTCG ACGCCGCGTC GGAGCAATAA
 
Protein sequence
MATTTTTADA ARAAAGAARR LQETYFAATS KAIAPERWYP YWWALPLAPY GSKATALAEA 
VPGEVWTFDQ LQGLLDVLVN VRMTVARLEG GGLWVHNPVA PTKELVGMVK ELEKRYGAVK
HIVVGSAAIE HKIYSGPFSK AFPNADVWLP PKNWTFPVDV PLETYVPFYP QGSPKTLPMQ
SIGGEQNVPW ANEIEHAVLQ VGGSSLRGFK DPWFVDTAFY LKRTKTVVLT DVMEKVSQQA
PPVCQINPQP LLVRAMDEPD KVPANTSQAR SDGWGKTVLF GLLFNPNAVE FEFSGDIAND
LLDGFKWDPS WRADFDALVA KPMFVPPILA VLAFPRRRDE VKRWSNIVTS WDFTSIIPSH
LDGPFNATPD EFTAAMDFAL TATPYEQFGA NAQSLIDVDK LSVDLKSLEV PKPLSPELIT
PPKPAQEAVE IVLDAASEQ