Gene OSTLU_28298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_28298 
Symbol 
ID5006176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp174071 
End bp176464 
Gene Length2394 bp 
Protein Length797 aa 
Translation table 
GC content49% 
IMG OID640421597 
Productpredicted protein 
Protein accessionXP_001422222 
Protein GI145355982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.126698 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.061599 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGCAC TACAGCAACG CGAGAGAATG GCGGTGTTGT CGGGTTTGGT CAAAGCGGCG 
GACGACGCGT TTGACCGAGC CGAGATCGAC GACGTCACGA AAGAGTTGTT TCTCGACACA
CAGATCGAGG ACGAGATGAA AGAGATGTTC GACATTGCGA GACGAATTCA ATTTTTTGAT
GAAACGGGAA AGTTGGAAGG CGCTCTGGAG TTGGTGAAGG ATTTCGAAAA GTACGCGCGG
AGCGAGGATT CGAAAGTCGT CGCAGTCGCG GTGTGGTGCG GGGCGTGGAG CGCGATTCAA
AATGAACTTT TTAGACGTAG CGAGTGGATA CGAGTGCTCG AGAAACTGTT TTGGACGATC
CACACGCTCT GTTTGAGTCA TCGCTTCAGC AAAATCCGCG CGTGTGCGGC GATTCAGTGC
GGGGAGTTGG CCACCGCCGC GATGTTGAAG TTGTGCGATT TGCGCGTGCA CGAGGCATAT
GCTCGTGATA TGTTACGCTT GAGCTTAGAT GAAAAGTGCA CAGTACGACA CGCCGTGTCC
TGTAGCGTCT CCATGTGCTT CGTTAGACGC ATATATTCTG GGCTTTTCAC CGTAAGAGAA
GCGGATGCAG CTTTGCTGTG CGCGATGTTT GTGAGTGCCG TCGGGAAAAG AAATCCTTTT
GCGCACATCG ATTACGTGAA CAAAGCGCAC TTGACAAAAA CTATGAATGA AGGGTTTGCA
TCAGTGACGC ACTATACCTT AGAACGAATG ATTCAGTGCA TGTTAAACGA TCAGCTGTGG
CGCACGGATG ACAAAGACAT CCTTGGACGG CGTCTCCGCG CGTGCGCGCT CATCGTCGAA
CACGTCGGCG ATCGCGTTGC CCCGTTCGTC GATTCAATCA TCAAGATGTT GCTAGATACT
TTGACCAAGA GTCTTGTTTG GCGCGATGCG GAATATTTGA TGGAAATATT GATGATTCAC
GCGCCCTCGA CGCGTAGGAC GTGGGTTGAA CCGACGATGC TGCACGTAAA CTCCGAGTTT
ACTTTGCACA TGGTTGCTTT GTGTGTATGC GAAAAATTTC TCTCTAGAAG AAAGCAGCAT
GGCGTCACAC ATGCACTCAA CTTGGATGAT GTTGCGTTCA TTTCAGAAAG TCTGGAGAAG
ACTATGTTTC GATTGGATTC ATGTGACAAG CTAGATGATT CAAGTGGCGT CCTAAATTGC
GTCGAAAGTA TCCTTGCGTC ATGTTTAGAG ATTCTTCGTC AGCAGAATAA GGCTAGCATG
GCGACGCGAA AGTTCAAGAA ACTTGAAGCA TGTGTGAAAA TTCAGACTAT GTACTTTTGT
GTGTGCGGAA TGTCTTCCAT CGACGTAACT GGCGAGATGT TGATGGCATG CACTCCAAAG
GATGAAGTTG AGGCATCGGA GATATTTCGT CTCATCGTCG ACCGTAACGT ACCACTACGT
TCGTTCCGGG CAAATACACT CACGGCGCTT TTTTGCCTTC TTCGCGGTAC GCAACTGGTT
GCTGAAATAT CGGAATGTCT CGCCCAACAA ATCGACCCAG CGACGTGTGA TGCATCATTG
GATTCACTTC TTACAGTCAA AGACTTCATG GTACCATTCA TCTTGTACTT GCGATCACGC
GGCGATGAGA CACTGAATAA GGCGGATTTT GTCACAAATA TCGCTCCAGT TCTGGACGCT
TTGCTCGATT GGATGCGATG GAGACCGGGT GAAATGACGG CGCAGCTTGC CGATATAGCC
GCCACGTGTT TCGCCGTTCA AAATTCATTG AGTGAGCCAA TTTCGTATAA TGATGAAATT
GGCGAGTTTG TGGAATCCCT GACAGATGAG CAATCCGAAG AAAGTGCGTT GCAAACATTC
TTGACGCGCT GGTTATCACT AGAAAATCGT TCAAAGCGAG TGATAGACGT GATATGTTCG
ATGTCATTCG AAAAAACGTC GTCTGTCTTT TGCCGGCAAA AGAGAGACGC CATGTTGCAT
TTGCAAAAAT GTGCATGTTT GGCAGACAGA CAAAACGCAG TAAAATTTCT CGAGCGCGTC
GCATCCAGAT ATTTAGAGCA AGTCTCGACT TCCTCAAGTA CTCAGGTTCG TGTTCGTTCG
TTGTTAGCCT TGGGGAAAAC CATTGCACAG TTGATACGCG ATTTTAACGA CCATCAGACG
GCTAAGAAGT TTGTGGGAGA CATCTTCGTG TTCGCCGATG ACGCCGACAA AGCCGTTCGA
GTGGGAGCAT GCAGAGTGTT AGGTGCTTTG CAACATTGTC ACTCACAAGT CGCCACAATC
GTGCGCGGAC TCTTAGACTC GGCGGATACT AAATTTTTCA TACACAAGGA CGCCCTGCAG
TGTCTTGAAC ATCCCGAGGC GCTTGAGAAG GAATTCGGGA ACGACGCGCG ATAG
 
Protein sequence
MEALQQRERM AVLSGLVKAA DDAFDRAEID DVTKELFLDT QIEDEMKEMF DIARRIQFFD 
ETGKLEGALE LVKDFEKYAR SEDSKVVAVA VWCGAWSAIQ NELFRRSEWI RVLEKLFWTI
HTLCLSHRFS KIRACAAIQC GELATAAMLK LCDLRVHEAY ARDMLRLSLD EKCTVRHAVS
CSVSMCFVRR IYSGLFTVRE ADAALLCAMF VSAVGKRNPF AHIDYVNKAH LTKTMNEGFA
SVTHYTLERM IQCMLNDQLW RTDDKDILGR RLRACALIVE HVGDRVAPFV DSIIKMLLDT
LTKSLVWRDA EYLMEILMIH APSTRRTWVE PTMLHVNSEF TLHMVALCVC EKFLSRRKQH
GVTHALNLDD VAFISESLEK TMFRLDSCDK LDDSSGVLNC VESILASCLE ILRQQNKASM
ATRKFKKLEA CVKIQTMYFC VCGMSSIDVT GEMLMACTPK DEVEASEIFR LIVDRNVPLR
SFRANTLTAL FCLLRGTQLV AEISECLAQQ IDPATCDASL DSLLTVKDFM VPFILYLRSR
GDETLNKADF VTNIAPVLDA LLDWMRWRPG EMTAQLADIA ATCFAVQNSL SEPISYNDEI
GEFVESLTDE QSEESALQTF LTRWLSLENR SKRVIDVICS MSFEKTSSVF CRQKRDAMLH
LQKCACLADR QNAVKFLERV ASRYLEQVST SSSTQVRVRS LLALGKTIAQ LIRDFNDHQT
AKKFVGDIFV FADDADKAVR VGACRVLGAL QHCHSQVATI VRGLLDSADT KFFIHKDALQ
CLEHPEALEK EFGNDAR