Gene OSTLU_1996 details


Gene Information

Locus tag: OSTLU_1996
Symbol:
ID: 5004486
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 534515
End bp: 536038
Gene Length: 1524 bp
Protein Length: 508 aa
Translation table:
GC content: 55%
IMG OID: 640419907
Product: predicted protein
Protein accession: XP_001420373
Protein GI: 145352052
COG category:
COG ID:
TIGRFAM ID: [TIGR02244] HAD superfamily (subfamily IG) hydrolase, 5'-nucleotidase
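
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is an assumption about this record's convention rather than something the record states. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 534515
end_bp = 536038

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon covers three bases.
codons = gene_length // 3
leftover_bases = gene_length % 3

print(gene_length, codons, leftover_bases)
```

Under that assumption the computed length agrees with the listed gene length of 1524 bp, and dividing by three gives the listed protein length of 508 aa with no bases left over.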


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.722598
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TGGTCAACGA CGAACGAAGC CGAACGAGCG GAACGCGCCG CCACAAAGGT GTTTTGCAAT 
CGAAGTCTGA ACATGAAACG AATCAACGCG ATTGGATTCG ACATGGACTA CACGTTGGCC
ATGTACAAGC CGGAGACGTT CGAGATGATG TCGTATTCAG AGACTAAAAA AAAGCTGGTC
GAGTCGTACA ACTACCCGCG CGCGTTGCTC GAGTCGTTCG AGTTCGATCC GGATTACATG
GTGCGCGGGT TGGTGGTGGA TAAGAAGAGA GGGAACGTTT TAAAGATGGA TCGACACAAC
TATGTCAAGG TGGCGTATCA CGGATTCACG GCTTTAGACA CCGATGAACG GTTGGCGACG
TATTGCGAGA CGAGCAAGCG TCAAAGTTTT GACGGTCCCG AATTTAGCGC TTTAGACACA
TTATTTTCCA TGGGCGAGGC CTACCTGTTC AGCCAGCTCG TGGAGGCGAA AGATTCGGGC
AAGCATGGCG AGTTTTTTGA GAGAAAGACA TTCATGCAAA TGTACGACGA GATACGAGCG
GCGGTTGACT TGTGTCACAG AGACGGCTCG TTGAAGCACG CCGTGGCGGA AAATCCGAGC
AAGTACATCA CGAAGGACGC AGACTTGGTT CCTTTGCTAA AGGCTTTGCG AGCGAGCGGG
AAACAAGTGT TTTTGCTTAC GAACTCTCTG TGGGATTACA CAAACGTCGT CATGAATTAT
CTCATCGACG ACAACGTTGG CGACCAAAAA AAACTCGAAT GGCTCAACTT GTTTGACGTC
GTTGTGACTG GCTCGGCGAA ACCGGGCTTC TTCGCTAACG ATTCGGCGAC GATATTCGAA
GTCGACACCG CGACTGGGCT TTTGCACAAC ACCGACAACG GCGCACCATT GACTCCAATC
GGCAGCGTGT CGGATTCGAC GCACAAACGA GCGCTCGCGA GTGGGTTGCG CGCCACGGGC
AAAGGTCCGG CGAGGGTGTA CCAAGGCGGT TCGTACACGC ACCTTCACGC CATGCTCGGC
ATTGAGATTG GCTCGAGATT GCTTTACGTC GGCGACCACA TTTATGGCGA TATTTTGCGA
GCGAAGAAAG AGATTGATTG GCGCACGATG CTCGTCGTCC CCGAACTCGC GCACGAAATC
GACTGCCTGG AACGCATGAA GGAAAAACCG CACGCGCTCA GGCGTTTGCG CACGCTGCGC
GATTCTTTGG ACGATCAAGT CGCGCGTCAC GCCTGGCTCG CGGCGAACGC CGCCGACCCG
AAAGCAAACG AAGAGGAGTT AGAACGCGCG AGACAACTGT CGGCGACGGC GCGCACCGCG
CATCGCGAGG GTATGCGCGA GTATCACAAG TCGTTTCACT ATGTCTGGGG TCAACTCATG
AAGGCTGGGT CGCAGAACTC TCGTTTCGCG TTTCAAGTTG AGCGGTACGC GTGCCTGTAC
ACGTCGCACG TGAGGAATCT GTGGGGCTAC TCGCCCGAGA AGGTTTTCCG CGCCCCCGCT
GACTTTTCGC CGCACGACTT GGAC
 
Protein sequence
WSTTNEAERA ERAATKVFCN RSLNMKRINA IGFDMDYTLA MYKPETFEMM SYSETKKKLV 
ESYNYPRALL ESFEFDPDYM VRGLVVDKKR GNVLKMDRHN YVKVAYHGFT ALDTDERLAT
YCETSKRQSF DGPEFSALDT LFSMGEAYLF SQLVEAKDSG KHGEFFERKT FMQMYDEIRA
AVDLCHRDGS LKHAVAENPS KYITKDADLV PLLKALRASG KQVFLLTNSL WDYTNVVMNY
LIDDNVGDQK KLEWLNLFDV VVTGSAKPGF FANDSATIFE VDTATGLLHN TDNGAPLTPI
GSVSDSTHKR ALASGLRATG KGPARVYQGG SYTHLHAMLG IEIGSRLLYV GDHIYGDILR
AKKEIDWRTM LVVPELAHEI DCLERMKEKP HALRRLRTLR DSLDDQVARH AWLAANAADP
KANEEELERA RQLSATARTA HREGMREYHK SFHYVWGQLM KAGSQNSRFA FQVERYACLY
TSHVRNLWGY SPEKVFRAPA DFSPHDLD
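
As a rough consistency check between the two sequences, the first line of the gene sequence can be translated and compared with the start of the protein sequence. The record leaves the translation table field blank, so this sketch assumes the standard genetic code. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed; the record's "Translation table" field is empty).
# Amino acids are listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record above.
first_line = "TGGTCAACGA CGAACGAAGC CGAACGAGCG GAACGCGCCG CCACAAAGGT GTTTTGCAAT"
print(translate(first_line))
```

Under the standard-code assumption, the 20 residues produced from this line match the first 20 residues of the listed protein sequence (WSTTNEAERA ERAATKVFCN).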