Gene OSTLU_92236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_92236 
Symbol 
ID4999633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009355 
Strand
Start bp1089980 
End bp1090846 
Gene Length867 bp 
Protein Length288 aa 
Translation table 
GC content60% 
IMG OID640415054 
Productpredicted protein 
Protein accessionXP_001415685 
Protein GI145341167 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.000528701 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGCG AAGGCGATCC GAACGGCGCC TCGCGCGCGC CCCTGCGCGC GGTTTTCTTC 
GACTTTGACG ACACACTCGC GGAAACCACC CTGGCCGATC GCGTCGCGTA TCGCGAATGC
GCGATTCGCA TGGAGACCGT ATACGGGCTG TCGAAAAAGC GACAAGATGA AGTGATCGCT
GCGTACAAGC GGCGGCTCGC GGAGCGTCCC TGGAACGACG AGTTTGCGCA CGTGTGGACG
CACCGCGAGC GGCTGTGGGC GGAAGCGTTC GGCGACGACG ACAGAGGGCT CGCAATGCGG
CACGACGTGA ACAGCACATT TAGGGACTGT CGCTTGGAAC AGCTACGTTT AAACAGTTCT
GTGTGCGGTG GCATCGAGAA GTTGCGCGCG AAGAATGTGC ACGTCGTCAT CATCACGAAT
GGCCACCACG TCGTGCAGCG AGAGAAGCTC GCCGCGTGCG GGATATACGA AGTAGTGAAG
TTGGAAAACA TCCTCGTCGG TGGCGAAGAA GTTCTCGCCG GTCGCGACGA GAAACCAGAG
GCGTCCATCT TTCACGAGGC GTGCAAACGC GTCGACGTGG TACCAGACGA AGTTATGCAC
GTAGGCGACT CGTGGACCGC CGACATGGTC GGCGCCGAAA ACGCTGGTCT GCGTTGGAGA
GTGTGGGTGT CCCAACGTCC CGACGACGAG AAGTGCGAGA GCGAACAGGA ACTGTCATCG
TCGAAGCGAG CGAAAAAGGT CGACGCCGTT CCGCGCGTAG AAAACATCAA AGAATTTTTC
GAGCTCCTGG ACGAGTGGTT GGACGAAGAC GGCACGCTGC CCGCATCCAA CATTCTCTTG
AAGACGCGGA GCCGTAGCGA GCGTTAG
 
Protein sequence
MSREGDPNGA SRAPLRAVFF DFDDTLAETT LADRVAYREC AIRMETVYGL SKKRQDEVIA 
AYKRRLAERP WNDEFAHVWT HRERLWAEAF GDDDRGLAMR HDVNSTFRDC RLEQLRLNSS
VCGGIEKLRA KNVHVVIITN GHHVVQREKL AACGIYEVVK LENILVGGEE VLAGRDEKPE
ASIFHEACKR VDVVPDEVMH VGDSWTADMV GAENAGLRWR VWVSQRPDDE KCESEQELSS
SKRAKKVDAV PRVENIKEFF ELLDEWLDED GTLPASNILL KTRSRSER