Gene OSTLU_94831 details

Gene Information

Locus tag: OSTLU_94831
Symbol:
ID: 5003990
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 244871
End bp: 245905
Gene Length: 1035 bp
Protein Length: 339 aa
Translation table:
GC content: 60%
IMG OID: 640419411
Product: predicted protein
Protein accession: XP_001420119
Protein GI: 145351511
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0191] Fructose/tagatose bisphosphate aldolase
TIGRFAM ID: [TIGR00167] ketose-bisphosphate aldolases
            [TIGR01520] fructose-bisphosphate aldolase, class II, yeast/E. coli subtype


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.179075
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.245403
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGACG TGCTGAACCA CGCGCGTCAT CACGGATACG CGATTCCGGC GGTGAACTGC 
ACGATGTCGC CGGTGATCAA CGCGTGCCTC GAGGCGGCGA AGAAGGCGAA CGCGCCGATG
ATCGTGCAAT TCTCAAACGG CGGCGGATAC TTCATGGCCG GTAAGGGCAC GAGCAACGCG
GATGAGAAGG CCGCGGTCGC GGGATGCGTC GCGGGCGCGG TCATGGTGCG GGAATTGGCG
GCGCTGTACG GCGTGCCGGT GATTTTGCAC ACCGACCACT GCGCGAAAAA CTTGCTCCCG
TGGTTCGACG GCTTGTTGGA GGCGAACGAG GCGTACTTTG CCAAGCACGG TGAACCGTTG
TTCTCGTCGC ACATGCTCGA CTTGAGCGAG GAACCGATCG AGGAGAACTT GGAAATCTGC
AAGAAGTACT TGGCGCGCAT GAAGAAGATC AACTGCTTCC TCGAGATGGA GATCGGCATC
ACCGGCGGCG AGGAGGACGG CGTCGACAAC TCGGACGTCG CGCCGGAAGA CCTCTACTCC
AAGCCGGAAG AGATCTACCA AGTCTACGAA GCGCTCCAAC CGATCGCGCC GGACTTCTAC
TCCATCGCTG CGGCGTTCGG TAACGTGCAC GGCGTCTACT CCCCGGGTAA CGTCAAGCTC
ACGCCGAAGA TTTTGGCCAA CGCGCAAAAG TTCGTCAAGG AAAAGACGGG CGGCGACAGC
GACAAGCCGG TGTACTTTGT CTTCCACGGC GGCTCCGGTT CTTCCCGCGA AGAAATCCGT
GAGGCGGTTG GCTACGGTGT CATCAAGATG AACATCGACA CCGACACCCA GTGGAGCTTC
TGGGATGGCG TCAAGAACTA CGAAGCCAAG AACAACGCGT ACCTCCAAGG CCAAATTGGC
AACCCGGAGG GCGCCGAAAA GCCGAACAAG AAGTACTACG ATCCGCGCGC GTGGATCCGA
TCCGCGGAAC AAGCGACGTG CGACCGGTTG CTTGCGGCTT ATGAAGATCT CAACGCGACG
AATGCGCTCA GGTAA
 
Protein sequence
MIDVLNHARH HGYAIPAVNC TMSPVINACL EAAKKANAPM IVQFSNGGGY FMAGKGTSNA 
DEKAAVAGCV AGAVMVRELA ALYGVPVILH TDHCAKNLLP WFDGLLEANE AYFAKHGEPL
FSSHMLDLSE EPIEENLEIC KKYLARMKKI NCFLEMEIGI TGGEEDGVDN SDVAPEDLYS
KPEEIYQVYE ALQPIAPDFY SIAAAFGNVH GVYSPGNVKL TPKILANAQK FVKEKTGGDS
DKPVYFVFHG GSEIREAVGY GVIKMNIDTD TQWSFWDGVK NYEAKNNAYL QGQIGNPEGA
EKPNKKYYDP RAWIRSAEQA TCDRLLAAYE DLNATNALR
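
The length and GC content fields in the Gene Information table can be rechecked from the coordinates and the gene sequence. The following is a minimal sketch, not the method the database itself uses. The helper names (clean_sequence, span_length, gc_fraction) are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is consistent with the listed gene length of 1035 bp.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of ten."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Start and end coordinates as listed in the Gene Information table.
print(span_length(244871, 245905))  # 1035

# Only the first line of the gene sequence, as a short demonstration;
# paste the full sequence to check the whole gene.
first_line = clean_sequence(
    "ATGATCGACG TGCTGAACCA CGCGCGTCAT CACGGATACG CGATTCCGGC GGTGAACTGC"
)
print(len(first_line))  # 60
print(round(gc_fraction(first_line), 2))
```

Because the gene is marked as spliced, the protein length cannot be derived by simply dividing the gene length by three; the coding portion is shorter than the 1035 bp genomic span.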