Gene OSTLU_34354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_34354 
Symbol 
ID5000741 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp633382 
End bp635171 
Gene Length1790 bp 
Protein Length567 aa 
Translation table 
GC content58% 
IMG OID640416162 
Productpredicted protein 
Protein accessionXP_001417006 
Protein GI145344989 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.311045 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGACG GCGACGCGAG CGATTTGAAC AAGTGGTCGA GAAAGATCAC GCAACCGAAG 
AGCCAAGGAG CGTCGCAGGC GATGCTGTAC GCCACGGGCC TGACGGAGGC CGATATGAAC
AAACCGCAGG TGCGCGCGAG CGAAGCGAAG GGAATGCGCG AAGACGCGCG GCGACGGCTC
GACTGACGAA AGGAATCGCG TGTCGTTGGC GCTAGATCGG CGTGTCCTCG GTGTGGTGGC
AGGGAAACCC TTGTAACAAA CACTTGCTGG ACCTGGCGGG TAAAGTGGCG GAAGGCGTCA
AGGCGGCGGA TATGGTGAGC TTTCAGTTTA ACACCGTGGG GGTGTCGGAC GGGATTTCCA
TGGGTACGCC GGGCATGTCC TTCTCGTTGC AATCGCGGGA TTTGATCGCG GATAGCATCG
AGACCGTGAT GGGTGGACAG TGGTACGACG GAAATATTTC TCTCCCGGGG TGCGATAAGA
ACATGCCGGG TACGATCATG GCCATGGGTC GATTGAACCG CCCGTCGCTG ATGATTTACG
GCGGTACCAT TCGCCCTGGC CACTCCGCCG TGGATGGTGG CACACTCGAT ATCGTATCCG
CGTTCCAATC GTACGGACAG TTCGTTACCG GCGCTATCAC GGAGGAACAG CGCAAAGACA
TCGTGCGTAA CTCTTGCCCG GGCTCTGGCG CGTGCGGCGG CATGTACACC GCCAACACCA
TGGCCAGCTG CATCGAGGCT CTCGGCATGA CTCTTCCGTA TTCTTCCTCC ATTCCCGCGG
AAGATCCTCT CAAGATGGAT GAATGTTTCA TGGCTGGCGC TGCGATGAAG CATTTGCTCG
AAATCGACCT GAAGCCGCGT GACATCATGA CTCGCGCGGC GTTCGAAAAC GCCATGGTCA
CCGTCATCGC TCTCGGCGGT TCCACCAACG CGGTTCTTCA CTTGATCGCG ATGGCGCACT
CTGTCGGCAT CAAATTGACT CTGGACGACT TCCAAGCCGT CTCCAACAAG ACGCCGTTCA
TCGCTGACTT AAAGCCGTCC GGTAAGTACG TCATGGAGGA CGTCCACAAG GTTGGCGGCA
CTCCGGCGGT GTTGAAGTAT TTGATGTCTG AAGGCATGAT TGACGGTTCT TGCATGACTG
TCACCGGTAA GACCCTCGCC GAGAACCTCG CCATCTGCCC GGATTTGACG CCGGGGCAAG
ATGTTATTCT CCCGGTGAGC ACGCCCATCA AGAAGACTGG TCACTTGCAA TGCTTGTACG
GCAACATCGC CCAGGGAGGC TCCGTGGCGA AGATCACCGG TAAGGAAGGC TTGTACTTTA
AAGGCTTCGC GAAGTGCTAC GATAGCGAAG AGGAAATGCT CGAGGCCTTG GCGGCAGACT
CCGAGTCTTT CAAGGGTAGC GTCATCGTCA TTCGCTACGA AGGTCCGAAG GGTGGTCCGG
GCATGCCGGA GATGCTCACG CCGACGTCGG CCATCATGGG TGCCGGTCTC GGGAACGACT
GCGCGCTCAT CACGGATGGT CGATTCTCCG GTGGCTCGCA CGGTTTCGTC ATCGGTCACG
TTACGCCGGA AGCGCAAGTT GGTGGTAACA TCGGTTTGAT CAAGGACGGT GATATCATCG
AAATTGATGC CGAAGTTCGC ACCATCAACG CGCCGGATGT CACCGACGCC GAGTGGGAAA
AGCGTCGAGC GGCGTGGAAG GCGCCGCCTT TGGAAGCGAC GTCGGGTACG CTCTACAAGT
ACTGCAAGCT CGTCGCCAGC GCTTCGGAAG GCTGCATCAC GGACTTGTGA
 
Protein sequence
MPDGDASDLN KWSRKITQPK SQGASQAMLY ATGLTEADMN KPQIGVSSVW WQGNPCNKHL 
LDLAGKVAEG VKAADMVSFQ FNTVGVSDGI SMGTPGMSFS LQSRDLIADS IETVMGGQWY
DGNISLPGCD KNMPGTIMAM GRLNRPSLMI YGGTIRPGHS AVDGGTLDIV SAFQSYGQFV
TGAITEEQRK DIVRNSCPGS GACGGMYTAN TMASCIEALG MTLPYSSSIP AEDPLKMDEC
FMAGAAMKHL LEIDLKPRDI MTRAAFENAM VTVIALGGST NAVLHLIAMA HSVGIKLTLD
DFQAVSNKTP FIADLKPSGK YVMEDVHKVG GTPAVLKYLM SEGMIDGSCM TVTGKTLAEN
LAICPDLTPG QDVILPVSTP IKKTGHLQCL YGNIAQGGSV AKITGKEGLY FKGFAKCYDS
EEEMLEALAA DSESFKGSVI VIRYEGPKGG PGMPEMLTPT SAIMGAGLGN DCALITDGRF
SGGSHGFVIG HVTPEAQVGG NIGLIKDGDI IEIDAEVRTI NAPDVTDAEW EKRRAAWKAP
PLEATSGTLY KYCKLVASAS EGCITDL