Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_34354 |
Symbol | |
ID | 5000741 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 633382 |
End bp | 635171 |
Gene Length | 1790 bp |
Protein Length | 567 aa |
Translation table | |
GC content | 58% |
IMG OID | 640416162 |
Product | predicted protein |
Protein accession | XP_001417006 |
Protein GI | 145344989 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.311045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACG GCGACGCGAG CGATTTGAAC AAGTGGTCGA GAAAGATCAC GCAACCGAAG AGCCAAGGAG CGTCGCAGGC GATGCTGTAC GCCACGGGCC TGACGGAGGC CGATATGAAC AAACCGCAGG TGCGCGCGAG CGAAGCGAAG GGAATGCGCG AAGACGCGCG GCGACGGCTC GACTGACGAA AGGAATCGCG TGTCGTTGGC GCTAGATCGG CGTGTCCTCG GTGTGGTGGC AGGGAAACCC TTGTAACAAA CACTTGCTGG ACCTGGCGGG TAAAGTGGCG GAAGGCGTCA AGGCGGCGGA TATGGTGAGC TTTCAGTTTA ACACCGTGGG GGTGTCGGAC GGGATTTCCA TGGGTACGCC GGGCATGTCC TTCTCGTTGC AATCGCGGGA TTTGATCGCG GATAGCATCG AGACCGTGAT GGGTGGACAG TGGTACGACG GAAATATTTC TCTCCCGGGG TGCGATAAGA ACATGCCGGG TACGATCATG GCCATGGGTC GATTGAACCG CCCGTCGCTG ATGATTTACG GCGGTACCAT TCGCCCTGGC CACTCCGCCG TGGATGGTGG CACACTCGAT ATCGTATCCG CGTTCCAATC GTACGGACAG TTCGTTACCG GCGCTATCAC GGAGGAACAG CGCAAAGACA TCGTGCGTAA CTCTTGCCCG GGCTCTGGCG CGTGCGGCGG CATGTACACC GCCAACACCA TGGCCAGCTG CATCGAGGCT CTCGGCATGA CTCTTCCGTA TTCTTCCTCC ATTCCCGCGG AAGATCCTCT CAAGATGGAT GAATGTTTCA TGGCTGGCGC TGCGATGAAG CATTTGCTCG AAATCGACCT GAAGCCGCGT GACATCATGA CTCGCGCGGC GTTCGAAAAC GCCATGGTCA CCGTCATCGC TCTCGGCGGT TCCACCAACG CGGTTCTTCA CTTGATCGCG ATGGCGCACT CTGTCGGCAT CAAATTGACT CTGGACGACT TCCAAGCCGT CTCCAACAAG ACGCCGTTCA TCGCTGACTT AAAGCCGTCC GGTAAGTACG TCATGGAGGA CGTCCACAAG GTTGGCGGCA CTCCGGCGGT GTTGAAGTAT TTGATGTCTG AAGGCATGAT TGACGGTTCT TGCATGACTG TCACCGGTAA GACCCTCGCC GAGAACCTCG CCATCTGCCC GGATTTGACG CCGGGGCAAG ATGTTATTCT CCCGGTGAGC ACGCCCATCA AGAAGACTGG TCACTTGCAA TGCTTGTACG GCAACATCGC CCAGGGAGGC TCCGTGGCGA AGATCACCGG TAAGGAAGGC TTGTACTTTA AAGGCTTCGC GAAGTGCTAC GATAGCGAAG AGGAAATGCT CGAGGCCTTG GCGGCAGACT CCGAGTCTTT CAAGGGTAGC GTCATCGTCA TTCGCTACGA AGGTCCGAAG GGTGGTCCGG GCATGCCGGA GATGCTCACG CCGACGTCGG CCATCATGGG TGCCGGTCTC GGGAACGACT GCGCGCTCAT CACGGATGGT CGATTCTCCG GTGGCTCGCA CGGTTTCGTC ATCGGTCACG TTACGCCGGA AGCGCAAGTT GGTGGTAACA TCGGTTTGAT CAAGGACGGT GATATCATCG AAATTGATGC CGAAGTTCGC ACCATCAACG CGCCGGATGT CACCGACGCC GAGTGGGAAA AGCGTCGAGC GGCGTGGAAG GCGCCGCCTT TGGAAGCGAC GTCGGGTACG CTCTACAAGT ACTGCAAGCT CGTCGCCAGC GCTTCGGAAG GCTGCATCAC GGACTTGTGA
|
Protein sequence | MPDGDASDLN KWSRKITQPK SQGASQAMLY ATGLTEADMN KPQIGVSSVW WQGNPCNKHL LDLAGKVAEG VKAADMVSFQ FNTVGVSDGI SMGTPGMSFS LQSRDLIADS IETVMGGQWY DGNISLPGCD KNMPGTIMAM GRLNRPSLMI YGGTIRPGHS AVDGGTLDIV SAFQSYGQFV TGAITEEQRK DIVRNSCPGS GACGGMYTAN TMASCIEALG MTLPYSSSIP AEDPLKMDEC FMAGAAMKHL LEIDLKPRDI MTRAAFENAM VTVIALGGST NAVLHLIAMA HSVGIKLTLD DFQAVSNKTP FIADLKPSGK YVMEDVHKVG GTPAVLKYLM SEGMIDGSCM TVTGKTLAEN LAICPDLTPG QDVILPVSTP IKKTGHLQCL YGNIAQGGSV AKITGKEGLY FKGFAKCYDS EEEMLEALAA DSESFKGSVI VIRYEGPKGG PGMPEMLTPT SAIMGAGLGN DCALITDGRF SGGSHGFVIG HVTPEAQVGG NIGLIKDGDI IEIDAEVRTI NAPDVTDAEW EKRRAAWKAP PLEATSGTLY KYCKLVASAS EGCITDL
|
| |