Gene Information |
Locus tag | OSTLU_43599 |
Symbol | |
ID | 5006522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009374 |
Strand | - |
Start bp | 52878 |
End bp | 53786 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | |
GC content | 59% |
IMG OID | 640421943 |
Product | predicted protein |
Protein accession | XP_001422640 |
Protein GI | 145356857 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1877] Trehalose-6-phosphatase |
TIGRFAM ID | [TIGR00685] trehalose-phosphatase [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
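The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the values listed for this unspliced CDS.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above.
length = gene_length(52878, 53786)
print(length, expected_protein_length(length))  # prints "909 302"
```

Both results match the Gene Length (909 bp) and Protein Length (302 aa) fields of the record.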
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.014674 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.825443 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGAAA AGTATTGGAA CGATGAGGAG GAGTGGCGAG GGTATCACCC GAACGCGTGC GACGAGGAAA ACAAGTTTGT CGACGCGGCG GAGGGGAAAC TGTTGACGGT GTTTTTAGAC TACGACGGGA CGCTGGCGCC AATCGTGCCG GAGCCGGATA AGGCGTTCAT GAGCGACGAG ATGCGGGAAG CGGTGCGGCA GTGCGCGAAA AGGTTTCCCA CGGCGATCAT CAGCGGTAGA TCGAGGCAGA AGGTGTCGCA GTTCGTAAAG CTGGACGAGT TGTACTACGC CGGGTCGCAC GGGTTGGACA TCGCGGGGCC GAAGACGACG ACGGATGGCG AGCCGATTGA GAAGAAATTG GCGCATCAAC CCGCGCAGTG GGCGCGCGAC GTGATGGATC GCGTGACGAA GGAGTTGATC GAAAAGTGTG CGGACATTCC GGGGACGAAC ATCGAGCACA ACATGTTTTG CGTCTCGGCG CACTACCGCG CGGTTTCGGA AGAGCTCAGG CCTCGCGTCG AAGCCGTCGT GGATGAGATT TGCGCCTCGG AGGAGTGCTT GATCAAGCAC GACGGTAAGA TGGTTTGGGA GGTTCGGCCA CGCGTGGCGT GGGATAAGGG TAAGGCGCTG TCGTACCTGC GAGACGCCCT GCTCCCAGAC TTGAGCGAGA AGGGGTTCAG ACCCGAAGAC GTCTTCACAA TTTACATCGG CGACGACGTC ACGGATGAGG ACGCGTTTAT GGAGATTAAC GAAGAATTGG GCGATCACTT GGGCGTGGGA CTTCTGGTGT CGAGCTCACC CAAGGTGAGC GCGGCCAAGT TTAGCCTGCG CGACTCGGGT GAAGTATTGC GGTTCCTCAC TCGACTCCGC GAGCTCGGCG ACGCGGGCAC GATCAAAACG CTGCCTTGA
|
Protein sequence | MIEKYWNDEE EWRGYHPNAC DEENKFVDAA EGKLLTVFLD YDGTLAPIVP EPDKAFMSDE MREAVRQCAK RFPTAIISGR SRQKVSQFVK LDELYYAGSH GLDIAGPKTT TDGEPIEKKL AHQPAQWARD VMDRVTKELI EKCADIPGTN IEHNMFCVSA HYRAVSEELR PRVEAVVDEI CASEECLIKH DGKMVWEVRP RVAWDKGKAL SYLRDALLPD LSEKGFRPED VFTIYIGDDV TDEDAFMEIN EELGDHLGVG LLVSSSPKVS AAKFSLRDSG EVLRFLTRLR ELGDAGTIKT LP
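The sequences above are printed in blocks of ten characters, so the spaces must be removed before any computation. The following is a minimal sketch, not the database's own tooling, of how the GC content and the protein translation could be recomputed from the gene sequence. The helper names are hypothetical, and because the Translation table field of the record is empty, the standard genetic code is used here as an assumption.

```python
from itertools import product

# ASSUMPTION: standard genetic code; the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGATTGAAA AGTATTGGAA CGATGAGGAG"))  # prints "MIEKYWNDEE"
```

The printed residues match the first block of the protein sequence in the record. Passing the full Gene sequence string to gc_percent and translate allows a comparison against the GC content and Protein sequence fields.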