Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42354 |
Symbol | |
ID | 5003323 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 690181 |
End bp | 692178 |
Gene Length | 1998 bp |
Protein Length | 631 aa |
Translation table | |
GC content | 59% |
IMG OID | 640418744 |
Product | predicted protein |
Protein accession | XP_001419459 |
Protein GI | 145350096 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0774885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAGAGA TCCTGCGCGG CGCCGATGGA CCGCCGAGGA TGTTCAGTCC GTTGGTGACG CGCGGCGGCG CGAGACGCGG CGACGCGCCG TTAGCGGTGT ACTTGCCGGG ATTAGACGGC ACTGGATTCA GCGCGGCGTC GCAGTTTGAG TACATCGCCG ATGAATTCAA TCTCATCGCG CTGAACGTGC CCGCGGGCGA TCGTGGTGAC GTTTTCGATT TAGTGAAAGC GACGACGGCT TACTTGGACA CGCACGTCGC GGCGGCGCGC GCGAACGGTG AGAACGAGGA CGTCTATCTC ATCGGAGAGT CGATGGGTGG TATGCTGTCT TTGTGCGTCG CAAGTGAGCG TCCAGATTTG ATCACGCGCT TGATTTTGGT CAATCCCGCG AGTTCGTTCG ATCGAAGCGC GTGGCCGGCG CTCGGCCCGT TGCTGCCGAA CGTCCCGAGC GAATTGTGGG GCGCCGTGCC GTACGCGCTG ACGCCGGTGC TGATCGATCC CGTACGCATG GCACGCGGTA TGATGGATAA AGTCATGTCG TCCGCGGTGT CGGACGATCC GTTGACAACC ATCGCGGCGG GGGTGGAAGA GCTCGCCGGA TTGCTACCGG CGCTTGGCGC GCTGGCCGAA ATCATCCCGC GCGAGACGCT CGCACATCGA TTGGATAAAG TCCTTCGCAT GGGATGTGAA TACTTGAACA GCGATGATTA CGCCAAGCTG ACAGCGATTG ACGTGCCCAC GCTCGTCATC GCAAGTGAGA ACGATAATCT GATACCGAGT TTGGCCGAGA GCGAACGTCT CAGGAAGTTT TTGCCCCGCG CCAAAGTCGA GGTATTGAAA GGTGCGTCGC ACGCGGCACT TCAGGAGCCG GGGGTCAATG TAATGACCAT CGCGCGTCGA AATGGGTTCG TTCCAAAGCG TGCAGATGCG CCGGTGATGA CGCGTGACGC AAAGTTTGAT CCACCGTCGC CGGCGGACAT CGAACGCGCT CGCGAAAGTC TCGCAGGTTT GCGAGCGCTG ACGTCACCGG TGTTTTTTAG CACGCGACCG GATGGGAAAA TTGTGCGCGG TCTCAGCGCG GTACCAATAC GCCAACGTGG TTCGCGACCG ATCTTGCTAG TTGGGAACCA CCAAACGATG GCGCCGGATC TCGGATTTCT AGTAGATGAA TTCTTGCGTG AATACGACGT CTGCCTTCGC GGCTTGGCGC ATCCTGTGGT GTCGCGCGAA GGCGGTGGCG ATGGATTCGG CGGCGAAGAC GCACCGCGCT CGTTCGAAGA TACGCTTCGT GACGCTGTGA AGAACACGCC CGTGGAACCG TTACTGCCGC GTCGAGAGCC GAAGCCCCCG CGGCGCGCGA TGAATATTGT CGGCGGCGGG TCATCATTCA CGTCTTTCGG CGCCGTGCCC GTCAGTGGCT TCGCGTTGTT TCGCCTACTA AAACAAGGCG AGGCCGTGTT GCTCTTTCCG GGTGGCGTTC GCGAAGCGTT CAAACGAAAA AACGAAAAGT ACAAACTCTT TTGGCCTTCC AAGCCAGAGT TCATTCGCAT GGCAATCAAG CACGACGCGA TAATCGTCCC GTTCGCGGCG ATCGGCGCCG AGGACTCCAT CGACATCGTC GCCGACGCCA ACGACTTGAT GAATAACCCT ATCGTGGGCG ATTCCGTCCG TAAACGCTCG CAAAGCGTTC CGAAGGCGCG CGCCGTCGAC ACTCGCGTCA CCGCGGACGC GGGAGAAGAG GAGTTATTCA TCCAGCCTGT CGTCGTACCC AAAGCCCCTG AGCGCTTCTA CTTTCGTTTC ATGGCGCCTA TTGACGTGAG TGGAGCGGAT TTGGATGACG AAGAGCGCGT CAAGGCGATT TACGAGCGAG TATACGGTGA AGTTGAAGGC GGTATACAGT ATCTGTTGCG CGAACGCGAG AGCGATCCAT TCAAAGAGCT TGCGCCGAGA ATAGTGTTCG AAGCGGCGAC CTCTACGCAG GCGCCGACGT TTCGTTAA
|
Protein sequence | MREILRGADG PPRMFSPLVT RGGARRGDAP LAVYLPGLDG TGFSAASQFE YIADEFNLIA LNVPAGDRGD VFDLVKATTA YLDTHVAAAR ANGENEDVYL IGESMGGMLS LCVASERPDL ITRLILVNPA SSFDRSAWPA LGPLLPNVPS ELWGAVPYAL TPVLIDPVRM ARGMMDKVMS SAVSDDPLTT IAAGVEELAG LLPALGALAE IIPRETLAHR LDKVLRMGCE YLNSDDYAKL TAIDVPTLVI ASENDNLIPS LAESERLRKF LPRAKVEVLK GASHAALQEP GVNVMTIARR NGFVPKRADA PVMTRDAKFD PPSPADIERA RESLAGLRAL TSPVFFSTRP DGKIVRGLSA VPIRQRGSRP ILLVGNHQTM APDLGFLVDE FLREYDVCLR GLAHPVVSRE GEPKPPRRAM NIVGGGSSFT SFGAVPVSGF ALFRLLKQGE AVLLFPGGVR EAFKRKNEKY KLFWPSKPEF IRMAIKHDAI IVPFAAIGAE DSIDIVADAN DLMNNPIVGD SVRKRSQSVP KARAVDTRVT ADAGEEELFI QPVVVPKAPE RFYFRFMAPI DVSGADLDDE ERVKAIYERV YGEVEGGIQY LLRERESDPF KELAPRIVFE AATSTQAPTF R
|
| |