Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17975 |
Symbol | |
ID | 5005522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 44201 |
End bp | 46233 |
Gene Length | 2033 bp |
Protein Length | 678 aa |
Translation table | |
GC content | 65% |
IMG OID | 640420943 |
Product | predicted protein |
Protein accession | XP_001421374 |
Protein GI | 145354188 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.796717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000290426 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGAGAA GCGGCGACGG ATGGGACGAC GACGCGTGCC TGGCGTGCGC GTGCGCGGTG ACGAAAAATA ATGATGGATT CAAGTGGTCG GACGCGGAGG GCGGGGACGG GGTGGAGAGC GTCGCGCGAA GGATGTCGCC GAGCGGGCGG TGCTCGTGGT GCGGCAAAGA GACGCGACAT CGAGTGACGC GAGCGACGGG GAGAAGAGGG ACGTATCTGA GTCACTGCGC GCGGTGCGCG AGGGCGACGC ACAAGTGCGA ACGATGCGCG GAGGGATTCG CGAAGATCGG GGACGGGAGG TGCGCGAAGT GCGCGGGATG GGTGGAGACG TGGACGACGA GCGAGGCGTT CGCGGCGGCG ACGAGACGCG TCGCGTGGTG CTCGTGGTGC GGGGAAAAGA GTTCGCACGT CAGGCGTGGG ACGCGCGGGA GCGACGCGTA CGAGTGCATG GTGTGTGGGG GGGGGACGGC GGCGTGCGAG CGGTGCGGCG AGGACGCGGA GGTGATGCGC AAGCGCTCGC GGCCGGGTGG GGGATCGTGC GCGAGGTGCG CCGCGGCGGA GACGACGACG CGCAGAATAA ACATCGGTGG ACACGTGTCC GTGCCCACGA TCGGGGCGTT GGCGAAGGCG TTTAAGCGCG CGCTGAGCCG CGGGGACGAC GCGGCGACGG CGACGGCGTC GGCGGAGGAG ATCGCGCGCG GGTGGGATTT GCGGCTGGCG AAACGCGAAG CCGCGGACGA GCGCGCGGGG TTCATATTTG ACGTTCTCGA CCGCGAGAGC GATTACCGAG ACAAGGCGTA CCGCGCCGGT TTGATACGCC CGTTCTTACT CCTCGCCACG CTTCCTCCGC GCGAGCGCGT TCGACTCGGC ATGCGGTTGG GAGTGACGCT GTGTCGAAGC TCGGCGTACT TGGATCCGCA CGCCGAGGCG TGGAAGTTGC TCAGGGATCC GATGTGCGGA CTGCAGACGA GAGGCGGGAG CGTGTCGCGC GTCGTCGAAA AAGTCACCGG CGTCGGACGC GGGGCGAATT GGATCGATAT ATTGTACTCC GCGCTGACCC TCGGCGCGGA CACCGGAAAG TGTCCGGCGA CGGATCCGAG CGAGCTCGAC GCGCTTCCGA AATTTCGTTC AACCGGTCAC GCGATGTTTG CTCTTCGCGT CGCATCGCAC CCGTCGTTGA GCGCATTTGA AGTCGCGACG CTGCGTCTGG TCGGTCGCGC GCAACGCGGC CGACTCGCAC CGGCGTCGAC GATTGTTCTG GACGGCGTGT GCCGACATCC TCGCATGGCG ACGCTGCGGT CGCGATTAGC CAAGGCGTAT CCGCGCCACG CGGACGAAGT TTCCCGACAC GCCGTGACGT GCGCGTTCGC TTCGTCAGAA TGGGCGTCGA CGATGCGACC ATCGAGTCCG GGAGATGTCG AAGAGACGGC GGAAGAAGTG TTCGGAATGT TGCTCGACGG CGCGCCGTTT TCGCGCCGCT GCGGCGCCGC GGACGAAGAC GTACTCGACG AAAGCGCCGA CGCGCACTTG TCGCGAGGCG CACCTTCGTT GCTAGGATCT TTAGCCTCCG CGACGTCGAT TGGACTCGCA TCTTACGCCG CCGCTCAATT TGCGCCGAAG AAATTCGCCG TGCTCACGCC CAGGGACATC ATGGATTTGA CGACGGGCGT TCGCACGCCG ACGCTGACGT CGTCGGGCAT TTTCGAACCG GTGGCGGTGA TGCTCATACA CAACGTCCTT CTCGCCGCGA GAAACGTGCA TGTGGACGAG CACTTGCCGG ACGAAGCGTC GCGATTGTCG AGAGACGCTC TGATGTCCGC ACAGTCGCCA TCCGTTGCCG CGCCGACGCC CGAACCGGCG AGCGAGACGA GTCCAGGGTC GCCCATTGAC GACGAACCCG GTTCGCCCTG GGCGCCTAAA TACGCGCATC TTCAAGACGA GGACGCGAAA GTGCGTTTAG AAGAATTAGA AGAGGCTGAG GAGTCCGCGG CGATGCTCGC GAGATACATC AACGCGTTGG ATACCTCGGA TCT
|
Protein sequence | MARSGDGWDD DACLACACAV TKNNDGFKWS DAEGGDGVES VARRMSPSGR CSWCGKETRH RVTRATGRRG TYLSHCARCA RATHKCERCA EGFAKIGDGR CAKCAGWVET WTTSEAFAAA TRRVAWCSWC GEKSSHVRRG TRGSDAYECM VCGGGTAACE RCGEDAEVMR KRSRPGGGSC ARCAAAETTT RRINIGGHVS VPTIGALAKA FKRALSRGDD AATATASAEE IARGWDLRLA KREAADERAG FIFDVLDRES DYRDKAYRAG LIRPFLLLAT LPPRERVRLG MRLGVTLCRS SAYLDPHAEA WKLLRDPMCG LQTRGGSVSR VVEKVTGVGR GANWIDILYS ALTLGADTGK CPATDPSELD ALPKFRSTGH AMFALRVASH PSLSAFEVAT LRLVGRAQRG RLAPASTIVL DGVCRHPRMA TLRSRLAKAY PRHADEVSRH AVTCAFASSE WASTMRPSSP GDVEETAEEV FGMLLDGAPF SRRCGAADED VLDESADAHL SRGAPSLLGS LASATSIGLA SYAAAQFAPK KFAVLTPRDI MDLTTGVRTP TLTSSGIFEP VAVMLIHNVL LAARNVHVDE HLPDEASRLS RDALMSAQSP SVAAPTPEPA SETSPGSPID DEPGSPWAPK YAHLQDEDAK VRLEELEEAE ESAAMLARYI NALDTSDL
|
| |