Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_15211 |
Symbol | |
ID | 5001559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 837819 |
End bp | 839849 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | |
GC content | 61% |
IMG OID | 640416980 |
Product | predicted protein |
Protein accession | XP_001417618 |
Protein GI | 145346276 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0000157617 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG AGGTTAGCGA CGCGCACGCC GCGTGTCGAT GGATTTCAGA ACTCTTCGCC GAATGCGACG GTCACGAGTC GAGCGTGAAC TGCAGCTGGC GAACTCTAGA TTTAGTTTTG CGCGTGTTGC TACGCGATAG GTATTGGGCG TGCGAAGCCA CGCGATTCGC GCTGTGCGAA ACGTTGCTAT TGCGAAAGAG CGCGCCGCGC GCCGCGCTGC CGGCGCTCTT ACGGTTGACG ATTCTGCGAC CCATTCGTCG AGGCGCCGAC GCCACGCGCG TCAAAGCGGC GAGCGATGCG AATATATTCG CACTCTTGGA TACTTGGTCC GGGGAAGGTT TCGTGCGTGA GGCGTCCGTC GAGCTACAAC GGACGCTCAC GTCGAGCGTT CGAGCAATTC TTGCGGCGTT GCCGAGCGAA GAGTGGGACG CTATGCGAGG AAAGCCGACG CAAATGTTAT TGAAAGGCGT GAGCGCGCGT CTCGATTCGC CATCATTGCG TCCGCGCAGA CACGCGAGCA AAGTCGCCTT GGAACTTTCG CTGAAGATGG ACGCGAGCAA GCCTTTGCGA CTCGTCGATG ACGGCGACGA CGTCGACGCG TCATCGGACG AGGAATCGGA ATGGGAACAG ACCATAGAGT CGATCGTTCA AGACGACGTC TTCGTCGTCT GCAATGATGA CGACGGCGGC GGCGGCGGCG ACGATGAAAA AGAATCCATC GCCGCCAACG TCAATCTAAC CGGTTCGTCT GGGTTTCAGA TTAAAGTCGA AGACCCGGAC GAAGTTGTCG ATATGTGGAG CCTTCGCCGC GACGAGAGCG ACTCGGATGC CTCCGATAGC GATGACTATT CCGATGACGA GCTCGTACCG TATGATATGG ATAGTGATGA CGACGCGACG CTTCGTTCTC GTGATCCAGC GAGTTTGACT GAAGCGCGAA TCGCGTCGCT GCCGAAGCCG CAAACTTTAC GAGAGTGCAT CGGCGCCTTG AGGCAGGCGC GTTCGGGCGA TGCGTCGACT CGGCAGACCG ATATCGATAT TGCGGACGCA GCTGAAGGTG CGGTGCACGC AGTTTCCGAT ATCGTCATGC GCCAGCCGCA CGAATTGGCG TCGTGCGCCG CGGATCTCGC TGTCACCATT TTGCACGCGC AACCGCCCAC GCCGGACTCC GACCCACTCG ACCGCGCGCG GCGCCAAGGC TTAGCTTCGG TGCTCACGGT GACCCCTGGT CTCGCAGGCC CAACAATTAT CGACCACGCG TTGTCGGAGA AATGCGACGC GTCCCAAGTT ATGGACACGC TGAGCGCGTT GGACATGGCG ATGACGGAGC TCGCGTCGCC GTCAAAAGCC GAGCACCTGC TCGATGCTTC GGCGTCGACT CGTTCGACTG CTCGCGTGGG TATCGAGCGA AGGTTTGCGC CGAAATCCAT GGAGATGCAA GCGAAAGGTG CGACATCGGC GCCGACGCGA AGTCATCTCA TAGGCGAGTG CTTCGTCGCG CCGCTCTTGC GCGCGGCGGC GCAAAGGCTC GAGCGTCAGT CCGCGCCAGA ATACAACCCC GATGGCCTGG ACGCCATGGT GAACGGACAT ATTCTCTACA CACTCGGTCA GTGCGCCAAG CACGCCAAGA ACGCGACCGA TGGACCGTTG ATCGGCCGCG CGGTGCTTGA ATTCGCCGTC ACTCCCGCGT TGGCAGACTC CGACCAACCC CATCTCCGCC GTTCCGCCCT CGTCGCCGGT GCCCTCGTCG CCACTTCTCT CCGAGACACC CCCGTCGCCA TCGCATACGC CGAAAACTCC CCTCTGTCCA CCGCTTTAGA GCATTTCACC GCCATCGCCG CGCGGCGCCA TCGCGCCGAC TGCGACGTCG ACGTTCGCGC CGCGGCTTCC TTCGCCGTCG CCGCCGCCGC TGATTGCAAA GCTCGCGCCT TAACAGCGCT CGAGCGCATC GCGGACGACC CAATCGCGCT CGACGACGCG CGCTCGTCGG CAATCACCAC CCGAATCCCT CGATTAGACG TAAAATTGTA G
|
Protein sequence | MMNEVSDAHA ACRWISELFA ECDGHESSVN CSWRTLDLVL RVLLRDRYWA CEATRFALCE TLLLRKSAPR AALPALLRLT ILRPIRRGAD ATRVKAASDA NIFALLDTWS GEGFVREASV ELQRTLTSSV RAILAALPSE EWDAMRGKPT QMLLKGVSAR LDSPSLRPRR HASKVALELS LKMDASKPLR LVDDGDDVDA SSDEESEWEQ TIESIVQDDV FVVCNDDDGG GGGDDEKESI AANVNLTGSS GFQIKVEDPD EVVDMWSLRR DESDSDASDS DDYSDDELVP YDMDSDDDAT LRSRDPASLT EARIASLPKP QTLRECIGAL RQARSGDAST RQTDIDIADA AEGAVHAVSD IVMRQPHELA SCAADLAVTI LHAQPPTPDS DPLDRARRQG LASVLTVTPG LAGPTIIDHA LSEKCDASQV MDTLSALDMA MTELASPSKA EHLLDASAST RSTARVGIER RFAPKSMEMQ AKGATSAPTR SHLIGECFVA PLLRAAAQRL ERQSAPEYNP DGLDAMVNGH ILYTLGQCAK HAKNATDGPL IGRAVLEFAV TPALADSDQP HLRRSALVAG ALVATSLRDT PVAIAYAENS PLSTALEHFT AIAARRHRAD CDVDVRAAAS FAVAAAADCK ARALTALERI ADDPIALDDA RSSAITTRIP RLDVKL
|
| |