Gene Information |
Locus tag | OSTLU_44173 |
Symbol | |
ID | 5004528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | - |
Start bp | 457349 |
End bp | 458800 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | |
GC content | 52% |
IMG OID | 640419949 |
Product | predicted protein |
Protein accession | XP_001420513 |
Protein GI | 145352351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
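The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (both assumptions agree with the values listed).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 457349, End bp 458800.
length = gene_length_bp(457349, 458800)
print(length, protein_length_aa(length))  # prints "1452 483"
```

The result matches the Gene Length (1452 bp) and Protein Length (483 aa) fields.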
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.844643 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAGC GGGCAAAAGC GCTCCTGGAC GTGCGCACAG AGGAGGCGGA GGCCTTGTTG AGTCACTTCT CGTACGATTT CGAAGCCGCC GCGACGGCGT GGTTCGAGGA CACGAGAAAG GTGCGCGAGA CGTCGGGGTT GATCGATGCA AAGACGAGAC GCGAAAATAG CGAAGCAGCG ATGTCGTCGG GTGGAACGCG AGGGTGCGGG ATTTGCTTCG AGGACTTTCC AGGGGATGCT TTGACGACGG TCGGGTGCGC GCATGAATTT TGCGACGAGT GTTGGTCGGG ATGGGTGACG AGCAAGGTGA ATGATGGGCT TTCCGTGGTC AACACGCGGT GTCCTATGTG TCCTGCCAAA GTCCCCGAGT CCATGATTCG AAAGTTTCTT AGTGATGAAG ATGAAACGAA GTTTGATACA TTCTTGCGGC GGTCGTTTTT GGAAAACAAC GCCAAGTTGC GCCCTTGCAT TGGCGTCGAT TGCGAATGTG CCATCGCCGT CGAGCAACTG CCGACCAATC CCGTGAGTGT GAAATGCAAC TGTGGTGCCG AATTCTGCTT TTCGTGCCAG AGTGAGCCCC ACGTGCCTGT GAATGATTGT GAAGTCGCGA AGAAGTGGAT GGACAAAATC AACTCCGACG GTGTGAACTC GGAGTGGATG CTAGCTAACA CGAAGGGATG TCCGAAGTGT CATCGACCGA TCTTGAAGAA TGGAGGATGT ATGCACATGC ACTGCTCACA GTGCCATTGC TCGTTTTGCT GGCTTTGTCT CGGACCTTGG GATTCCGGGC CGTACGCCTG CGCCAGACGC TGCAACAAAT ACAGTGGAGA CAAAACCGGC GACGAAAACA GGCGGAAACG AGCCAGAGAT TCTCTCGAGC GCTACGTGTT CTACTATGAA CGCTATAGAG CGCACGAGGA TGCGAGTAAA AAAGCCGAAC AAGACGTCGA GAGATTCAAA GACAGCGTGC TTGACATATT GATCGATTTA CAGCGTACGT CCAAGCAACA AGTTGTTTTC ATCATGGATG CGCTCAGGCA AGTGACCGAG TGCAGGAAAA TTTTGAAATG GACTTACGCG TACGCGTATT ACGAATTTGC CGACGATCAG AGCAAGAAAG AGTTCTTTGA GTACATTCAA GGTGACATGG AGCGTTGTCT CGAGCTCCTG TCTCGCATGA TTGAATCAGA CATCAAACCA TTCCTTCCGC CAGAGCCGGA AGATGATGAA CAGAAACAAA ACGTGTCGCC GCCGTCGACG CTAACTGATG AACTTCAAGA TGGGAAATAC CAGTACGCAC CCGAAAAGCA AGAGAGTCTG GAAAACGACT TTGCCCTATA CAAAGCTCGA CTCATCGACA CCACTGCCGT GTTGCGCAAG TTCACGGATA CGTTGGTTTC GGAAATGGCT AAAGGTTTGC TCGGCGCTAG AAACATAGAC AAAACTGATT GA
|
Protein sequence | MIERAKALLD VRTEEAEALL SHFSYDFEAA ATAWFEDTRK VRETSGLIDA KTRRENSEAA MSSGGTRGCG ICFEDFPGDA LTTVGCAHEF CDECWSGWVT SKVNDGLSVV NTRCPMCPAK VPESMIRKFL SDEDETKFDT FLRRSFLENN AKLRPCIGVD CECAIAVEQL PTNPVSVKCN CGAEFCFSCQ SEPHVPVNDC EVAKKWMDKI NSDGVNSEWM LANTKGCPKC HRPILKNGGC MHMHCSQCHC SFCWLCLGPW DSGPYACARR CNKYSGDKTG DENRRKRARD SLERYVFYYE RYRAHEDASK KAEQDVERFK DSVLDILIDL QRTSKQQVVF IMDALRQVTE CRKILKWTYA YAYYEFADDQ SKKEFFEYIQ GDMERCLELL SRMIESDIKP FLPPEPEDDE QKQNVSPPST LTDELQDGKY QYAPEKQESL ENDFALYKAR LIDTTAVLRK FTDTLVSEMA KGLLGARNID KTD
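The sequences above are displayed in space-separated blocks of 10 characters. The following sketch shows one way to strip that grouping and compute a GC fraction with the Python standard library. The helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence, so its result is not the record's overall GC content figure.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an ungrouped DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence, used only as a small example.
example = clean_sequence("ATGATCGAGC GGGCAAAAGC")
print(len(example), gc_fraction(example))  # prints "20 0.55"
```

Applying the same two helpers to the full Gene sequence field gives a value that can be compared with the GC content row.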