Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_43066 |
Symbol | |
ID | 5005458 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 501888 |
End bp | 503783 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | |
GC content | 54% |
IMG OID | 640420879 |
Product | predicted protein |
Protein accession | XP_001421483 |
Protein GI | 145354420 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0251063 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.287268 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATTC ACATCAGCGG TATTCAGCTC TACGGCGGAC GCGATGAGTT GATTCAGGAA GGGGTTTTGA AGTTAGTGTT CGGGACGAAG TACGGTTTGG TCGGTCGAAA TGGATGCGGC AAGTCGACGC TGTTGCGAGC GATTTCCGAA CGCGTCATCA AGTTGCCAGA ATTTTTGCAC ATCATTCACG TCGAGCAAGA GGCGTCACCG GACGAGCGGT CCGCGTTGCA GACCGTCGTG GAGACCGACA CCGAGCGTCT GTACTTGCTC AATCTAGAGA AGCGAATGTT GGACGAAGAA CTCGATCAAA TCGATGGCAT CGACTTGAAC GAGGTGTACG AGCGCCTGGA CGAAATAGAC GCCGACACGG CGACGGCGCG CGCGGGACAA ATTCTGGGCG GTCTCGGTTT CGATCCCGAG GAGCAAATGA AGGCGACGAA AGAATTTTCC GGTGGTTGGC GCATGCGCAT CGCGCTCGCG GCGGCGCTGT TCATGACGCC GGATTTGTTG TTGCTGGACG AACCGACGAA TCACTTGGAC GTGCACGCTT TGACGTGGCT CGAAGAGTTT TTACGAAAGT GGGAAAAGAC GGTCCTCATC GTGTCGCACG ATCGCGGCTT TTTGAACGAT TGCACGACGG CAACAATTTT CTTGCACCAC AAAAAGTTGC GCTACTACGG CGGATCCTAC GATACCTTCC TCAAGGTGCG CGCTGAGCAC CGAGCCAACG AGGAGGCGAT GCAGCGAAAC CAAAGTTTGC GAGAGTCATC TCTGAAGCAA TTCATTCAGC GATTTGGGCA AGGTCACAAG AAAATGGTGC GTCAGGCACA ATGTCGCATG AAGATGCTCG AAAAATTACA GAGCGAGCGG GTGGACGTGG ATTACGATGA TCCGTATCTA CGCATCAATT TCCCATCCGC GTCGCCGTTG CCGCCACCGT GCATCTCCGT GATGAACGTC GCGTTCGGTT ACGAAGGCTA TCAAACCTTG TACCAAAACT TAGACTTTGG GTTAGACATG GATAGCCGAG TCGCAATCGT CGGACCGAAC GGGGCTGGTA AATCGACGTT TTTGAAGCTT CTCGAGGGTG ACATTTTGCC CACCAAAGGT TGGATTAATC GTCACACCAA GCTTCGGCTA GCGCGTTTCT CTCAACATCA CTTGGAGACG ATGAACTTGG AGGAAGATTG CGTAGCGCAC ATGAAGAGAC TAGACAGCGA AATGCCTATA GAGACGGCGC GAGCGTACTT GGGTCGATTC GGCCTGTCTG GCGAGCTCGC GACAAAGCCG ATTAAAGTTT TGTCCGGCGG GCAAAAGTCA CGTCTAGCAT TTGCCGAACT CGCGTGGAAG CAACCTCACA TTCTCCTTTT GGATGAACCT ACTAACCATC TTGACTTGGA AACGATCGAG TCACTTGCCA TGGCGCTCAA CAACTTCGAA GGCGGCGTCG TGTTAGTCTC GCACGACGAG CGTCTCATTT CGCTCGTCGT CGATGAAATT TGGATTGTCA CGAAGGGCGA CATGAAATCC AATCCTCCGG TCCCAGGAAG CGTACAAGTT TTCAATGGAT CGTTCGACGA TTACAAAGCC AAGTTGCGCG AAGAGTTTTC GGGCGGCAAC TTGTTGTCGG AGAAACGAAA AGCCGAAAAG AAGAGGGGAG GTGGGCGTGC GCCCGAACCC GAACCCGAAC CCGAGAAGGC GCCCGCACCG GCGAAGCCTC CCGGAAAGAT CATGATGGAC AGTGCGTTCA CAAAGTCCAC GTCAAACGAT ATCTCCAACG AACAAGATCG CCCGACTTCG TCCCAGGTGA AATGGGTTCC GCCTCACTTG CGAGGCGCCG CGCAACAAGA CCAAGAGAAC GCGGGCAGTG CCGACCAAGC GTGGGGCGAA GAATGA
|
Protein sequence | MDIHISGIQL YGGRDELIQE GVLKLVFGTK YGLVGRNGCG KSTLLRAISE RVIKLPEFLH IIHVEQEASP DERSALQTVV ETDTERLYLL NLEKRMLDEE LDQIDGIDLN EVYERLDEID ADTATARAGQ ILGGLGFDPE EQMKATKEFS GGWRMRIALA AALFMTPDLL LLDEPTNHLD VHALTWLEEF LRKWEKTVLI VSHDRGFLND CTTATIFLHH KKLRYYGGSY DTFLKVRAEH RANEEAMQRN QSLRESSLKQ FIQRFGQGHK KMVRQAQCRM KMLEKLQSER VDVDYDDPYL RINFPSASPL PPPCISVMNV AFGYEGYQTL YQNLDFGLDM DSRVAIVGPN GAGKSTFLKL LEGDILPTKG WINRHTKLRL ARFSQHHLET MNLEEDCVAH MKRLDSEMPI ETARAYLGRF GLSGELATKP IKVLSGGQKS RLAFAELAWK QPHILLLDEP TNHLDLETIE SLAMALNNFE GGVVLVSHDE RLISLVVDEI WIVTKGDMKS NPPVPGSVQV FNGSFDDYKA KLREEFSGGN LLSEKRKAEK KRGGGRAPEP EPEPEKAPAP AKPPGKIMMD SAFTKSTSND ISNEQDRPTS SQVKWVPPHL RGAAQQDQEN AGSADQAWGE E
|
| |