Gene Information |
Locus tag | OSTLU_38443 |
Symbol | |
ID | 5001766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | + |
Start bp | 589432 |
End bp | 592392 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | |
GC content | 51% |
IMG OID | 640417187 |
Product | predicted protein |
Protein accession | XP_001417814 |
Protein GI | 145346684 |
COG category | [R] General function prediction only |
COG ID | [COG1444] Predicted P-loop ATPase fused to an acetyltransferase |
TIGRFAM ID | |
|
|
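The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The record itself states neither convention, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that adds no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (OSTLU_38443).
length_bp = gene_length(589432, 592392)
print(length_bp)                  # 2961
print(protein_length(length_bp))  # 986
```

Under these assumptions, 2961 bp corresponds to 987 codons, and dropping the stop codon gives the listed 986 aa. The shortcut applies only because the record marks the gene as not spliced.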
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.856005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.626674 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAGA AAGTGGACGC GCGCATACGC ACGCTCGTGG AGAATGGGGT GAAGAACAAC GAACGCACGA TGTTCGTGCT CGTGGGCGAT CGAGGACGAG ATCAAATCGT CAACTTGCAT TACATGCTGA GTAAAGCGGT GGTGAAGTCG AGGCCGAACG TGCTGTGGTG CTATAAGAAG GATTTGTACC TGTCTTCGCA TAAAAAGAAA CGAGCGAAAC AGATCAAGAA GATGCAAAAC TCTGGCTTGA TGGATTCGGA GAACGAGGAT CCGTTCTCAC TGTTCGTGGC GTCGACGAAT ATTCGGTACT GTTATTACGC GGATACGCAG AAAATTTTGG GGAACACGTA CGGGATGGCA GTGTTGCAAG ATTTCGAGGC GCTGACACCG AACTTACTGG CACGAACGAT CGAGACCGTG GAGGGTGGAG GGTTGATTGT GCTGCTGTTG TCGAACATGG AAAGCTTGAC GCAGCTGTAT AACATGACGA TGGACGTGCA CTCGCGTTTT CGCACAGAGG GCCACCAAGA CGTGGTCGCT AGATTTAATG AGCGATTCAC GTTGAGTTTA GGGGCGTGTC AGACGTGCAT CATGATGGAT GATGAGTTGA ATATTTTGCC AACGAGTTCG CATATTAGAG GCATAGAACC AGTTGAAGAA AAGCATGCGG GAGAGGCCCA AGAGCTTACC GATCTGAAGG AGAGCATGGA GGAAGTTGAG CCGGCTGGCC CGCTTGTGAA GTGCTGTAAG ACTATGGATC AAGCCAAAGC AGTGGTGACG TTTCTCGACG CTGCGAGTGA GAAGACCCTC CGATCAACAG TTGCGCTCAC CGCCGCGCGC GGTCGCGGTA AATCTGCTGC CATGGGTATA GCTGTCGCAG GAGCCGTCGA AATGGGTTAT GCAAACATTT TTGTCACTTC TCCCAGTCCT GAAAACTTGA AGACATTCTT TGAATTCATT CTCAAAGGCT TCGATGCGCT TGGATACAAA GAGCACTTGG ATTACGATCT CGTAGAGTCC ACCAACCCCG CGTTTGGAAA GTGTATTGTG CGTGTCAACG TCACGAGACG TCATCGACAA ACGATTCAGT ACATTTTGCC TCAACACGCA GAGCGTGCGA CGCAAGCCGA GTTACTCGTC ATCGATGAAG CAGCCGCCAT TCCGCTGCCT ACGGTAAAAG CGTTGCTCGG GCCGTATCTC GTGTTCCTTT GTTCTACAAT CAACGGTTAC GAAGGCACAG GGAGGGCGTT GAGCATCAAA CTCATCGGTA ATTTGCGAAG GGAAGCGGCA ACGTCACAAC AGGCGAACAA GGATGGGAAG CTGGCGACAG GTGGATCTCG ATTGCTCAGA GAAGTAGCTC TTGCTGAACC TGTTCGTTAC GCTGCTGGAG ATAGGATCGA AAAATGGTTG AATGATTTGT TGTGTCTCGA TGCGGCGGAT GCATTAACGC CGCTCATCCA CGCGCTGCCT CCGCCCAGTG CATGCGAACT TTATGAGGTG TCGCGTGATA CCTTATTCAG TGCGCACGCC GCAAGCGAGC AGTTCTTAAA GAAAATGATG GCACTATATG TTTCTAGTCA CTACCGCAAC ACGCCGAACG ATTTGCAGCT CATGTCTGAT GCGCCGGCTC ACCGCCTCTT TGTTCTCTTG GCGCCGGTGG ATGAAACACG AAACATGTTG CCAGAGATTC TTTGCGTCAT TCAGGTAGCT TTAGAGGGTG CGATCTCGCA GAAGAGTGCG CACGCAACAC TGGCTGCGGG GTTATCTCCA CAAGGTGACC TCATTCCGTG GACCATGGCA 
TCGCAATTTC AGGATGAAGG CTTTCCTTCC TTCACGGGGG TGCGCATTGT TCGCATCGCC GTGCATCCGG ACTTGCCTCG TCAAGGTTAC GGTTCTCGTG CGTTGCAGCT CATCCACGAT TATTACGAAG GAAAATTGGC TGATTTACGC GAAGAAGAAG TGTCGAACAT CGACGCCAAG ACGAATACTG AGCACTGGCA GCACACAGGC AAGCTCACAG AGGAGACACT CAAGCCTCGC GAGAATTTGC CGCCGCTTTT GACAAACCTC TCAGAGCGCA AGCCAGAGAG GGTGAACTGG ATCGGCACCG CATTCGGACT CACGTCCGAA CTTTACAGCT ACTGGAGCAA GGGTGGCTAT AAGCCTGTGT ACCTGAGACA AACGTCAAGC GAGACTACGG GCGAACACTC GTGCATCATG TTACGCCCTT GCTTTCCAGT TGGCGAGGAG GAAAGCCCGG ATGGTCACTG GGTGGATGCG TTCTACGAAG ATTTCCGCGT GAGATTCACT TCTTTGATGG GATCTGCTTT TAGAGACCTT GCTCCAGGTC TATGTTTGTC TTTGATGGCG CCAAAGCTAA ACTGGGACGA AAAACAAGGC GCGGGGCGCG ACTCCGTCTT GAAGGCGGAC AATGAGATTT TGAGCCCGCA CGATTTACGA CGCATCCAGA AATACTCAGC CGCTTTGGTG GACCATCATT TGATTGGTGA TTTGGTCCCC CCACTCGCGC GCGCATACTT TGCAAAACGC ATTCCCGTGA CGCTTTCGTA CACGCAAGCT GTGATTTTGC TCATCCTTGG TCTGCAACTG AAGACCATCG ACGACGGAAT GAAAACTTTG GACTTGCCCG GACAACAAAT AATGGCATTA TTCAACAAAG CTATTCGTCG GATACACGGC TCACTTCTCA AGGCACGCGA AGCCGATATC GAGAACTCGC TGCGTTCTGT GAGTATGCCA GATTTGAGAC CCCATGCGGT TGGATTAGAT GAAGAATTAG ATGAAGGGTA CGTCGACTTC AACTTTTTTT CGCGTTCGCC CGACTCGTGT CTCGCTCCAA AGATGATGGA AGTGATCGAA AACAATATAG TTCCAGAATT TCACGCTCTT TGGCGCCACC GTTTAGGCGG GGTTATAAGT CATAGAGTTC TATGTTGGTA A
|
Protein sequence | MRKKVDARIR TLVENGVKNN ERTMFVLVGD RGRDQIVNLH YMLSKAVVKS RPNVLWCYKK DLYLSSHKKK RAKQIKKMQN SGLMDSENED PFSLFVASTN IRYCYYADTQ KILGNTYGMA VLQDFEALTP NLLARTIETV EGGGLIVLLL SNMESLTQLY NMTMDVHSRF RTEGHQDVVA RFNERFTLSL GACQTCIMMD DELNILPTSS HIRGIEPVEE KHAGEAQELT DLKESMEEVE PAGPLVKCCK TMDQAKAVVT FLDAASEKTL RSTVALTAAR GRGKSAAMGI AVAGAVEMGY ANIFVTSPSP ENLKTFFEFI LKGFDALGYK EHLDYDLVES TNPAFGKCIV RVNVTRRHRQ TIQYILPQHA ERATQAELLV IDEAAAIPLP TVKALLGPYL VFLCSTINGY EGTGRALSIK LIGNLRREAA TSQQANKDGK LATGGSRLLR EVALAEPVRY AAGDRIEKWL NDLLCLDAAD ALTPLIHALP PPSACELYEV SRDTLFSAHA ASEQFLKKMM ALYVSSHYRN TPNDLQLMSD APAHRLFVLL APVDETRNML PEILCVIQVA LEGAISQKSA HATLAAGLSP QGDLIPWTMA SQFQDEGFPS FTGVRIVRIA VHPDLPRQGY GSRALQLIHD YYEGKLADLR EEEVSNIDAK TNTEHWQHTG KLTEETLKPR ENLPPLLTNL SERKPERVNW IGTAFGLTSE LYSYWSKGGY KPVYLRQTSS ETTGEHSCIM LRPCFPVGEE ESPDGHWVDA FYEDFRVRFT SLMGSAFRDL APGLCLSLMA PKLNWDEKQG AGRDSVLKAD NEILSPHDLR RIQKYSAALV DHHLIGDLVP PLARAYFAKR IPVTLSYTQA VILLILGLQL KTIDDGMKTL DLPGQQIMAL FNKAIRRIHG SLLKAREADI ENSLRSVSMP DLRPHAVGLD EELDEGYVDF NFFSRSPDSC LAPKMMEVIE NNIVPEFHAL WRHRLGGVIS HRVLCW
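The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any calculation. The following is a minimal sketch, not the database's own method, of how a GC fraction such as the one listed in the gene table could be recomputed from the gene sequence. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in an already normalized nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two ten-base blocks of the gene sequence shown above (demo only).
demo = normalize_sequence("ATGCGGAAGA AAGTGGACGC")
print(demo)               # ATGCGGAAGAAAGTGGACGC
print(gc_fraction(demo))  # 0.55
```

Applying the same two functions to the full gene sequence, with the line break inside it removed by `normalize_sequence`, would give a value to compare against the listed GC content.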