Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24443 |
Symbol | HAC3501 |
ID | 5001511 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 604058 |
End bp | 607393 |
Gene Length | 3336 bp |
Protein Length | 1100 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416932 |
Product | predicted protein |
Protein accession | XP_001417549 |
Protein GI | 145346136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00582257 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTCTC GTCAAGGCGC GCAACAGGGC GGAAGAGGAG GCGGGAGCGC TGCGGCGGCT TCGCGTCAGC AGATGGGAGC GTTGTTCCCA GGGGGATTAG GGGGGAATAT GGCGCAACCC GCGGCGGGAC AGATGATGGG GGGTAACGGT ATGTTACCGA ATGGGGCGCC GATTTTGCGC GGCGCGGCGG CGGCGCAAGG ATCGCAATAC GCGGGAGGAT CTTCGCAAAT GATGGCCGGC GTGCCACCTG GGACGATGAT TCCGACGCAG GGTGTGGGTT CGATGATGCC TGAGGCAGCG ACGGCGGCGG CGCCCAAGGG ACGAGCTAAG GCGCCGCCGA AGGGTAAAAA GCTCACCAAG GCGCAGCAGG CGGCGCAGGC GAAGCAGCTC GCGCAACAGC AACAGATGGC CCAACAGCAA CAGGCGGCGA TGCGCGGGCG CCAGCAACCG GGTAGTGGTG GGGGTGGTGT CGTTGCTAAT TGGCGAGACA TCAATGACGA GCAGTTACGA AAGGCGTACA TCGTGAAACA GCAACGATGG CTTCTCTTCC TTCGACACGC GAGCAAGTGT CAGGCGCGAC ACGGGCACTG CCCCTACACG CCGCACTGTC ACGTCGCGAA GCAGTTGTGG GAGCACGTGT TGAAGTGTAC GCTGACGCAG TGCAACTATC CTCGATGTTT GGCGTCGCGT GAGTTGTTGA AGCATCATCA GACGTGCAAG GACGCTGGGT GTCCGGTGTG CGGTCCCGTG CGCAACGCCA TGCTCAAGCA GCGTCAACAG GCGCAAATGC AAATGGCGCA CGGCAACAAG AGAATGAAGC TCGATCACGA TCTCGATCGT GGTCAACTCA TGGTGAAGGG TACGAGCGGT TTGAAAACGG ATCGCAAGCC AGGTGGCGAA GGGACGTCGT TAATGGAGTG CTTCACTCCG GAAGAAATCC GTACGCATCT CGCCGCCTTG CGTCTCGCCG ACAAGGAAAA GGTGCAAGGT CAAGGTCAAC CGAGCGCTCG TCAGCTTCAG AAGGAAGCTG AGCGGGCGGT GATCAACGCG ACAGAAAGTT CGTGCCGTGC ATGCGGCGTT GAACGATTGA CGTTCGAACC ACCGCCGCTT TACTGCTACA GTTGCGTCGG TCGCATCAAG CGCGGACAAG TATTCCACCA GATGCCTAGC CTCGGGGGCG AGACTCGAAG AGATGCGTGG TGTAACCCGT GCTTCAACGC CATCCAAGGG TATGTCGATG TCGAGGGTCA ACGGTTTCCG AAAGCGACGC TCATCAAGAA GAAGAACGAC GACGATCTCG AGGAGCCATG GGTGCAATGC GACTACTGCG AGGATTGGTA TCACCAACTC TGCGTTCTTT TCAACGGTAG ACGCAATGAA GGCGGTGAAG CGCCATTCAC GTGTCCAAAC TGCATCTTAT CGCAGTTGGA TAAGAACGAA CGTCAAGTTA CTGCTGAACG TCCGTCTTCA CAACAGCCGG CGAGCTCGTT ACCGAAGACA AAGATGTCGA CGTTTTTGGA AGAACGCCTC GCGTCGAAGC TCTCGGCGGA ACGCGTGGAG CGCGCGAAGC AGTTGGGCGT ACCCATAGAA AACGTCCCGA CCGCTGAGAA CTTGACCATT CGTGTTGTGT CTCAAACACT CAAACAAATG GACACCAAGC CGCACTACTA CGCCCACTTC AAAGAACAGG GCATTCCGGC GCACTTTACT TACCGTTCGC GCGTCATCCT CCTGTTCCAA AAGCTTGAGG CCACCGATGT GTGTTTGATG GCTATTTATG TGCAAGAGTA TGATGACGAG TGCCCTGAAC CGAACCGGCG AAGAATTTAC CTCTCGTATT TGGACTCTGT CAAGTACTTC CGACCGGACA ACGTCACCGT GGCCACGGGC GAAAACTGCG CGTTGCGCAC GTACGTCTAT CACAACATTC TGATTGCGTA CCTGGACTAT GTCAAACAGC GCGGCTTCAC GTCGTGCTTT ATTTGGGCGT GTCCGCCGTT CCAAGGAGAC GATTACATTC TGTACTGTCA CCCCAAGGTG CAAAAGACGC CAAAGGCTGA CAAGCTTCGT GAGTGGTACC TCAAGATGTT GCGCTCGGCG CAGAAGGATG GCATCGTCAT TTCAACTTCG AACGTGTACG ATGAATTCCG ACTCGGCAAT CAAAATCACG ATATTCGATG CGCCACTGAG TATCCGTATT TCGATGGTGA TTACTTCTCC GGCATTGCGG AAGATTGGAT ACCGACCATC ATGAAGGAAC TCGAGGAGGC AAAGAACATC GAGGCGAAGA CAAAGTCGTC CACGGTTAAG ATCAGCGCGC GCAAGGCGGC CAAGGCAAAG AGTGGCACGA TCGCCGCTGA CGCTGAACTG AATAAAGAAC TAATGAAGAA GCTCGGGACG ACGATCAGCA ACATGAGAAA CGACTTCATG CTCGCCCATC TTGCGCACCA ATGTTTGTGC TGCCGCAGAA CCATCGCCGG TGCCAACCGC TATTACGCGA CGGAAGGCAC ACCCTTGGTG CTTTGCGAAG AATGCAAGGA GACTGAAGAT GCGATGCCAG AGAACGAAAA ACGTTACGCT GGCCGCAAGC TCGAGTGCGA AAAGTGTGAG GAAATTCCCA CGCTGACGAA GGAACAGAAG GACGAGGAGG AAAAGCTCGA GAGCGAATTC TTTGACACAC GTCAAGCGTT CCTGTCCTTG TGCCAAGGCA ATCACTTCCA GTTCGACTCG CTTCGTCGCG CCAAACATAC GACCATGATG GTGCTGTATC ACTTGAACAA TCCATCTGAG CCTGCATTCG TCGCATCATG CAACGTTTGT TCACGCGAGC TCGAGCCCGG AAAGGGTTGG CGCTGCGAGA CGTGCCCCGA TTTCGATATT TGCGACAACT GCCGCATCAG AACTGGTCAC CAGCACCCCC TCATGCGACA AGGTCGTACC GCCGGAGATC GCACGGCGTT GTCTCAAGCC GAGCGCGAGA ACCGCGCGGC GCAGATCGAG CGAACCATGG AACTCTTGCT CCACGCGTGC AAATGCCGCA AAGAACGATG CGAAAACAGC AACTGCCCGA AAATCAAACA CTTGCTCAAG CACGCCTTGA GCTGCACGGT CAAATCAGCA GGCGGGTGCC AGCTCTGCCG TAAAACGTGG ACGCTGTTGC AAATTCATTC TAAGGGATGC ATGGAGGACG ACTGCCCCGT GCCCAGGTGC CGCGATCTCA AAGAGTACCG TCGTCGCGGT CAAGAACAAA TTGAAGAGCG CCGACGCGAG CAATACAGAC TTTACCTGAA CGCCGCGCGA TGAGCGCGCG ACGATCAGAA CAACACGATT AACGAC
|
Protein sequence | MISRQGAQQG GRGGGSAAAA SRQQMGALFP GGLGGNMAQP AAGQMMGGNG MLPNGAPILR GAAAAQGSQY AGGSSQMMAG VPPGTMIPTQ GVGSMMPEAA TAAAPKGRAK APPKGKKLTK AQQAAQAKQL AQQQQMAQQQ QAAMRGRQQP GSGGGGVVAN WRDINDEQLR KAYIVKQQRW LLFLRHASKC QARHGHCPYT PHCHVAKQLW EHVLKCTLTQ CNYPRCLASR ELLKHHQTCK DAGCPVCGPV RNAMLKQRQQ AQMQMAHGNK RMKLDHDLDR GQLMVKGTSG LKTDRKPGGE GTSLMECFTP EEIRTHLAAL RLADKEKVQG QGQPSARQLQ KEAERAVINA TESSCRACGV ERLTFEPPPL YCYSCVGRIK RGQVFHQMPS LGGETRRDAW CNPCFNAIQG YVDVEGQRFP KATLIKKKND DDLEEPWVQC DYCEDWYHQL CVLFNGRRNE GGEAPFTCPN CILSQLDKNE RQVTAERPSS QQPASSLPKT KMSTFLEERL ASKLSAERVE RAKQLGVPIE NVPTAENLTI RVVSQTLKQM DTKPHYYAHF KEQGIPAHFT YRSRVILLFQ KLEATDVCLM AIYVQEYDDE CPEPNRRRIY LSYLDSVKYF RPDNVTVATG ENCALRTYVY HNILIAYLDY VKQRGFTSCF IWACPPFQGD DYILYCHPKV QKTPKADKLR EWYLKMLRSA QKDGIVISTS NVYDEFRLGN QNHDIRCATE YPYFDGDYFS GIAEDWIPTI MKELEEAKNI EAKTKSSTVK ISARKAAKAK SGTIAADAEL NKELMKKLGT TISNMRNDFM LAHLAHQCLC CRRTIAGANR YYATEGTPLV LCEECKETED AMPENEKRYA GRKLECEKCE EIPTLTKEQK DEEEKLESEF FDTRQAFLSL CQGNHFQFDS LRRAKHTTMM VLYHLNNPSE PAFVASCNVC SRELEPGKGW RCETCPDFDI CDNCRIRTGH QHPLMRQGRT AGDRTALSQA ERENRAAQIE RTMELLLHAC KCRKERCENS NCPKIKHLLK HALSCTVKSA GGCQLCRKTW TLLQIHSKGC MEDDCPVPRC RDLKEYRRRG QEQIEERRRE QYRLYLNAAR
|
| |