Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_16043 |
Symbol | |
ID | 5002652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009361 |
Strand | + |
Start bp | 177856 |
End bp | 179784 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | |
GC content | 61% |
IMG OID | 640418073 |
Product | predicted protein |
Protein accession | XP_001418631 |
Protein GI | 145348388 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.168876 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCATCC GCGGCCGCGC GCGATCGATA TGCGTCCTTC TCGCCGCGCT CGCACTGACG ACGAGCGCGT ACGCCGCGCC GACCTCAGCG CCGCGCGCGC GCAGCCGCGC GACGCCCGCG CCCGCGACGT CTCGTCCCCG CGATTCCCCG GACATCGAAA CATCGAACAT CGACATGAAC GACGTCGAGG CGTTTCTCAC GCAAGCGCGC GATGAATTCA TCGCGACCCG CGCGTCGCGC GGGTCTTCAC ACGGCCGCTC GACGACGACG TCTCCGGACG AAGCCGGTTT GGACTGGATC GCGTCGTTCT TCTCGCCTCG GGAACGTCGC CGCGGCTCTG CGGGTCGCGC GGCGCGTAAT CGACGCGACT CGAACTTGCC TCCGGTTGAA GTGGATCAGA CCGGCGTTGA CTTCTACCTC ACCGCGCGCG CGTACGCGCA AGAACACGCG GTGAGCGATC CTCGGGGCGG TCGGATGCGC ATCGCTCGCG CAGAGGAACC GAATCAGTTT CAGTGGTGGT ACGATTTCAT TTGGCAAACG CAACAACAAA TCATCGCTTG CGAAGGACGA GAATCGTGCG AAGGTCTTTG GCATCAGTTC GGCGCGCAAG GCAGCATTTG TTGTGACGCC GGTGCCGAAT TTTACGGTCA CGCTCACTTT TCGTGCGTCG AAAGTGTCGA ACAATGCGAG GCGCTCACCA CGTGCGCCTC GTCCGCCGAT TGCGGCGTGA ATCAAGTGTG TTGCGCGACG AAGCCGCATT CTGACGATTT GATGTGCGTG ACGAGCTTTC AAGACTGCGC GGCGTACTGT CACTCCGATG CGCAGTGTAA AGCGGAGAAA GGCGAACAGT GTTGCTACGA CGAAGTACTC GGATACACGA TTTGTATCCC GGAAGGGCTA TCGTGTCCAC CGCCCCCGCC AGAGTGCCCG ACCACGGGAC AGCCGACGTG CCGCTCGGAC TCCGAGCGCA CGTGTTGCGG TGGCGTCTGT TGTCCGCCGG ATGAAAACGG TGTTGAATGG TTGTGTTGCA GAATCTGTGA CGAAAATGTT TGCTACCGAG CTACCCCCTC TCTGGGGGAC GGAAGCTACG AGTGTCCGGA CCCATTTTGC CCGCGGCCTC CAGAGTGTTC GTCGCAAGCA GAGTTGAGTC AGTGCACGGA TCCGGTGAAC CCGATCGTCG CCGCGCAATC TGCGCGCAAT GGCGATCCTC CGACCGGTAG CATTTGCTGC GGTGGTGTGT GCTGTGAGAT CGGGGACCCC GATCTGGTGC CCATCGGCAC ATTCGGCCCG AACTTTTGCT GCTACGATTT CCCGGACGGT CCTTCTGGTT GGAGTTGCCA ACCAGGCATC CCTGGTGACC CGCTTCCGAC TCCCCCGCCC GGTTGCGCGC GACAACCTCC CAGCGCTCAG TGTCCGGCCG GTTCTGAGTA TTTGGATACG TGCGTTGCTG ACGATCAGTC TATTGGCGTG TGCTGTGGAC CGGAAGTTGA CGGCGGCGGG CTGACATGCT GCCCTGATGA ATCCGTGTGT TGCGCCAACG TCGTGGACGG AGCGACCGTC GGTTACGAAT GCAAAGCCCA GAATGAGTGC CAAGAGGGTG AGTTATGCAG GATACTCGGT GACTGTCCGA ACTCTCAGCA ATACTCGGTC TGTGGTTCGT GCGGACAGGG CAATAATGAC TGCGCGCTGT CCTGCCAAGC GGGCGCAGGT GAGCCTCCGA ACGACCCTTC GTGGATTTGT TCCACCGTCG ATGGTTGCGA CCCAGTCGCC GACTACAACA ACGGCCCGAG CGATGCCACG TGCGTGTGTA ATAACGGAGC GTGCGGCGGC GCGACTCAAT GTAAGACAGG AGGGAATTGC TGCGCGTGCC AAGCTCGTGG CCCGCGCGGT GGCGTGCAAT GCGACAACCC ACCAACCGTA TTGGTTTGA
|
Protein sequence | MTIRGRARSI CVLLAALALT TSAYAAPTSA PRARSRATPA PATSRPRDSP DIETSNIDMN DVEAFLTQAR DEFIATRASR GSSHGRSTTT SPDEAGLDWI ASFFSPRERR RGSAGRAARN RRDSNLPPVE VDQTGVDFYL TARAYAQEHA VSDPRGGRMR IARAEEPNQF QWWYDFIWQT QQQIIACEGR ESCEGLWHQF GAQGSICCDA GAEFYGHAHF SCVESVEQCE ALTTCASSAD CGVNQVCCAT KPHSDDLMCV TSFQDCAAYC HSDAQCKAEK GEQCCYDEVL GYTICIPEGL SCPPPPPECP TTGQPTCRSD SERTCCGGVC CPPDENGVEW LCCRICDENV CYRATPSLGD GSYECPDPFC PRPPECSSQA ELSQCTDPVN PIVAAQSARN GDPPTGSICC GGVCCEIGDP DLVPIGTFGP NFCCYDFPDG PSGWSCQPGI PGDPLPTPPP GCARQPPSAQ CPAGSEYLDT CVADDQSIGV CCGPEVDGGG LTCCPDESVC CANVVDGATV GYECKAQNEC QEGELCRILG DCPNSQQYSV CGSCGQGNND CALSCQAGAG EPPNDPSWIC STVDGCDPVA DYNNGPSDAT CVCNNGACGG ATQCKTGGNC CACQARGPRG GVQCDNPPTV LV
|
| |