Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_18607 |
Symbol | |
ID | 5006096 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | - |
Start bp | 391843 |
End bp | 394185 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | |
GC content | 56% |
IMG OID | 640421517 |
Product | predicted protein |
Protein accession | XP_001422056 |
Protein GI | 145355619 |
COG category | [A] RNA processing and modification [D] Cell cycle control, cell division, chromosome partitioning [L] Replication, recombination and repair |
COG ID | [COG5049] 5'-3' exonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCA CTGGGTATAA CAAATACCTC CAACGCGAAT TCAACGGCGC GTTCCTGCGC GGAGGACGAC ACAAACGGCG GCCGAAGCGG TACGATCACG TGTACGTGGA CGTGAACAAT CTTCTGCACG TCGCGGCGCA CAACACGAAC AGCGAGCGGT CGTTTTTTAA AAAGTTGTTC ACGCTGCTGG ATAACAGGTT GACGAAGACG AACCCGAGGC ACAGCGTGAC GTTGGCGCTG GATGGACCGG CACCGATGGC GAAGACGATC ACGCAGAGAC GACGGAGGAT TCGACTGAGT GCGGGGGCGG CGACGCCGCT GAGCGATGAT ATGAGCAAGC TGTTGAAGAT TGGAATCACG CCGGGGAGCG TGTTGGCGCT GAAAATTGAC AGGGCGTTGG AGTATTACGT GGCGCGGAGG ATGTTGCGAC GCGATCACGC GGGTTCGCCG GCTGATAACG TGCTGTACGA GATATCGAGT ATGCGCGTGG CGGGAGAGGG GGAGATTAAA CTCGTCAAGT CGATTCAACA GCGATTGCAG AACCCACGGT TTCAAGGGCA CTCGCACTGC ATCGTGACGG AGGACAGCGA CGCGTTGTTG CTCGCGATGA CGCTCTTCGG GCAAGGTCAA AAGACGCGAT ACGCGTCAAA CGAGGAGTTT CAGGTGTACG TTTTGAGTGG AAACGTAGTT TTCAGCGCGC GGCTGTTTGA TCAGTTGCTG CTGCAATCGT TACCGAAGGG TGCGTCGCTC GACAGCGCCA GACGCGACTT CATCGGGTTG TCGGCGATGA TGGGGAACGA TTACATCACC GGCAGCAAGC TCGGGGCGAA GACGAGCTGG AAAGCATACT TGGAAATGCG AGGTACTTAT CTCTATCGCG ATGATCCGCT CTTTCCGATG CCTGCAAATC AAGAGCTAAG CGCACAAGCA AAGCCGGATG GCGCCGGATA TAAAAAGAAG CAAAAGAGCG CGACGGGAAT TCAAACTTCG GTGAATTGGG CGTTCTTAAA GCAACTGTCC TTGAAACTTG CCGATACATC GTATGCGGCA AAATCGGCAA GTAATGCGCT GGCAAGTTCT TCTAACCCCG CTCAGAATGA CGTCAAGAAG AAGCGCGTGT ACGACTACTT GTACGGCATC GAGTGGATGC TCAACATGTA CTACCAAGGA GAATGTACCG ACTTTAGCTT TTACACGTAT ACGCAAGGAC CGGATATGAT GGATTTTGCG TCAATTGGTG ACGAGTATGA CGTCTCATGC GACCCGCTCC GCGATCTCAA GCGCGCGGAG GCGAGTTTTT ACAATTTGCG ACCGATCACG CCCTTGGCGT ACTCGCTCGC CGTGATTCCC CGAGGCGGTA GGGCGCAGAT ATCGAAAAAT GTTCGCCAGC TCGTCGACCC TGGATCGCCC ATACGGGAAC TGTTTGCGCT GGACTACTGT CCGCAGTGCA TCAATCATCG CATTCACGTC TCCCCGATGG AGAACGCGTT GCAAAACTCG CTCACGGCTG CGGATCCCGG CTTCGCCGAG GTAGTGCCAT CCAACGCACA ATTTTCAAAG TACTCTAGAT ACACTGACGA CGAAGGATAC ATCATTCACC CCGACACTGG AGAGTATATG TCCATGGATG AAATGCGTCA AGAGGTGAAG GAGCTCAATC GCATGCATTT GCATCACTTG CACACCGCCA AGCAACACGT GCACACGGAT CCCATTTGCT TGCCCACGCT CGAAGCGGCG GTGGCGCGAG CGAGCGCGGA CAATCTGCTC ACCGAAGACG AAGAGATGCT GCGGACGTTG GCTTCCCCTG TATTGTTTTG GCGTCACGCC GTGTACGACC CTAAGGATTT GGACACGCGC GACTTCGCGA GTGAAGAAGA GCTCACCGAG TGGCGCTCGA AAACCATCCC TGACTCTCAA TTCGAGATTC TCGACAAGCG CAGCGTTTAC GAGCTCAGAA AGTTTGATGG TGACGTCGGT GATGTTATGC GGCGCTGGGG TTGCGATAAC GACGCGTTCG CGCGATTCGA CAGCGAGATC GCCGAAGGAC GAACTCACTC GCGACAGCAC ACCGTCGGCA AGACTCGAAG CGCGTTGGAC GAGCGACTGC AAAAGTGGCG TAGCGAGCGT CGTGGCGTTG GTCGTGTAGA TGACGCGAAA TCTACGAGCG ATATCAGCAT CAAAACGAAC GATGCCAACG GCGCGGCGTC GTCTTTGGCA CCGAACCGCG CTCCTAAACC GCGCCCAGGC GGCGCAGGCA GCAAGCGTCG AAATTTCAGC AAATCTCCTC GCGCGCCCTC AGCTTTCGCT CGCGTCGCCA CGGCGCACTC GCGTGTATTC TAA
|
Protein sequence | MGITGYNKYL QREFNGAFLR GGRHKRRPKR YDHVYVDVNN LLHVAAHNTN SERSFFKKLF TLLDNRLTKT NPRHSVTLAL DGPAPMAKTI TQRRRRIRLS AGAATPLSDD MSKLLKIGIT PGSVLALKID RALEYYVARR MLRRDHAGSP ADNVLYEISS MRVAGEGEIK LVKSIQQRLQ NPRFQGHSHC IVTEDSDALL LAMTLFGQGQ KTRYASNEEF QVYVLSGNVV FSARLFDQLL LQSLPKGASL DSARRDFIGL SAMMGNDYIT GSKLGAKTSW KAYLEMRGTY LYRDDPLFPM PANQELSAQA KPDGAGYKKK QKSATGIQTS VNWAFLKQLS LKLADTSYAA KSASNALASS SNPAQNDVKK KRVYDYLYGI EWMLNMYYQG ECTDFSFYTY TQGPDMMDFA SIGDEYDVSC DPLRDLKRAE ASFYNLRPIT PLAYSLAVIP RGGRAQISKN VRQLVDPGSP IRELFALDYC PQCINHRIHV SPMENALQNS LTAADPGFAE VVPSNAQFSK YSRYTDDEGY IIHPDTGEYM SMDEMRQEVK ELNRMHLHHL HTAKQHVHTD PICLPTLEAA VARASADNLL TEDEEMLRTL ASPVLFWRHA VYDPKDLDTR DFASEEELTE WRSKTIPDSQ FEILDKRSVY ELRKFDGDVG DVMRRWGCDN DAFARFDSEI AEGRTHSRQH TVGKTRSALD ERLQKWRSER RGVGRVDDAK STSDISIKTN DANGAASSLA PNRAPKPRPG GAGSKRRNFS KSPRAPSAFA RVATAHSRVF
|
| |