Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_42867 |
Symbol | |
ID | 5003471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | + |
Start bp | 384044 |
End bp | 385183 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | |
GC content | 54% |
IMG OID | 640418892 |
Product | predicted protein |
Protein accession | XP_001419147 |
Protein GI | 145349453 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGCTTAC CGACGTTCGT GGACATACAC ACACATATAG ATAAGGCACA CACGTGTGAG AGGAGCCGAA ATTTAAACGG CACCCTCGCT GGCGCGGACG CCGCGTGCGC GAACGATTTC CAACACTGGA CGCTCGAAGA CGCCAAGCGG CGCATGGGCT TTGCGATTCA GTGCGCGTAC GCGTACGGGA CCTCCGCCAT GCGCACGCAC TTGATGTCGG GTGAGGAGAG ACAGAGCAAA ATCGCCTGGG AAGCGTTCGG GAAGTTACGC GAGGAATGGA GAGGGAAAGT TGAGTTGCAA GGCGTGTCAC TCTCGGTGTT GAGCTTTTTC CGCGACGAGA CCAAGGCGCG CGCGCTGGCT CGCATGGTGA AATCTTATGG AGGTATACTT GGCGCCGCCG TGTCGTGTAG CGATGCAGGT GGTACACCAC TGGATGTCCA CACGACGTGC GGTGCCGATA TGCCCAAGTT ATTGGACGTC ATTTTCTCTC TGGCGAAGGA ATACAATTTG GATGTCGACT TTCATTGCGA CGAGAACGGC AATGAATCCT CCAAAGGCTT GCTCCACATT TCTGAGGCCG TCATTCGAAA CAATTTCAAA GGATCTGTGG TGTGCGGTCA CTGCTGCAGT CTCGCAGTTC AGCCAAACGA ACAGGCAAAG CGCATCATCG ACGCCGCCCG CGAAGCGGGC GTCACCGTTG TTTCGCTTCC AATCGTTAAC CAATGGCTTC AAAATCGTGA TCCAACGAAC GAAGCGACGC CAACGCGGCG AGGCGTTACT CGGGTGAAGG AGCTTGCCCG AGCTGGCGTG CCGGTGTGCC TTTCGAGTGA TAATACGCGA GATCAATTTT TTCAATACGG TGACTGTGAT ATGCTCGAGG TATTTCGATC GAGCGTTTGC ATCGCGCATC TCGATCGACC TTTTGGATCG TGGCCGTTGG CTTTAGCTGC GAACCCATCG CGCGCGATGC GACTTGGCGA AAAGTCGGGA ATGATTGCTC CAGGAGCGAA GGCAAACTTT GTCTTGTTTC GCGCTAGGAA TTACAGCGAG CTGTTTTCTA GATCGCAACA TGATCGAGTC GTCATTCGTG ATGGTGTCCG CATCGCTACT GCTTTACCAG ATTATGAAGA GCTAGACTAA
|
Protein sequence | MCLPTFVDIH THIDKAHTCE RSRNLNGTLA GADAACANDF QHWTLEDAKR RMGFAIQCAY AYGTSAMRTH LMSGEERQSK IAWEAFGKLR EEWRGKVELQ GVSLSVLSFF RDETKARALA RMVKSYGGIL GAAVSCSDAG GTPLDVHTTC GADMPKLLDV IFSLAKEYNL DVDFHCDENG NESSKGLLHI SEAVIRNNFK GSVVCGHCCS LAVQPNEQAK RIIDAAREAG VTVVSLPIVN QWLQNRDPTN EATPTRRGVT RVKELARAGV PVCLSSDNTR DQFFQYGDCD MLEVFRSSVC IAHLDRPFGS WPLALAANPS RAMRLGEKSG MIAPGAKANF VLFRARNYSE LFSRSQHDRV VIRDGVRIAT ALPDYEELD
|
| |