Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37270 |
Symbol | |
ID | 5001269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 798752 |
End bp | 801655 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | |
GC content | 56% |
IMG OID | 640416690 |
Product | predicted protein |
Protein accession | XP_001417607 |
Protein GI | 145346253 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0013] Alanyl-tRNA synthetase |
TIGRFAM ID | [TIGR00344] alanine--tRNA ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.0476971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTGT GCGAGCAAGC CGCGACGTAC GATCCTAGTG GACCGAAGGA GTTGCCGCGG CCGGTCAAGT ACGTAGAAGA ATGGCCGTCC TCGCGCGTCA GAGATACCTT CGTCAACTAC TTCATCGAAA AGCAGGGCCA CGTGAACTGG CCGTCATCGC CAGCGGTGCC TGTCAACGAC CCCACGCTTC TCTTCGCAAA CGCGGGCATG AATCAATACA AACCGATTTT CCTGGGCAAG GCTGATCCGA AGACGGCGTT TGCCAAGCTC ACCAGGGCGA CGAATACGCA AAAGTGCATT CGCGCAGGCG GGAAGCACAA CGATTTGGAC GATGTTGGAA AAGACACGTA CCATCACACC TTCTTTGAGA TGTTGGGTAA CTGGTCTTTC GGCGATTACT TCAAGGAAGA AGCCATCGGT ATGGCGTGGG ATCTTCTCAC GAATGTGTAC GGTCTCGCGC CCGATCGCTT GTATGCGACG TACTTTGGCG GTGACGAGTC GCAAGGCTTG CAACCCGACT TAGAGGCCAA GGCAATTTGG TTGAAGTATC TTCCGGAGTC TCGCGTCATG CCATTTGGGT GCGCGGACAA CTTCTGGGAG ATGGGCGACG TCGGCCCGTG CGGTCCATGC ACTGAGATTC ACTATGATCG CATCGGTGGC AGAGACGCGG CCTCCTTGGT GAACATGGAT GACCCGAACT GCCTCGAAAT CTGGAACGTC GTGTTCATTC AGTACAATCG CGAAGAGGGC GGTGTTCTCA AATCGCTTCC CTCGAAGCAT GTCGACACCG GTATGGGTTT CGAGCGTCTG ACGTCGATCT TACAAAACAA GATGAGCAAT TACGATACGG ATGTCTTTAT GCCCATCTTC AAGGAGATTC AGCGCATCAG TGGTGCCGCA CCCTACACTG GCCTTCTCGG CAAGGAGGAC GTCGGGGAGA AAGACATGGC TTACCGCGTT GTCGCTGATC ATATTCGCAC ATTGTCCATC GCCATCGCCG ATGGCGCGGC TCCCGGGTCA GATGGCCGCA ACTACGTGTT GCGTCGCGTC CTTCGCCGAG CGGTCCGCTT CGGACGTGAG AAGCTCGGTG CAAAGCAAGG TTTCTTTCAC AAACTCGTCC CGTGTTTGAT TCAGCAGCTT GGAGCTGTAT TCCCAGAACT CGTCGCCAAG CAAACGCACA TTACCGAAAT CATCGCCGAC GAAGAGGAGT CGTTCGGCCG AACGCTCCAA AAGGGTATCG ACCAATTCGG CAAGGTTCTC GCGGCGGCTA AGCAAGAAGG TAGAACAGTG ATTTCTGGTC CGGAAGCGTT CTTGTTGTGG GAATCGTATG GTTTCCCGAA TGATCTTACG GAGCTTATGG CCGAGGAGAA CGGTTTTACG CTCGACAATG AAGGTTTCGC GCAGGCGTTT GCCGAAGCGC AAGAAAAGTC TCGCGCCGGC GGTAAGAAGT CTGGAGGTGT GCAGTTGCTG TTCGAAGCGG AAGCCACTGC GTGGTTGCAA AACAATGGTG TGGCAATCAC GAAGGATGAA GAAAAGTACG CAAGCGGACG ACCGACGCTC GAAAGCACCG TCACCGCCAT CATGTCGCCA AGCGGATTCA TAAAATCCAC CTCCGACGCC GAAGGACCTT ACGGGTTCTT CATGGACGCA ACGACGTTTT ACGCCGAGTC GGGTGGTCAA GTGTGTGACT CGGGTTTGAT TACCACCCCG AGTGGCTCGA TGTCCGTATC GGATGTCAAG GTTGCTGCGG CGTACGTGAT GCACACCGGC GACGTATCCG GCACAGTGAG TGTTGGCGAT GCCGCGAAGT GCGCAGTCGA TTATGACCGT CGAGATAACA TCATGCCCAA CCACACGATG ACGCACGTGC TCAATTATGC GTTGCGCAAG GTGATGGGTG ATGGTGTCGA TCAAAAAGGT TCGTTAGTAG ATGAAGAAAA GTTGCGATTC GATTTTTCTC ACAACAAGGG GGCGACGACG AAACAAATTG CTGAAATCGA GGCCATCGTG AACGAGCAAA TCAAATCAAA GCTCGCAGTA GACAAACGCG AGGTAGCGTT AGACAAGGCG ATGACTATCA ACGGTTTACG CGCGGTGTTC GGTGAGGTGT ACCCCGACCC CGTTCGTGTG GTGTCTGTAG GTCCGAGCAT CGACGACTTG CTCGCCAACC CGTCGGATGA CAAGTGGAAA AACTACTCCA TAGAGTTCTG TGGCGGCACC CACTTGGCGT CGACCGATTT CGCGGAGCAA TTCGTCATTC TTGAAGAAAG TGGTATCGCA AAGGGTATTC GTCGCATCAC CGGGGCGACG CGCGAAGGGG CTAAAGCGGC GCTCGCGCGC GCTGCGGACG TTCTCGCTCT GGTGAAGAGT TGCGATTCGT TGTCCGGCGA AGCGCTCGAC AAACAACTGG GTGTGTTGAA AAACGTCGTC GACACTGAAG TTCTTCCCGT CATTCAGCGC GAGGAAATTC GTGCCGCTGT CACTAGTCAA GTCAAGCGAG TCCTGGACGC GCAAAAGGAA GCCGCCGCGG CGGCCAAGGC GCAAGCCATC GTAGACGTTC AAGAAAAAAC CGCCGCGACA AAGTCTGCGG GTGCAAAGTA CTTTGTCGCC ACCCTGGCGG ACGGTACTGA TGCTGGCGCT ATGAAGGAAG CGGCGGCGGT CGCTTTCGCC GAAGGTATCG CGTGCACACT CTTGGCGAAC TGCAAGGGTA AAGAGTTCGT TTACTGCAGC GTGCCTCCGG ATGTCGGCAT CGACGTCAAG GGCTGGCTTG CGGCGTCGTG CGGCCCGCTC GGTGGTAAGG GTGGCGGCGG TAAGGGTGGT TTGGCGCAAG GTCAAGGACC GAACGTCGAC GCTGTTCCAG ATGCCGTCGC CGCCGCCGAA GCGTTCGCGA AACTCGCCAT TTAA
|
Protein sequence | MPVCEQAATY DPSGPKELPR PVKYVEEWPS SRVRDTFVNY FIEKQGHVNW PSSPAVPVND PTLLFANAGM NQYKPIFLGK ADPKTAFAKL TRATNTQKCI RAGGKHNDLD DVGKDTYHHT FFEMLGNWSF GDYFKEEAIG MAWDLLTNVY GLAPDRLYAT YFGGDESQGL QPDLEAKAIW LKYLPESRVM PFGCADNFWE MGDVGPCGPC TEIHYDRIGG RDAASLVNMD DPNCLEIWNV VFIQYNREEG GVLKSLPSKH VDTGMGFERL TSILQNKMSN YDTDVFMPIF KEIQRISGAA PYTGLLGKED VGEKDMAYRV VADHIRTLSI AIADGAAPGS DGRNYVLRRV LRRAVRFGRE KLGAKQGFFH KLVPCLIQQL GAVFPELVAK QTHITEIIAD EEESFGRTLQ KGIDQFGKVL AAAKQEGRTV ISGPEAFLLW ESYGFPNDLT ELMAEENGFT LDNEGFAQAF AEAQEKSRAG GKKSGGVQLL FEAEATAWLQ NNGVAITKDE EKYASGRPTL ESTVTAIMSP SGFIKSTSDA EGPYGFFMDA TTFYAESGGQ VCDSGLITTP SGSMSVSDVK VAAAYVMHTG DVSGTVSVGD AAKCAVDYDR RDNIMPNHTM THVLNYALRK VMGDGVDQKG SLVDEEKLRF DFSHNKGATT KQIAEIEAIV NEQIKSKLAV DKREVALDKA MTINGLRAVF GEVYPDPVRV VSVGPSIDDL LANPSDDKWK NYSIEFCGGT HLASTDFAEQ FVILEESGIA KGIRRITGAT REGAKAALAR AADVLALVKS CDSLSGEALD KQLGVLKNVV DTEVLPVIQR EEIRAAVTSQ VKRVLDAQKE AAAAAKAQAI VDVQEKTAAT KSAGAKYFVA TLADGTDAGA MKEAAAVAFA EGIACTLLAN CKGKEFVYCS VPPDVGIDVK GWLAASCGPL GGKGGGGKGG LAQGQGPNVD AVPDAVAAAE AFAKLAI
|
| |