Gene Information |
Locus tag | OSTLU_3820 |
Symbol | |
ID | 4999417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009355 |
Strand | - |
Start bp | 776288 |
End bp | 777415 |
Gene Length | 1128 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 55% |
IMG OID | 640414838 |
Product | predicted protein |
Protein accession | XP_001415929 |
Protein GI | 145341672 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | [TIGR01264] tyrosine aminotransferase, eukaryotic [TIGR01265] tyrosine/nicotianamine aminotransferases |
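The coordinate, gene length and protein length fields above are tied together by simple arithmetic. The sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based and inclusive, which the record does not state; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 776288
END_BP = 777415


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, codon_count(length))
```

Under that assumption the sketch gives 1128 bp and 376 codons, which agrees with the listed Gene Length and Protein Length.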
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGATATCGC TCGCGCAAGG AGATCCGACG GTGTTTGGAC ACCTTTTGCC GCCGAAGACG GCGATGGATG AGGTGGCTGG GGCGTTCTCG ACGAGCGCGC ACAACGGGTA CACGGCGAGC GCGGGTTCGG CGACCGCGCG GGCGGCGGTG GCGATGCGGT ATTCGTTACC CGATCGTCCA CCGTTGAGAA CAGAGGACGT TTTCATGACA GTTGGTTGCT CCGAGGCGCT CTCACACTCG TTCGCGGCGA TGGCGGTGGA GGGAGCAAAC ATTTTACTGC CGAGGCCCGG TTTTCCGTTG TATGAAACTT TGTGTCATAG GCACGGTTTG GGATACAAGT TTTATGATTT AGACGACGAA AATGGATGGG AAGTCAAGAT TGACGATGTT CGCAGGCTTC GGGACGAAAA CACGGTGGCG ATCGTCGTGA ATAACCCGAG CAATCCTTGC GGCGCGGTGT TTAGTGAAGG TCACCTGCGA GAAATTTGCG AGACTTGCCA CGAGTTGCGC TTGCCAATCA TCGCCGATGA AGTGTACGAA GACGTCGCTT TCGATGAAGA CAGGCCGTTT CTGTCGATCG CAGCTTTTAG TGGTAGAGTT CCCGTCATGG TGGTGAGTGC GTTGAGCAAG CGCTGGCTCG CGCCCGGATG GCGCATTGGT TGGCTTGTCC TTCACGACTA CGATCATATT CTACAGACTG CAGGCGTGCA GCTTGCGATT AACAACTTGT GTCAGGTGTC GTTAGGTCCG CCGACGCCGA TCCAAGCCGC GATTCCGGGA ATTTTCAAAG CCAACGAGAC GGAGTGGCTA AAGGCTACGC TCGGCGTCTT GCGTCGCGCA AGCCAGCGCT GCGTCGAACG CTGTGCGCGA GTTCGTGGTT TGACTGTTCC TTGTGAACCT CAAGGAGCGA TGTATGTGCT GTTGAAAATG AATGGTGATG CGTTCAAGGA CGCAAATGGG TTTTTCACTG ATGTCACCTT CGCCAAGCGC CTGCTTGCGG AGGAATCAGT ACTCGTGTTG CCGGGCACGT GCTTTCACGC GCCCGGATAC TTACGTCTAG TGATTACAGT TCCAGATGAC GAATTGCAGA ACGCGTGGGA TCGCATTGAG ACGTTTTGTG AACGTTAC
|
Protein sequence | LISLAQGDPT VFGHLLPPKT AMDEVAGAFS TSAHNGYTAS AGSATARAAV AMRYSLPDRP PLRTEDVFMT VGCSEALSHS FAAMAVEGAN ILLPRPGFPL YETLCHRHGL GYKFYDLDDE NGWEVKIDDV RRLRDENTVA IVVNNPSNPC GAVFSEGHLR EICETCHELR LPIIADEVYE DVAFDEDRPF LSIAAFSGRV PVMVVSALSK RWLAPGWRIG WLVLHDYDHI LQTAGVQLAI NNLCQVSLGP PTPIQAAIPG IFKANETEWL KATLGVLRRA SQRCVERCAR VRGLTVPCEP QGAMYVLLKM NGDAFKDANG FFTDVTFAKR LLAEESVLVL PGTCFHAPGY LRLVITVPDD ELQNAWDRIE TFCERY