Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_24477 |
Symbol | TPR4a |
ID | 5001650 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009358 |
Strand | - |
Start bp | 885979 |
End bp | 887942 |
Gene Length | 1964 bp |
Protein Length | 612 aa |
Translation table | |
GC content | 66% |
IMG OID | 640417071 |
Product | TRP-containing protein |
Protein accession | XP_001417631 |
Protein GI | 145346302 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.000106986 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCA GCGCGGGCGG ATCCACGGGC GCGGCGTCGC TGGCGACGCT CGACGCGACG ACGTCCGAGG GTGACACCCA TCTGAGTCCG ATGCTGATGC GAACGGACGG CGCCGAACGC CTCGACGACG ACGCGCTGCG GGCGTCGCCC GAGCGCGCCG CGGCGACGCG CGCGGCGCAA TCGTCGGGAT CGGCGTTCGG GATGGGCGGC GCGGGCGGCG CGGGCGAAGC GCGGCGACGG GGACGCGGGA CGCGACGAGG GGACGAAGAC GCGGACGACG ACGCGGACGT CGAACGGTCC GCGAGAGACT TGGACGACGA CGGCGGATAC GAGACACGAA TGCCCGCGGT GGACGCGGCG GCGGTGACGC GGTCGATCGA AAATCCGGGA CACGCCACGG TGATCGGGGA CGAAGGGGCG CGACGCGCGG CGACGCACTC GATTCAAAAC TCGCAAGCCG TGCGGTCGAA CGACGACGTC GTGAACGGAG ACGAAGCCGG GAATGTGAGG GCGCGCGCGC GGGCGATGGG TGAGGAGGCG GCGGCGACGG TCGACGCGAC GAATGCGGAC GGCGCGGCGC GGCGCGGCGC GGCGCCCGGG TTGACGCGAA GTCGCGTCGG CGGGCGCGGA GGCGCGACGG GGAGTTCGAG CACGGTGCAC GGACACGCAC AGCGGCCGGC GAATAGTGCG GGAGATTCGA ACACGGCGGC GGTGTCGAGC GACCGCGCGC GTCCGGTGGA TTCGACCGAG GCGCAGAGGC TTACGAATTC GCGCGGCGAA GGCGGGCGCG CGGATTCGGG CCTGGGCAAT AGCGAAGAGG TGGAATGTGG CGACATCAAG GCGGCTGGAA GCTTGCTCGC CGGTGAGGGA GGAGCGGCAG GTGTCGGTAC CGAGGTCGAA TCGAAGCAAA TCATTCTGCC CGCGCCGACG CGGTGGGATA ATCAAGAACC GTGGGACGTA CCGCGAGAAT CGCTCGTTCC GGCGTCGCTC GAGCTCAAGG CTCGCGCGGA TGCGCTCAAG GCGATCGAGA ATTTCGGCAC CGCCGAGCGC GCGTACGGAC ACGCCCTGCG ATTTATCGAT CAAATTGAAA TGTTTGAAAC CGAAGAGACG GGTGGATCGT CGGAGCGAAA TTCACAGTTG AAGACGGCGT GCTTGCTCGA AGCGAGCGCG TGCGCGCTTC GGAGAGCCGA TCCCGTGCGA GCGGGTGAAC TCGCAGCTCG CGCGCTCGCG CGCGAACCGA ACAATGCACT CGCACTCAAG GCGCGAGCGC TCGCCGCGGT GACCGAGGGA GATTTCGGCA CCGCCATCGG CGACTTGACC AAGGCGTTGG AAATGACGCC CGATGATCCG GGGCTCGTCG CGGATTTGCG CGAAGCCACT CACCGACGTG ATGTCGCTCT CCACGCACCA CGACGGGGTG GTATCGCATC GTCGTTCATC GGCGCACCGG GAGGAGTGTC CTGGGGGACG ATGCCCATGA GCGGCATGGC GGGATCGAGC GCGCAGGGCG GGCAACTGTT CTCTTACCCC AACGCCGGCT ACGTCGCCAG CGGTGGCTTG GGTATGCACG GTGAAGTTGA GGGGGAAACC GATGGCGCCA TGAGTGCTGG AGTTTTCGGA GGTGGGTTCA GCTCTTTCCA AAACGTCGAC GATAACGCTG CCATGGATAC ACGAGCGATG CTTGACGGTA CGCACTTGCG CCGACCAGAC AGCCCGTCGC ACGCCACCGG CACGGAATCG CGCATGGATC AACCGGGAAC AACTTTCAAT CGCGTCGGTG GTACGCCGAG CGTGAGTAAA GGTACGACTA TCTCTATGCA GAAGAATGCG ATGCAGTCGG ATGCCTTTAT GCTCGGTGAC GAAAGCGAAG AAGACGGCGA AGAGGTGTGC GGCGGCGGCG GAAAAGAGCC GGTTTCGAGC GAATGATGTT AAGCGAGCAA TGTAGACGAC GACTGGATTC GTAA
|
Protein sequence | MARSAGGSTG AASLATLDAT TSEGDTHLSP MLMRTDGAER LDDDALRASP ERAAATRAAQ SSGSAFGMGG AGGAGEARRR GRGTRRGDED ADDDADVERS ARDLDDDGGY ETRMPAVDAA AVTRSIENPG HATVIGDEGA RRAATHSIQN SQAVRSNDDV VNGDEAGNVR ARARAMGEEA AATVDATNAD GAARRGAAPG LTRSRVGGRG GATGSSSTVH GHAQRPANSA GDSNTAAVSS DRARPVDSTE AQRLTNSRGE GGRADSGLGN SEEVECGDIK AAGSLLAGEG GAAGVGTEVE SKQIILPAPT RWDNQEPWDV PRESLVPASL ELKARADALK AIENFGTAER AYGHALRFID QIEMFETEET GGSSERNSQL KTACLLEASA CALRRADPVR AGELAARALA REPNNALALK ARALAAVTEG DFGTAIGDLT KALEMTPDDP GLVADLREAT HRRDVALHAP RRGGIASSFI GAPGGVSWGT MPMSGMAGSS AQGGQLFSYP NAGYVASGGL GMHGEVEGET DGAMSAGVFG GGFSSFQNVD DNAAMDTRAM LDGTHLRRPD SPSHATGTES RMDQPGTTFN RVGGTPSVSK EECDAVGCLY AR
|
| |