Gene Information |
Locus tag | OSTLU_18797 |
Symbol | |
ID | 5006359 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009372 |
Strand | + |
Start bp | 37780 |
End bp | 38807 |
Gene Length | 1028 bp |
Protein Length | 323 aa |
Translation table | |
GC content | 57% |
IMG OID | 640421780 |
Product | predicted protein |
Protein accession | XP_001422289 |
Protein GI | 145356125 |
COG category | |
COG ID | |
TIGRFAM ID | |
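
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which matches the reported gene length of 1028 bp. The function names `gene_length` and `coding_length` are hypothetical helpers, not part of any database API. The gap between the gene length and the coding length is read here as non-coding (presumably intronic) sequence, in line with the "Is gene spliced | Yes" field. That reading is an inference and is not stated in the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


length = gene_length(37780, 38807)  # Start bp / End bp from the record
cds = coding_length(323)            # Protein Length from the record

print(length)        # 1028
print(cds)           # 972
print(length - cds)  # 56 (bases not accounted for by codons; assumed intronic)
```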
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.872308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.00206186 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCATG GATGCCAACC ACCACGGACG TTTCAAGCGA GGCACTCGTG TCGACTGTGG TTCGTTTTCA CCGTCTCAGT GGCGTGCGTC GTCGTCGGAT CGCGCGGGGC GTCGTACGGA ACGTACGAGA ACGACGGCGG TGTCACTCGG GAGCGGCGCG CGGATCGTGC GCGACGACCG GCTATAGAGT CTAGATCATC CTTGAGCGAA GTGACTCGTG AAACGTACGA GCGCGCACGA ACGGTCTTCG AGGAATTTTT AAATGAGCGG AGCGCGCAGT GCGGGCGCGC GTTCACGGAG ACGCGCACGC ACTTTGGCGC TCCCGTGGAA ACACCTCCAT ACCCTGGGGG ACCTCCTCCT GGTACCGGTG CGGTGCTCAC AGAGATGGTG CGAGCTGCGG CGGCGGCGTG GCATAGAAAC ATGTCGTACG TGATGCGTGG TAAATGGGCG TTGGTTTCGG ATAGCGACTG CGCTGAGGCA GAGTTTCAAG GATTTGCCTG CATGTTCCCA GGATTGTCGC GCTCGTGTCG AGCCTTGGAA GGTGAAAACC CGATGTTGAA TTTACAAGAC GCGCAAGACT TCTGGCGTCA CTGGAGCAAA AATCTCAGCG TTGATGCGAT TTTTTTATTT GGGCTCGCCG CGGATGCGCT GACAGACGTC CGTCGCGCGA GCGTCAAAGT GAAGAGTTTT CTCGAACACG ACCTATCGAG TGCATTAGCG GGATCTGAGA GCGAAGGAAT CGCAGTCGGT ATACATTTTC GCAACCGTGG AGATATCAGA CTCGATGGGA GATTGAGGAT ACCTCTGGAG CGTTACGTCG AGTGGGTGGA TGGGCTTGGG AATTCAACGA AAATTCGTGC CGTTTACGTC GCCACAGATC ACGATGGACT GGATGTTCGC GAGCTCAACG AGCGTTTTCC CGGTCGTTCG TACGAATTCC GAATGATCAA GCGTTTGTGG GCACCAGCGA CCGCCACAAA CCTGTGGCAG ACGACGACGA CGGCGACGGC GGGTGGTTTG AAGTTTGA
Protein sequence | MDHGCQPPRT FQARHSCRLW FVFTVSVACV VVGSRGASYG TYENDGGVTR ERRADRARRP AIESRSSLSE VTRETYERAR TVFEEFLNER SAQCGRAFTE TRTHFGAPVE TPPYPGGPPP GTGAVLTEMV RAAAAAWHRN MSYVMRGKWA LVSDSDCAEA EFQGFACMFP GLSRSCRALE GENPMLNLQD AQDFWRHWSK NLSVDAIFLF GLAADALTDV RRASVKVKSF LEHDLSSALA GSESEGIAVG IHFRNRGDIR LDGRLRIPLE RYVESRWTGC SRAQRAFSRS FVRIPNDQAF VGTSDRHKPV ADDDDGDGGW FEV
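
The sequences above are displayed in space-separated blocks of 10 residues, so the spaces have to be removed before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-content calculation. The record does not say how its GC content value was derived, so the formula here (G plus C over all bases) is an assumption, and `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(display: str) -> str:
    """Remove the whitespace used to group residues into blocks of 10."""
    return "".join(display.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three display blocks of the gene sequence, used only as a small example.
first_blocks = "ATGGATCATG GATGCCAACC ACCACGGACG"
seq = clean_sequence(first_blocks)

print(len(seq))                            # 30
print(round(gc_percent(first_blocks), 1))  # 56.7
```

Passing the full "Gene sequence" field to `gc_percent` should give a value comparable to the reported GC content, provided the record uses the same definition.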