Gene Information |
Locus tag | OSTLU_30851 |
Symbol | |
ID | 5000807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009357 |
Strand | - |
Start bp | 865978 |
End bp | 867123 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 62% |
IMG OID | 640416228 |
Product | predicted protein |
Protein accession | XP_001417078 |
Protein GI | 145345135 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00449336 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGGCGGAGG CGTGCAAAAG AACGCGCGAC GACGAAGATG GACTCGGGAG TGGCGCGTGC GCGTTCGCGT ACGCGCGCGC GAAGGCGACG ACGGGGTGCG TGTTCGTGGA CGACGGCGGC GCGGGATCGA AGACGCTGTA CGAGGCGCTG CGACCGAGCG CCACGGCGAG ATTAGCGGAG ACGCTGTCGG CGTCGAGCGC GGGCGTGGAC GCGGACGCGA AGGCGCGAGA TGAACTGCAG GCGGGATTGC ACTGGGAGTT TTTGCGCGCG CATCCCGAGA CGCGGCCGTT GTTGGGGATC ACGGGCGCGG ACGCGAGGAA AGGATTACCG GGAAACGGCG ACGACGACGA GGCGCCGGAG ACGGCGTCTA TGTGTGGCGT CGTCGCCGCG GTCGTGGCGA GTAAGATATT GAGTTATTTG GACGCGAGTG AAGCCGTCGC AGTGGCGCGC GAGTGCGATC AGTGGACGCA CGTGGAGGCG TTGGCGTTGT ACAAGCCCGA TCGCCCGGAC TCGGCGCCCG TGCCAAACGC GTCCGAATAC GAGGTTCTTC CGAGCGGGGA CGATATGGAT GCCGACGACG CGCAAGACGG TGATGAAGAT CGCGAGGGTT TCGATACGCA CGCGAGACTT TTCGGCTACG ACGCGTTGGC TAAGCTGAGA GAGATGTACG TCCTTGTGTG CGGCACGGAT TCGCTCGCCA ACGACGCGTG CGTCTGCGCG CTTGCCGCCG TCGGCGTCGG TAACGTCGAC GTATACGGGG CGAGTGGATC AAAAGTATTT GTTCGGCACG ATTGCGAAGT GGACGATTTG TGCGATTTAG ACGATTTAGA AACGTACAAA TACCACGTCG TGGTTCGCAC GAGCGCGTGC GCCACCGCCG ACGAAGTCGT CGCCATCGCG AGAGAGGCAA AATCGCCGGT GATTGAAATT ACGAGTACGA GCGTGGGGTC GTGTGTTGTG GATGTTAGTT TAGGCACGGA CTCGTCCTTC ACGCGTGCGT CGTTCTCGGG ATGGATGGAT GCGCCGACGG CATGCGTCGC CGCACACATC GCCGCGATGG AAGTAGTGCG GATAGCGCAA GACCGAAGAC GCGCGACGAC GATAGTGTTC GACGGGAAAG GGATATTTAC CAAGGCAAGG ATGTAA
|
Protein sequence | MAEACKRTRD DEDGLGSGAC AFAYARAKAT TGCVFVDDGG AGSKTLYEAL RPSATARLAE TLSASSAGVD ADAKARDELQ AGLHWEFLRA HPETRPLLGI TGADARKGLP GNGDDDEAPE TASMCGVVAA VVASKILSYL DASEAVAVAR ECDQWTHVEA LALYKPDRPD SAPVPNASEY EVLPSGDDMD ADDAQDGDED REGFDTHARL FGYDALAKLR EMYVLVCGTD SLANDACVCA LAAVGVGNVD VYGASGSKVF VRHDCEVDDL CDLDDLETYK YHVVVRTSAC ATADEVVAIA REAKSPVIEI TSTSVGSCVV DVSLGTDSSF TRASFSGWMD APTACVAAHI AAMEVVRIAQ DRRRATTIVF DGKGIFTKAR M
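
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (consistent with "Is gene spliced | No" and the trailing TAA in the gene sequence). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 865978
END_BP = 867123
GENE_LENGTH_BP = 1146
PROTEIN_LENGTH_AA = 381


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 867123 - 865978 + 1 gives the listed 1146 bp, and 1146 / 3 - 1 gives the listed 381 aa.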