Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_17441 |
Symbol | |
ID | 5004436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009365 |
Strand | + |
Start bp | 538626 |
End bp | 539654 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | |
GC content | 62% |
IMG OID | 640419857 |
Product | predicted protein |
Protein accession | XP_001420375 |
Protein GI | 145352056 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.123803 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.178855 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCG GTTCCGATGC CGACGACCCG CGCGCGCCCG CGCTCACCTC CGTATCGCGC GCGGTGAGCG GTGAAACCTT TCGTATACTC ACTCGCGCGT CGACGTCCGC GATCGACGTC TCCGCGCGCG ACGCGAGCGG GAAAACCTTT GTCGGAAGAT TGGATTTGCA ACGCGCGCCG TTTGAACGTT TCGGTGAATT CGCGCGCGCA GCGCTCGGCG ATCCGCGACG CGAACGCGTC GAACGCGCGT TCACGTTCAC GTTCGAGCGC GCGGAGGAGA ACGGGGACGA GTACGAGCTC GTGTGGCGGT TTAGGGAGGC GGAGGCGGAG GCGACGCTGG ACGACGGTGC CGAGTTTGCG ACGACGCGAG CGAAGACGGC GAACGACGGG AAGAATTACG GCGCCGTTCG CACGGGTGTG TGCGGTATGA GGCGCGTGGA GAGCGACGAC GCGTTTTGTG ATGGAGGATT CGCGTACGAG GCGTTGGGGG AGGCGTTGGA GATGGAGGCG CGACGACGCT TCAACGCCGA GCGAGACGGG AAGAGGGCGA TCGAGGCGCG CGAAAGGGCG AATGAGAACT GCGAAACGGC GCTGCGAGAG ATGGATGCGT TGAAGGAGCG ATTAGCGCAA AACTTTCTAT CGCTTTTAAA CGAGAAAAAG AAGGGCATGA GGAAAATGGC CGAAGAGTTG GAAGCCGCGA GGGCGGAGAA CGACGAGTTG AAGGCTGACC TGGCGAGCGC GAGACTGGAA CACTCGAAGC CCGACGTCGA CGGTTTCGTG GCGGTCAAAG GCGAGCGAAG CGTCGATGCA GACGTCGACG AGGACCGCGA CGAGAGCGAG GACGAATACA ACACGGATGA TGAAACGGAA CGAACGCGAA GGCGCGGTCG CAAACGCGAC GCGTCGCAAC CGTCGCAGCG AAAATCGCAG CCATCGCAGT CGCAACCGTC GCAAAAACGC GCCCGAAATG CGACGGCGAA AAAGAAATCC TTTTTAGCCG ACACCTTAGC CCTGCTCGAC GCCGATTAA
|
Protein sequence | MKRGSDADDP RAPALTSVSR AVSGETFRIL TRASTSAIDV SARDASGKTF VGRLDLQRAP FERFGEFARA ALGDPRRERV ERAFTFTFER AEENGDEYEL VWRFREAEAE ATLDDGAEFA TTRAKTANDG KNYGAVRTGV CGMRRVESDD AFCDGGFAYE ALGEALEMEA RRRFNAERDG KRAIEARERA NENCETALRE MDALKERLAQ NFLSLLNEKK KGMRKMAEEL EAARAENDEL KADLASARLE HSKPDVDGFV AVKGERSVDA DVDEDRDESE DEYNTDDETE RTRRRGRKRD ASQPSQRKSQ PSQSQPSQKR ARNATAKKKS FLADTLALLD AD
|
| |