Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_28422 |
Symbol | |
ID | 5006340 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009372 |
Strand | + |
Start bp | 11327 |
End bp | 12537 |
Gene Length | 1211 bp |
Protein Length | 306 aa |
Translation table | |
GC content | 48% |
IMG OID | 640421761 |
Product | predicted protein |
Protein accession | XP_001422282 |
Protein GI | 145356110 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAGT TTCACCAGCT ACCACTGAGG GCGTACGTGG ACGTAAAAGT CGACCCGGTG ATTTTCACGC ACGCTCTCAA TAAGAACCAC ACGTCGAATC TGTTCGTGAT CGCTCCAGAC GACAAACGCG GCGCAAATCC CTTTCATTTC GCGCAATCTG CCATGTTCTT TTTTCACAGC GCTCTCAGGA CGCTAGAGAA TTCGGATAGC TCTTCTGTCG TGATAATTTT TAGACAGCGG CCGCCCGCTC AAAGTTGGAT TGATAGTTTG ACTCAGCAGA TCTTTGGCGA CGTCAGGGTC GTTTATGGTG ACGAGTTGAC ATCGCCTATT TGTGCCAGAA GAGTTGTCGT CGCTGGAACT ATGATAGGTT TACTTCAAGG CCCATATGAC GCTCAATTAT TTCGAGACCG AGTGTATGGC AATCTCAAAA TCAACCCGAA ACGAATAAAC AGAGCGGATT TGCGAGTGAC ACTGATTGAT AGAAAGAAGA GACGTGTCAC TAACGTAGGC GAATTACAAG AAATATTGGA TGAGCGTCGA CTTTGGTACA AAACTGTTCG GCTCGATACG CTTTCTTTCA AGGAGCAAGT GTCCCTCATG TCAGAAACAG ACTTGCTCAT TTCATCGCAT GGCGCCGATC TTACGAACGT TATATTCATG CAACGAGAAA GCGCAGTCAT TGAGCTCTTT CCTTCGACGG TTTGGTACTA TGAGCTCTAC GCAAAAATCG CACGGAACGC CGGATTGTTC CACACGTACG CTCTCGGCGA TCAAACGCAC GCCGTTACGA AGACCATTGC GGAGTGCTTT GAAAGTGCCT GTCTGACCGA ACTGAAACGC GACTTTATGA TACCGCCTGA ACGTTTTCGT ACTTCTCTCG ATCACGCGCT CAGTCTCCTT GGAGTCGCCA ACGCAGTCTA GTAGATTGAT CCGCTTCGAG TTGCGGCATT AGACACGACG TTCAGTGGCG GTCCAAAAGT CCCAACGCGC GTTCGTGCAA TCATCAAAGT ACGAGAAGCA CGGGTGAGGA GGAAACGTGA TCGAGGAAGC TAAGCCGCTG TAAGTACTAT TCGATCCAAC AATACACGAT GCTTGAGATA GAACATAAAG CTCAGCAAAA GTATCCATAT TCCCTTTTCG CATACTCCGC TTGAAGACGC TCCAACTTCT GTCGATGTGA AAAACATTGG TACTGTGCAC AGTGCGCACG C
|
Protein sequence | MSKFHQLPLR AYVDVKVDPV IFTHALNKNH TSNLFVIAPD DKRGANPFHF AQSAMFFFHS ALRTLENSDS SSVVIIFRQR PPAQSWIDSL TQQIFGDVRV VYGDELTSPI CARRVVVAGT MIGLLQGPYD AQLFRDRVYG NLKINPKRIN RADLRVTLID RKKRRVTNVG ELQEILDERR LWYKTVRLDT LSFKEQVSLM SETDLLISSH GADLTNVIFM QRESAVIELF PSTVWYYELY AKIARNAGLF HTYALGDQTH AVTKTIAECF ESACLTELKR DFMIPPERFR TSLDHALSLL GVANAV
|
| |