Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_37524 |
Symbol | |
ID | 5005947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009370 |
Strand | + |
Start bp | 356869 |
End bp | 357819 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | |
GC content | 56% |
IMG OID | 640421368 |
Product | DMT family transporter: UDP-galactose/UDP-glucose |
Protein accession | XP_001421918 |
Protein GI | 145355333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.0090213 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000550159 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCAGTCGC AGACGCAGAT GACGAGGATC GCCATGTGCG TCGTCGGCGT CGTCGGTTCT CTCATCGTCT ACGGCATCTT GCAAGAACGG ATCATGACGA GACCGTACGG GGTCGAGAGC GAATACTTCA AATACAGCGT CTTCTTAGTG CTGAGTAACC GCGTGCTGAG CGCGTCGCTC GCGGCGGCGA TCCTGGCGTA CACGAAAGGG ATGGTGCAAC CGGCGGCGCC GATTTGGAAG TATGCGGGGG TGAGCGCGAG CAACGTGCTG GCGACGACGT GCCAATACGA AGCGCTGCGA TACGTGTCGT TCCCGGTGCA AACGCTCGGA AAGTGCGCGA AGATGATACC GGTGATGATT TGGGGATATT TCATCAATCA GCGACGGTAT ACACTGAACG ATTACGTCAT AGCGTCGTGC GTCACGCTCG GTTGTACCAT TTTCGCGTTG TACGGCGATT TGACGCACAA GCACAGCGCC AAGAGTTCGA ACACCAGCGC GAAAGGATTG ATGTTAATGT TAGGCTATCT CGGTTTCGAC GGATTCACGA GCACGTTTCA GGATAAGCTG TTCAAAGGAT ACCAAATGGA GACGTACAAT CAAATGTTGT ACGTCAACGG AGTCAGCGCG TGCCTCTCCG TCGCCTGGCT CTTATCCGAC GGCGCGATTT GGCAGGCCCT GGAGTTCATC GCCCGTCACC CCGCGGTGTT ATCCGATATC ATCACTCTGT CATTGAGTTC GATGTTCGGT CAGCTCTGCA TCCTGTACAC CATCAAAGAG TTCGGCGCGT TGCTCTTCGC CGCCATCATG ACGACGCGAC AGCTACTCAG CATTCTTCTC AGCTGCGTGT TATTCCTCCA TCCACTGACG TGGCAGCAAT GGTGTGGCAC CGCCTTAGTT TTCTCCGCGC TCTACGCCCA GGCGTATTTG AAGAACGCGC AGCCGCGTTA G
|
Protein sequence | MQSQTQMTRI AMCVVGVVGS LIVYGILQER IMTRPYGVES EYFKYSVFLV LSNRVLSASL AAAILAYTKG MVQPAAPIWK YAGVSASNVL ATTCQYEALR YVSFPVQTLG KCAKMIPVMI WGYFINQRRY TLNDYVIASC VTLGCTIFAL YGDLTHKHSA KSSNTSAKGL MLMLGYLGFD GFTSTFQDKL FKGYQMETYN QMLYVNGVSA CLSVAWLLSD GAIWQALEFI ARHPAVLSDI ITLSLSSMFG QLCILYTIKE FGALLFAAIM TTRQLLSILL SCVLFLHPLT WQQWCGTALV FSALYAQAYL KNAQPR
|
| |