Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_38421 |
Symbol | |
ID | 5001889 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009359 |
Strand | - |
Start bp | 510322 |
End bp | 512223 |
Gene Length | 1902 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 59% |
IMG OID | 640417310 |
Product | predicted protein |
Protein accession | XP_001418031 |
Protein GI | 145347132 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0452022 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000596343 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGGATT CGGGAGCGGC GGAGTTGACG GTGAAGCTGT GCGATGGAGC GGACGAAGAC CACTGGTACG GTGGGTCGCA TTTGCTGAAG CAGCTCTGGC CGCTCGAGCG AGCGAAAATC GAAATGGGAC CGTTTTACCC GTTCGACCAC GGTCCGAACG GCGTCGGCAA CGTTCTCGGC ACTCACTGGG TGTCGAGCGG TGGGACGTTG ATATTCGCCG ATCCAAATAG TGATTTATTA CACGTCGGCT TGAACGCACC GGCATTCGTC CCGCGAGGTC AAGCGCCGCG GTATTTCGGT GTCGGTATTC AAAACGCTTC CCGGCCGTCG TTGCCGCTGG AAGAGGCTCT GACGTCGGAA CGCGGCGACG GACTACTTCG ATTGCAATCA CGGGCGAATT ACGCGGACGC GAACATGTTG CATCCGTGGC AAAGCATCAA TACGAATGAC GCGCGAACGC GCTTGCAGCT ACGCGTCGCC GTATCTGCGC AGGAAGACGC GAGGAGCGCG ACGAGAGTCG CCTTGGCGCA GCTTCCAAAG CCGAGCGCCG CGCCGGACAA GTCTCTGATG TTCTATCCAA TCTGGACGAC TTGGGCGACG TCGCACGCTG ATGTCACGCA AGAGTCAACA CTTGCAGTAG CTAAGGACAT CGCAAAAGCC AAATCTGACT CGGGATTACC GAACGGTTCG ATCATCGAAA TCGACGACCG CTGGCAAGCG CGATATGGCG AACTTCATTT CGATCCCGTG AAGTTTCCGG ATCCGAAAGG CATGATCAAG GAGCTTCACG ACATGGGATT TTTAGTAACG GCGTGGGTGA TGCCGTTCTT GCAAGAGAGT TCAGCGGCGT GCGAGGAGGC AAAGCGATTA GGGTACTTAC TTGAAGGCTC GCAACCGCCG AACGAAGTCG TCGAGGTGCT CACGGGAGGC GTCGGTCAAG TCCTCGGGAC TACTGTGAAA GTGTTCGTCG ATCGCTACGA CTGGCCACCC GGTCACTGGG AAGGCGGCGG TGGTGGGGGC AACCTTGAGC CTGGACAGTT CCGATGGTGG GGCACGCAGC CCGTCCGCGG CATCGATTTC ACCAATGACG AGGCGTGCGA ATGGTTCGTG CGTCGACTCA AGAAACTGCA AGAAGAAGTC GGACTCGACG GTTTCAAGTT TGACGCCGGC GAGCCTTGTT TCATGCCTTA CGGCGCCCGC CCGCACACGC CCCTGAAACA TCCACAAGAG TACAGTCAGG CATACGTGGA AAAAGTTTGT TCTAAATTTG CCTTGAGCGA AGTGCGCGTG GCGATGGGAA CGAACAATTA CAACGGCTTA ATTCGCATGG GCGATAAAGA TACCGTGTGG GGCGTCGATA ACGGTCTACA GAGTCTCATT CCGAGCTTAC TCACGTCCGC AGTTATCGGG TTCCCGTTCA CGCTTCCGGA CATCATCGGC GGCAACGCGT ATTGGAATCA AACGCCCGAT ACCGAGTTGA TGATTCGCTG GGCGCAAGTC AGCGCGTTCA TGCCCGCGGT GCAATGGTCG ATTCCACCGT GGGAAGTGAG CTCAGTGGCG TACGAGGCGA GTGTCAAGGT GATGCACATG CGCGAACGGC TGCTCTTACC CAAGCTCGCC GCGCTCGCGG AGGACGCCAA AGCCTCCCTC GAACCCATCT GTCGTCCAAT GTGGTGGCTC GACCCCAACG ACGCCCAAAC GTTCGCCATC GACGACCAGT TTGCCGTCGG TACCGACATG ATCGTCGCCC CTGTGGTCGA AAAAGGTGCG ACATCGCGCA CGGTTTACCT CCCCGCCGGG TCTTGGCGGC GATACGACGA ATCGAACGCC GTCGCCGGCG GTCGCCGCGT CGTCGTCGAC GCCCCGCTCG ACGTCCTCCC CGTGTTCATC CGCGCCGACT AG
|
Protein sequence | MEDSGAAELT VKLCDGADED HWYGGSHLLK QLWPLERAKI EMGPFYPFDH GPNGVGNVLG THWVSSGGTL IFADPNSDLL HALTSERGDG LLRLQSRANY ADANMLHPWQ SINTNDARTR LQLRVAVSAQ EDARSATRVA LAQLPKPSAA PDKSLMFYPI WTTWATSHAD VTQESTLAVA KDIAKAKSDS GLPNGSIIEI DDRWQARYGE LHFDPVKFPD PKGMIKELHD MGFLVTAWVM PFLQESSAAC EEAKRLGYLL EGSQPPNEVP GQFRWWGTQP VRGIDFTNDE ACEWFVRRLK KLQEEVGLDG FKFDAGEPCF MPYGARPHTP LKHPQEYSQA YVEKVCSKFA LSEVRVAMGT NNYNGLIRMG DKDTVWGVDN GLQSLIPSLL TSAVIGFPFT LPDIIGGNAY WNQTPDTELM IRWAQVSAFM PAVQWSIPPW EVSSVAYEAS VKVMHMRERL LLPKLAALAE DAKASLEPIC RPMWWLDPND AQTFAIDDQF AVGTDMIVAP VVEKGATSRT VYLPAGSWRR YDESNAVAGG RRVVVDAPLD VLPVFIRAD
|
| |