Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | OSTLU_89073 |
Symbol | |
ID | 5005397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009368 |
Strand | - |
Start bp | 118692 |
End bp | 121787 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | |
GC content | 56% |
IMG OID | 640420818 |
Product | predicted protein |
Protein accession | XP_001421393 |
Protein GI | 145354228 |
COG category | [R] General function prediction only |
COG ID | [COG0488] ATPase components of ABC transporters with duplicated ATPase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.161283 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.722598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCCGA GCAAGTGCAA CGTGCCCGCG GCGGCGAAGG TGACGCCGGC TGACGTCGAA GGGATCAAGA CGGAGGCGGC GAAGAAGAAG GACGAGGACG CGCGCAAGGC GGCGGCGGAA AAGATTACGG AGATTGCGAA CGCGGCTTCG TTCGCGGAGG AACCGTATTT GATTGATTTG CTCGAGGTTG CGATCACGCT CGCGGGTGAT AACAAGTCTA GCAATGTGCG CGCGGCTGGC GACGCGGCGG TGGCGGCCAT CGCGCCCAAG TTGAGCGAGT TCGCGGTTCG CCCGGCGTTG CGAGCGATCT TTGTTGGTTT TCAGTCTCAG TTCTGGCAGT CCACCATGGC TGCTTTGCGT GTGCTCGATG CTTTCGTCGA TCGCAACCGC AAGGCGGTCG CGGCGAACTT GCCGGAAATC ATCCCGGAGC TCGCGCAAGT CATGGTGCAC ATGCGCTCGG AAGTCAAGGA GGCGTCCACC GCGTCCATGG CCAAGGTTGC GACGTGCGTC GGTAACTTGG ACATCGAGCC TTTCATCCCG ACCTTGATTG AATGCATCAA CAACGTCGAT GAAGTTCCGG AATGCGTGCA CAAGCTCGCG GCGACGACTT TTGTTCAACA AGTTGAATCC CCGACGCTCT CCCTCCTGGG TCCGCTTCTC CAGCGCGGTT TGTTCTTCCA GCAAACCACC CCGATCAAGC GTAAGTCTGC CGTCATCATT GACAACATGT GCAAACTTGT CGAAGATCCG ATGGATGCCG CGCCGTTCTT GCCCAAGCTT TTGCCGCTTT TGAAGCGCGC CATGGACGAA GTCGCCGACC CGGAGTGCCG CCAAGTGTGT ACTCGCGCGT ACAAGACTTT GCTCCAAGCC GCGGGCAACG AAACCGGTGC CGAAGATGGC CAAGAAGGCA AGGGTGTCTA CAACGAAGAA ACCGTCAGCG AGCAGTTCAT CGCGCTTTTG GCCGAATCCT CCGGTTCTTC CAAGGAAGAC GTCACGAAGT TTGTCCAAGG CGACGGCGTC AAGGTGTACT TTGACTACAT TTGCTCCTTG TCCGCCAACG CGTTGTTGGC GAAGAACTTC GATCTCGACA CCTGGACCAA GTCTTGCGCG ACCTCTTACT TGAAGCTTTT CTTCTCCGAG GCCGATGCCA TGACCAAGTC TCTCGTCGAA CGCGCTCACG CCGTGTACGA AGCCTCCAAG AAGGTCTTCG TCGTCGAGGA CGAAGAAGGC GAAGATCTTT GCAAGTGCGA CTTCTCTTTG GCGTACGGTG CGCTCATTTT GCTGAACAAC GCCACGCTTC ACATGAAGAA GGGTAAGCGT TACGGTCTGT GCGGCCCGAA CGGTTGCGGT AAGTCCACTT TGATGAAGGC TATCAACAAC GGCCAAGTCG AAAACTTCCC GCCGCCGGAG GAACTCCGCA CGGTGTACGT CGAGCACGAC ATCCAGGGTG ATCAACACAC GATGAACGTC GTTGAGTTCG TTCTCTCCGA CTCTGTCATC CAAGGCCACG GCACGAGCAA GGAATCTGTC GCTTCCACGC TCTCCTCTTT CCAATTCACC GACGAGATGA TCAACGGCCC GGTTGTCGCC CTCTCCGGTG GCTGGAAGAT GAAGCTCGCG CTCGCGCGCG CCATCCTCAT GAAGGCGGAC ATTTTGTTAC TCGATGAGCC GACGAACCAC TTGGACGTGA AGAACGTCGC GTGGTTGGAA GAGTACCTCA ACTCGCAAAC GCAAGTTTCC TCCATGATTG TGTCTCACGA TTCCGGTTTC TTGGATCGCG TGTGTACTCA CATCATTCAC TACGAGAACC GCAAGTTGGT GACGTACAGA GGTAATCTCA CCGAATTCGT CAAGCAGTGC CCGGCGGCGA AGAAGTACAC CGAGCTCTCC AACGACGAAC TCAAGTTCAT CTTCCCGGTT CCGGGTTTCC TCGAGGGCGT GAAGAACAAG GACAAGGCCA TCGTCAAGGC GACCAAGTGC TGGTTCAAGT ACCCGAACAC GACGCGTCAA ATCATCCAAG ACGCGACCAT TCAGCTCTCT TTGAGCTCTC GTGTCGCGTG CCTCGGCCCG AACGGTGCTG GTAAGTCTAC CTTCATCAAG CTTCTCACCG GTGAAGCCGA ACCGGACCAG GGTACCGTCT GGCGTCACCC GAACATGCGT TACGCGTACG TCGCGCAGCA CGCGTTCCAT CACGTCGAGC AGCACTTGGA CAAAACGCCG AACGAATACA TCCGCTGGCG TTTCTCCACG GGCGAAGACA AGGAAAACTT GACCAAGGTC ACCGCGCAAT ACACCGAAGA GGAAGAGCGC ATGATGAAGG AAAAGATTCC GGTCCCGCAA GAGGATGGTT CCATTCTCAA GCTCGTCGTC GAGAAGATTC TTGGTCGCCG CCAAAAGAAG TCCAAGTACG AGTACGAGTG CCAATGGAAG GGTCTCTCCA TGGACTCCAA CTCTTGGATG GAGCGCGAAA AGCTCGAAAA GTATGGTTTC ACCAAGTACC TCAACCGCGT TGACGAGCGC GAAGCCGCTC GCGCGGGCTT GTACGCGCGC CCGCTCACGC AAGCGAACGT CGAGAAGCAC TTGATCGACT TTGGTTTGGA TGCCGAATTC GGTACCCACA ACCGCATCAA GGGTCTTTCC GGTGGCCAAA AGGTTAAGCT CGTGCTCGGT TCCGCCATGT GGCAGCAACC GCACATCGTC GTCATGGACG AACCGACCAA CTATTTGGAT CGCGACGCCC TCGGCGCGCT CGCGTGCGCC GTCAAGGAAT ACGACGGTGG CGTTCTTCTC ATCACGCACA ACTGCGAATT CGCCGATGCG TTGAAGGAAG AAACGTGGAA CGTTCCGGGT AATGGTTTTG TTGAAATTGA AGGTAACAAG TGGGGTCAAG GCAAGTCCGC TAAGGGTGCC AAGGTTGAAT TCGAAGTCCA AGAGGACACC GTCGACGCGC TCGGGAACAA GGTTAAGGTC AAGGGACCGA AGAAGAAGTT GTCTCGCAAG GAGATCAAGG CTATGCAAAA GACGAGAGCG GCTAAATTAG CCGCTGGCCA GGACATAACC ACAGACTCGG ATTGGGACTT GGACCAGGTT AGTTGA
|
Protein sequence | MAPSKCNVPA AAKVTPADVE GIKTEAAKKK DEDARKAAAE KITEIANAAS FAEEPYLIDL LEVAITLAGD NKSSNVRAAG DAAVAAIAPK LSEFAVRPAL RAIFVGFQSQ FWQSTMAALR VLDAFVDRNR KAVAANLPEI IPELAQVMVH MRSEVKEAST ASMAKVATCV GNLDIEPFIP TLIECINNVD EVPECVHKLA ATTFVQQVES PTLSLLGPLL QRGLFFQQTT PIKRKSAVII DNMCKLVEDP MDAAPFLPKL LPLLKRAMDE VADPECRQVC TRAYKTLLQA AGNETGAEDG QEGKGVYNEE TVSEQFIALL AESSGSSKED VTKFVQGDGV KVYFDYICSL SANALLAKNF DLDTWTKSCA TSYLKLFFSE ADAMTKSLVE RAHAVYEASK KVFVVEDEEG EDLCKCDFSL AYGALILLNN ATLHMKKGKR YGLCGPNGCG KSTLMKAINN GQVENFPPPE ELRTVYVEHD IQGDQHTMNV VEFVLSDSVI QGHGTSKESV ASTLSSFQFT DEMINGPVVA LSGGWKMKLA LARAILMKAD ILLLDEPTNH LDVKNVAWLE EYLNSQTQVS SMIVSHDSGF LDRVCTHIIH YENRKLVTYR GNLTEFVKQC PAAKKYTELS NDELKFIFPV PGFLEGVKNK DKAIVKATKC WFKYPNTTRQ IIQDATIQLS LSSRVACLGP NGAGKSTFIK LLTGEAEPDQ GTVWRHPNMR YAYVAQHAFH HVEQHLDKTP NEYIRWRFST GEDKENLTKV TAQYTEEEER MMKEKIPVPQ EDGSILKLVV EKILGRRQKK SKYEYECQWK GLSMDSNSWM EREKLEKYGF TKYLNRVDER EAARAGLYAR PLTQANVEKH LIDFGLDAEF GTHNRIKGLS GGQKVKLVLG SAMWQQPHIV VMDEPTNYLD RDALGALACA VKEYDGGVLL ITHNCEFADA LKEETWNVPG NGFVEIEGNK WGQGKSAKGA KVEFEVQEDT VDALGNKVKV KGPKKKLSRK EIKAMQKTRA AKLAAGQDIT TDSDWDLDQV S
|
| |