Gene Information |
Locus tag | OSTLU_119579 |
Symbol | AatP |
ID | 5000431 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 598915 |
End bp | 601415 |
Gene Length | 2501 bp |
Protein Length | 586 aa |
Translation table | |
GC content | 43% |
IMG OID | 640415852 |
Product | Novel AAA ATPase |
Protein accession | XP_001416188 |
Protein GI | 145342435 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00226042 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACGGT CCAGAGAGAT ACTCGAGTGC TTCTTCCCAG TATGTCAATT TTCATTTCAG CAATATGAAA ATAATCTCTG TGACGACTGT ACACGACCGG TAAAACGTTT TGGCCAGAGA GAGAAATCTA TCACAGCTTT CTCGCAGTCG GAGATGCACA ACGCCAATTT CATGCTACTG ATGCTCGATT CCCCAAAAGC TCATGGGGCA GCACTGCGCC ACACTATGTT ACGGCGCATC CAGCGCGTGT TTGCAAAACA ACAAGTTACT TGGGTAAGTT CTGTGATTCT GTATTTTCAT ACAACCTTAG CCAAAGTTGA AAGACCGGAT TTCAAGTAAC GCTACCCTGC GAAGGAAGTC CAAGAATTAA GAAACCAAGC TTCTCTCTGC AAGTAGCGAA AATTAGTTCT CGAGACGGCA ACCATTTACT TTATGAGCCT GGTGAGTGCT TTATTTCAAC ACTTCGAAGC GGCGATGACT TATTTTCGTT AGTGGACACG CGTATCGTGG TAAAAGCTCG AGAAAATGAA TTTGTACCTT ACAAGCACAC GATGGTAGTA TCACAGGATT TTCGTGCTCG ATTTGAGGCT GTGAAGTCAG CACTTATTCG TCTGTCCTCA CCGCTTAACT GCAGCGTATG TCATACCAAG ACTACAAATG ATGTTTGCTG ACTCGTCCAA AGTCAGGTCG GCTAGCGTCT GACACACCTT TACTTCTCAA ACAGTCGTCA AAGCACTTTT TGCGCTGGCT ACAGTCAAAG TTGCAGTCGA GTGTTCGCAT GCAATCACGA GTTATTTCAT GCCAATACTT AGTGTTGTTA AAATCTCAAA AACAAGTAAG GAGTGCACAG ATTCTTAGAA GATACAAAGC TAAAAACACG AGTAGATTAG TTTGATCATA GAACGAGCCC TTACGGGTAT CGACTCTGAT ACTGCTTTGA TTTTGGAAGA CGTGGATGTT CTTGTTAAAG ATAAAGATGG ACTCAATTTG CAATGTTTGC TATCTACACT GAACGACATT GCAGACTTTC GCACAATCCA AGGTGCGCTT GATCTGAGTA AGACGTTTCA ATTGACGCAT GACTCAGGCT CTTTCCATAC GTTTATAATT GGAACAACCG CAGAATTTAA GGTGCATTCA GATTTACATC GCCACTTTCC ATTGCAGGTA TGTTGAACAC GCTGAGCTGT AAGAAAAACT GTTAATTTTG TATTCTCTTC GCAAGATTCC AAGTCCTTCA CAAGACAGCG ATGTCCATGT ACAGCAGATA AGACGCGATG AAGGATTGAA GCAAATTCGT TTAAAGGTTC AGAGTGCACT ATCCATCGGA GGGCACCGAC AACAAAAGGC AATGCGAATT GTACCACTCT CTACGAGCCA ACAGGTAAGT CTTACTATGT CGTGATATTT TCAGGCTAAC ACGATCAAAT AGAACAACGG AAATGTTCGC TGGGATGACG TCGGTGGTCT AGAAGATATA AAAGCTGCTT TGTGCGACAT GCTGAAATGG CCTATTAAAT TCCCGAAGCT TTTCAAGAAT TACTCGAAAG GAGCTCTCCT TCACGGTCCA CCGGTGAGGA AGATGTAAAT ATGTCCATTC TTACGTGATG AGTGTCAATA GGGCACAGGG AAGACTCTCA TAGCAAAAGC TGCAAGTGTC GAATCAGGCT TGACATTTTT CAACGTCAAA GGACCAGAGC TTCTAGGGAT GTATGTCGGT GAAAGCGAGG TGAACTCTTT CTCTCCTCGA TGTACTTCAT TTTGACCTCT TAATAGAAAT GTGTGCGAGA ACTATTTAAC 
AAAGCCCGAG AATTGGCCCC CGCATTGATT TTCTTTGATG AATTTGATTC TCTCTGCATG GTGTGTTTCT CAGTTGGAAT GCCTGTGTTA AAGTGACGAA TCTGTAGGTT GATGCACCGC CTGGAACACT AATAAGCCGT GTCATCAGTC AAGTCGTTAC GGAATTGGAT TCGCTGAAGG TGTTCACGTC TTAGATTCCA GCACTCTTCA ATTGATAACA GTTTTCCAGG GGTCTCAAGT ATTTGTATTG GCAGCTAGCA ACAGATTAGA ACTTATTGAA CCGACAATTC TGCGGTCTGG GCGGTGAGTC AGCTCACGCT TTTTTACTCA AAAACCAACG GAAACTTTCA GACTTGACCG AGTTCTACAT GTTCCTCTTC AGAACGACGT TCGCTCAAAG GTTGCAATTT TCAAGGTTTG TTGTCAAAAC CACTGAGTCA AAGGTCGTCT TAACTTGGCA ACCCAGGCTC TAACCAAAAA CTTCATATTT TCATCAAACT TCGAGATTGG TCAGGTAATC TTAGATAGGA ACAATTGATG AAACTCTGAC TGAAAATTGT CTCAAGTTGG CGTTAGTTAG TGAAGATGTT AGTGGCGCTG ACATCTACTC TGCGTGTGTA CGAGCTTGGA TCAAGGCTGC CAAGCGGGCG ATAGCGTACA ATTCCACCGG TATAGTGGTT CAGACGATGG ATTTCTATTA A
|
Protein sequence | MARSREILEC FFPVCQFSFQ QYENNLCDDC TRPSEMHNAN FMLLMLDSPK AHGAALRHTM LRRIQRVFAK QQVTWTGFQV TLPCEGSPRI KKPSFSLQVA KISSRDGNHL LYEPVDTRIV VKARENEFVP YKHTMVVSQD FRARFEAVKS ALIRLSSPLN CSSGRLASDT PLLLKQSSKH FLRWLQSKLQ SSVRMQSRVI SCQYLVLLKS QKQISLIIER ALTGIDSDTA LILEDVDVLV KDKDGLNLQC LLSTLNDIAD FRTIQGSFHT FIIGTTAEFK VHSDLHRHFP LQIPSPSQDS DVHVQQIRRD EGLKQIRLKV QSALSIGGHR QQKAMRIVPL STSQQNNGNV RWDDVGGLED IKAALCDMLK WPIKFPKLFK NYSKGALLHG PPGTGKTLIA KAASVESGLT FFNVKGPELL GMYVGESEKC VRELFNKARE LAPALIFFDE FDSLCMVDAP PGTLISRVIS QVVTELDSLK GSQVFVLAAS NRLELIEPTI LRSGRLDRVL HVPLQNDVRS KVAIFKALTK NFIFSSNFEI GQLALVSEDV SGADIYSACV RAWIKAAKRA IAYNSTGIVV QTMDFY
|
| |
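The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end positions are 1-based and inclusive, which is consistent with the listed gene length. It also assumes that the gene span runs from the start codon to the stop codon with no untranslated regions, which fits the listed sequence beginning with ATG and ending with TAA. The function and variable names are hypothetical. Under those assumptions, the difference between the gene span and the coding length gives a rough estimate of total intron length for this spliced gene.

```python
# Values copied from the record above.
START_BP = 598915
END_BP = 601415
GENE_LENGTH_BP = 2501
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_span = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

# Assumption: the gene span contains only coding exons and introns (no UTRs),
# so whatever is not coding sequence is treated as intron.
estimated_intron_bp = gene_span - cds_bp

print(f"Span from coordinates: {gene_span} bp (record lists {GENE_LENGTH_BP} bp)")
print(f"Coding length incl. stop codon: {cds_bp} bp")
print(f"Estimated total intron length: {estimated_intron_bp} bp")
```

The coordinate span agreeing with the listed gene length supports the inclusive-coordinate reading. The intron figure is an estimate that holds only if the stated assumptions do.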