Gene OSTLU_119579 details

Gene Information

Locus tag: OSTLU_119579
Symbol: AatP
ID: 5000431
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 598915
End bp: 601415
Gene Length: 2501 bp
Protein Length: 586 aa
Translation table:
GC content: 43%
IMG OID: 640415852
Product: Novel AAA ATPase
Protein accession: XP_001416188
Protein GI: 145342435
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00226042
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACGGT CCAGAGAGAT ACTCGAGTGC TTCTTCCCAG TATGTCAATT TTCATTTCAG 
CAATATGAAA ATAATCTCTG TGACGACTGT ACACGACCGG TAAAACGTTT TGGCCAGAGA
GAGAAATCTA TCACAGCTTT CTCGCAGTCG GAGATGCACA ACGCCAATTT CATGCTACTG
ATGCTCGATT CCCCAAAAGC TCATGGGGCA GCACTGCGCC ACACTATGTT ACGGCGCATC
CAGCGCGTGT TTGCAAAACA ACAAGTTACT TGGGTAAGTT CTGTGATTCT GTATTTTCAT
ACAACCTTAG CCAAAGTTGA AAGACCGGAT TTCAAGTAAC GCTACCCTGC GAAGGAAGTC
CAAGAATTAA GAAACCAAGC TTCTCTCTGC AAGTAGCGAA AATTAGTTCT CGAGACGGCA
ACCATTTACT TTATGAGCCT GGTGAGTGCT TTATTTCAAC ACTTCGAAGC GGCGATGACT
TATTTTCGTT AGTGGACACG CGTATCGTGG TAAAAGCTCG AGAAAATGAA TTTGTACCTT
ACAAGCACAC GATGGTAGTA TCACAGGATT TTCGTGCTCG ATTTGAGGCT GTGAAGTCAG
CACTTATTCG TCTGTCCTCA CCGCTTAACT GCAGCGTATG TCATACCAAG ACTACAAATG
ATGTTTGCTG ACTCGTCCAA AGTCAGGTCG GCTAGCGTCT GACACACCTT TACTTCTCAA
ACAGTCGTCA AAGCACTTTT TGCGCTGGCT ACAGTCAAAG TTGCAGTCGA GTGTTCGCAT
GCAATCACGA GTTATTTCAT GCCAATACTT AGTGTTGTTA AAATCTCAAA AACAAGTAAG
GAGTGCACAG ATTCTTAGAA GATACAAAGC TAAAAACACG AGTAGATTAG TTTGATCATA
GAACGAGCCC TTACGGGTAT CGACTCTGAT ACTGCTTTGA TTTTGGAAGA CGTGGATGTT
CTTGTTAAAG ATAAAGATGG ACTCAATTTG CAATGTTTGC TATCTACACT GAACGACATT
GCAGACTTTC GCACAATCCA AGGTGCGCTT GATCTGAGTA AGACGTTTCA ATTGACGCAT
GACTCAGGCT CTTTCCATAC GTTTATAATT GGAACAACCG CAGAATTTAA GGTGCATTCA
GATTTACATC GCCACTTTCC ATTGCAGGTA TGTTGAACAC GCTGAGCTGT AAGAAAAACT
GTTAATTTTG TATTCTCTTC GCAAGATTCC AAGTCCTTCA CAAGACAGCG ATGTCCATGT
ACAGCAGATA AGACGCGATG AAGGATTGAA GCAAATTCGT TTAAAGGTTC AGAGTGCACT
ATCCATCGGA GGGCACCGAC AACAAAAGGC AATGCGAATT GTACCACTCT CTACGAGCCA
ACAGGTAAGT CTTACTATGT CGTGATATTT TCAGGCTAAC ACGATCAAAT AGAACAACGG
AAATGTTCGC TGGGATGACG TCGGTGGTCT AGAAGATATA AAAGCTGCTT TGTGCGACAT
GCTGAAATGG CCTATTAAAT TCCCGAAGCT TTTCAAGAAT TACTCGAAAG GAGCTCTCCT
TCACGGTCCA CCGGTGAGGA AGATGTAAAT ATGTCCATTC TTACGTGATG AGTGTCAATA
GGGCACAGGG AAGACTCTCA TAGCAAAAGC TGCAAGTGTC GAATCAGGCT TGACATTTTT
CAACGTCAAA GGACCAGAGC TTCTAGGGAT GTATGTCGGT GAAAGCGAGG TGAACTCTTT
CTCTCCTCGA TGTACTTCAT TTTGACCTCT TAATAGAAAT GTGTGCGAGA ACTATTTAAC
AAAGCCCGAG AATTGGCCCC CGCATTGATT TTCTTTGATG AATTTGATTC TCTCTGCATG
GTGTGTTTCT CAGTTGGAAT GCCTGTGTTA AAGTGACGAA TCTGTAGGTT GATGCACCGC
CTGGAACACT AATAAGCCGT GTCATCAGTC AAGTCGTTAC GGAATTGGAT TCGCTGAAGG
TGTTCACGTC TTAGATTCCA GCACTCTTCA ATTGATAACA GTTTTCCAGG GGTCTCAAGT
ATTTGTATTG GCAGCTAGCA ACAGATTAGA ACTTATTGAA CCGACAATTC TGCGGTCTGG
GCGGTGAGTC AGCTCACGCT TTTTTACTCA AAAACCAACG GAAACTTTCA GACTTGACCG
AGTTCTACAT GTTCCTCTTC AGAACGACGT TCGCTCAAAG GTTGCAATTT TCAAGGTTTG
TTGTCAAAAC CACTGAGTCA AAGGTCGTCT TAACTTGGCA ACCCAGGCTC TAACCAAAAA
CTTCATATTT TCATCAAACT TCGAGATTGG TCAGGTAATC TTAGATAGGA ACAATTGATG
AAACTCTGAC TGAAAATTGT CTCAAGTTGG CGTTAGTTAG TGAAGATGTT AGTGGCGCTG
ACATCTACTC TGCGTGTGTA CGAGCTTGGA TCAAGGCTGC CAAGCGGGCG ATAGCGTACA
ATTCCACCGG TATAGTGGTT CAGACGATGG ATTTCTATTA A
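
The gene sequence above is displayed in space-separated blocks of 10 bases, so it needs to be cleaned before any downstream use. The following is a minimal Python sketch, not part of the original record, showing one way to strip the display grouping, compute a GC fraction, and check that the reported gene length matches the start and end coordinates. The helper names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information section of this record.
print(span_length(598915, 601415))  # 2501, matching the listed Gene Length

# First display row of the gene sequence, used here only as a short demo input.
first_row = "ATGGCACGGT CCAGAGAGAT ACTCGAGTGC TTCTTCCCAG TATGTCAATT TTCATTTCAG"
print(len(clean_sequence(first_row)))  # 60 bases in one full display row
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_fraction` would give a value to compare against the GC content listed in the record.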
 
Protein sequence
MARSREILEC FFPVCQFSFQ QYENNLCDDC TRPSEMHNAN FMLLMLDSPK AHGAALRHTM 
LRRIQRVFAK QQVTWTGFQV TLPCEGSPRI KKPSFSLQVA KISSRDGNHL LYEPVDTRIV
VKARENEFVP YKHTMVVSQD FRARFEAVKS ALIRLSSPLN CSSGRLASDT PLLLKQSSKH
FLRWLQSKLQ SSVRMQSRVI SCQYLVLLKS QKQISLIIER ALTGIDSDTA LILEDVDVLV
KDKDGLNLQC LLSTLNDIAD FRTIQGSFHT FIIGTTAEFK VHSDLHRHFP LQIPSPSQDS
DVHVQQIRRD EGLKQIRLKV QSALSIGGHR QQKAMRIVPL STSQQNNGNV RWDDVGGLED
IKAALCDMLK WPIKFPKLFK NYSKGALLHG PPGTGKTLIA KAASVESGLT FFNVKGPELL
GMYVGESEKC VRELFNKARE LAPALIFFDE FDSLCMVDAP PGTLISRVIS QVVTELDSLK
GSQVFVLAAS NRLELIEPTI LRSGRLDRVL HVPLQNDVRS KVAIFKALTK NFIFSSNFEI
GQLALVSEDV SGADIYSACV RAWIKAAKRA IAYNSTGIVV QTMDFY