Gene OSTLU_37691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_37691 
Symbol 
ID5006074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009370 
Strand
Start bp310752 
End bp312827 
Gene Length2076 bp 
Protein Length691 aa 
Translation table 
GC content61% 
IMG OID640421495 
Productpredicted protein 
Protein accessionXP_001422034 
Protein GI145355574 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.556949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0104163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCG AACGCCTGGA GAAAAGCAAG GATAAGAAGC GAGGGAAAAA AGGTAAGGGG 
CATCGTCGCA AGCGGAAGGG TGACCGAGAC AGCGACAGTG AGAGTTTGGA TGAGGATGAG
TTTGGACGAA CGACGAGCGC GAAGGCGTTT GCGCAAATGA CGTCGCCTCG CTCGGTGAAG
TTGTCGGATT TAGGCGGGAT CGAGGACTCG CTGAAGGACA TCAAGGAGCT CATACTGTGC
CCTTTGATGC ATCCCGAGTT GTACGCATGG CTCGGGGTCG ATCCGCCGCG CGGGGTGTTG
CTTCACGGCC CGCCGGGTTG CGGCAAGACG ACGTTGGCGC ACGCCATCGC GCAAGAGGCG
AAAGTGCCCT TCTTCTCCAT AGCGGCCACG GAGATTGTGA GTGGAATGAG TGGCGAGTCT
GAAGCGAAGA TACGTGAGTT GTTCCAGTCC GCCGCCGCGC ACGCGCCGTC GCTGATTTTC
ATCGACGAGA TCGACGCAAT CGTCCCGAAG CGCGAGAGCG CGCAGCGAGA GATGGAACGT
CGAATTGTCG CCCAGCTGTT AGCGTCCATG GACGATCTTC AATCAACAAT CGATGGCACG
GACGAGGTGG ATCGACTAGC GCGGTGTCGT CGCCACGTCA CCGTCATCGG CGCCACGAAT
AGACCGGACG GTATGGACGC CGCGCTTCGT CGCGCCGGAC GCTTTGATCG CGAAATCATG
CTCGGCATTC CAGACGAAGC CGCGCGAGAG CGCATTTTGC GAGTGCAGGC GACCAAGCTT
CGCTTGAATG GAGATTTAGA CTTGCGCGAA ATCGCAAAGA AAACGCCCGG CTATGTCGGC
GCGGATTTAT CGGCGTTGGC CAAGGAAGCC GCCGCGTCGG CGGTCACGCG CATCTTTAAA
AAGCTCGAGG ACGAGGAAAG GGCGAGCGCG GATGTGACGA TGGACGAGGG TGTCGCGCCC
GCACTGGGGG GGGACACTCG TCTCGCGACT GGTCGCTTAG CGGATCCGCG TCCGCTCACC
GAGGACGAAC TCGAGGATCT AGCAATCACC ATGGAAGATT TCTCCCTCGC GCTCACGCGC
GTGCAACCGT CGGCGCAACG CGAAGGTTTC ACCACGACGC CGAACGTGAC TTGGGACGAC
GTTGGCTCGC TCACAGAAAT TCGCGAAGAG TTGAAGTTCT CCATTGCTGA GCCCATCGCT
CATCCCGAGC GATTCCAAGC GATGGGTTTG AACATCTCTA CGGGCGTCTT GCTCTACGGC
CCACCGGGGT GCGGCAAAAC GCTCGTCGCC AAGGCGACGG CGAACGAGGC GATGGCGAAT
TTCATATCCA TCAAAGGTCC AGAGTTATTA AATAAGTACG TCGGTGAGAG CGAGCGCGCG
GTGCGGACGC TGTTCCAGCG CGCACGAAGT GCGAGCCCGT GCGTGTTATT CTTTGACGAG
ATGGATTCTC TGGCGCCGCG TCGCGGAAGC GGCGGCGACA ACACCTCAGC CGAGCGCGTC
GTGAACCAAC TTCTCACCGA GATGGACGGT CTCGAAGCGC GAAACGCGAC GTTCTTGATC
GCGGCGACGA ACCGACCCGA CATGATCGAT CCAGCGATGC TGCGTCCCGG GCGCTTGGAC
AAGCTCTTGT ACGTTCCGTT GCCGCCGCCG GACGGCCGAG TCGCCATCTT GAAGACGCTC
ACGCGCCGAA CGCCCATCGC ACCAGACGTA CGCGTGGATC AAATCGCGCT CGGTCGATCG
TGCGAAGGCT TCAGCGGCGC CGACTTGGCG GCGCTCGTGC GCGAAGCGTG CGTGGCGGCG
TTGAAATCGA TGACGCTCGA ATCGACGCCG ACGGTGACGA CGAAGCACTT CGAAGAGGCG
TTCACGAAGG TGCAACCCTC GGTGAGCAAG TCGGATCACG CGCGTTACGA TGAATTGCGT
CGAAAGCTCC GTCGCGAGCG CGGGACGATC AACAGCGCGC GCCGCTCTTC CTCCGCCGAA
AATCTCGCCG TCGAGCCCGC GTCCAACAAG CGCGTTCGCC CCGGCGACGA CGACGACCGC
GACGACCGCG ACGCGCCCGA ATTAGCCACC TCTTAG
 
Protein sequence
MKIERLEKSK DKKRGKKGKG HRRKRKGDRD SDSESLDEDE FGRTTSAKAF AQMTSPRSVK 
LSDLGGIEDS LKDIKELILC PLMHPELYAW LGVDPPRGVL LHGPPGCGKT TLAHAIAQEA
KVPFFSIAAT EIVSGMSGES EAKIRELFQS AAAHAPSLIF IDEIDAIVPK RESAQREMER
RIVAQLLASM DDLQSTIDGT DEVDRLARCR RHVTVIGATN RPDGMDAALR RAGRFDREIM
LGIPDEAARE RILRVQATKL RLNGDLDLRE IAKKTPGYVG ADLSALAKEA AASAVTRIFK
KLEDEERASA DVTMDEGVAP ALGGDTRLAT GRLADPRPLT EDELEDLAIT MEDFSLALTR
VQPSAQREGF TTTPNVTWDD VGSLTEIREE LKFSIAEPIA HPERFQAMGL NISTGVLLYG
PPGCGKTLVA KATANEAMAN FISIKGPELL NKYVGESERA VRTLFQRARS ASPCVLFFDE
MDSLAPRRGS GGDNTSAERV VNQLLTEMDG LEARNATFLI AATNRPDMID PAMLRPGRLD
KLLYVPLPPP DGRVAILKTL TRRTPIAPDV RVDQIALGRS CEGFSGADLA ALVREACVAA
LKSMTLESTP TVTTKHFEEA FTKVQPSVSK SDHARYDELR RKLRRERGTI NSARRSSSAE
NLAVEPASNK RVRPGDDDDR DDRDAPELAT S