Gene OSTLU_31752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_31752 
Symbol 
ID5002097 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp476504 
End bp479707 
Gene Length3204 bp 
Protein Length1048 aa 
Translation table 
GC content60% 
IMG OID640417518 
Productpredicted protein 
Protein accessionXP_001417784 
Protein GI145346620 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0703442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGA AACTCGGCAG TCGCAAAGAT CGCGTCGTCA CGTTCGTCGT CACCGTGCGC 
ACGCTCGAAC CGTGGCCCAC GCGCGCGTCG GGCCGTCCCG GACAGTTCGC CATCGGTTGG
CAGCGCGGCG CGAATAAACG CGGAACGACG CCCGTGCGCG CGGGCGAACG CTCCGATGAC
GGCGCGCGCG CGACGTACGC GTTCGATCAC ACGTTTGAGG TCGAGGCGAC GGTGCGCAGG
GCGGGAAAGA CGGGACACAA GGAGAAAACA CTGACGCTGT ACGTGTTGGC GCTGCCCGAA
GACGCGACGC GCGGACGCGA GGTGACGGCG GCGAAGGTCG GGGCGTGCGA CGTCGATCTG
GCGAAATACG TCGATCGCAC GGAGGATGAG ACGATAATGA TCGACGTGGA GTGCGGCGAA
GGGGTGCGAC GGGCGGTGGG GACGCCGAAG CTTTCGATTT CCGTGCGCGC GAAGGAGGGC
GGCGCGAACG CGGAGGGACG CGAGGGCGAG GCGAACGCGG CGAACGCGTC GCCGTTGAAA
TCACCGACGG CGAGCGAACG CGGGGGCGAG TACCAGTGGG CGTCTTCGAG GTTTCAGTCG
GAGCAGGCGA GCTCTTCGCA GGCGGATCAA GTGGAGGCGT TGACGAGCAT GGCGAGCATG
TTCAAAAAGC GCGCCGCGCC CGTCGAGGAA ACCTCGGTCG TCGACGACGT CGTCGACGAG
GGTTCCGAAG CCACGGCCAA TGACGAAGCC GACGTTGAAG CGAATACGGC TCGAGCGGAA
CTAATCGTCG CGCGGGCGGC GACGCCGAAA TCGGACGAAT TTTCGTCGAC GCCCGACGGC
ACTCCCGCGT TGGAGCAAGA AGATCCAGAG TTGACGCGAG CGCGCGACGA ATTGTTTGGT
GCGCCACCAA AAGATGCCGC GACGTCACCA CGCGGTGACG TCGACTCCGA TGGTTTCTTG
CTCGATTCGG ATCTGGACAC CGAGGGCGAA GCGGACGAAG AGACCCCAGC GGAGTTCACT
CGTCCAGTCG AAGAGTCGCC GGCGCGAGAC GACTCTGAAA ACGCCGCGGA GCAAGCGAGA
TTGGCCGAAG AAGAGGCGAG AATACGCGCT GAAGAAGACG CTGCTGTAGC TCGCATCGAA
GCCGAACGCA AGGCATTTGA AGAAGAGGAG CGCCAGCTCG AAGAGCAAGC GCGACTCGAA
GCTCAGCGAG CAGAGGAAGA AAGAGTTCGC GTCGACGAGG AAGCGAGGTA CGCACGCATG
GAAGCCGAAC GCGCGCAAGC GGAAGAAGAA GCTCGAAGGT TGGCGGAAGA GGACGCGCTC
TTCGCTGAAA ATGCCGAGTA TCAAAGACGC GCAGAGGAAG AGCAGCGCCT GCGCGCAGAG
GAAGAGCAGC GCCTGCGCGC AGAGGAAGAG CAGCGCCTGC GCGCAGAGGA AGAGCAGCGC
CTGCGCGCAG AGGAAGAGCA GCGCCTGCGC GCAGAGGAAG AGCAGCGCCT GCGCGCAGAG
GAAGAGCGAC GATGGGCTAT GGAAGCCGAG GCTGAGCGCG CGCGCATCGA AGAAATTGAA
AAGGCCCGAG CTCATGAAGC CGAAGCTGCG CGTAGAGCCG TGGAAGATGA AGACGTGGCG
GCGGCGCAAA TCGCTTCGAT TGCACAGCAA GAACGTCAAC AGCTGGCAGA AGAGGAAGCG
ATTAGAGCCG CTCAGGAAGA AGAGGAGCGA CAGCGATTAG AAGATGAAAA TCTTCGCACA
TCTGAACACG AGGCTCGATT GCAAGAGGAA CGAGAACTTC AAGCCGAAGA GAACGCGAAG
GCGGCTGCAC GCGGCGACAT TGATGTGTAC GCTAAAGCAG TCATCACCGA GGGCGCGTCA
TCTGTCTTAT TTTCGGACTC TGCTGACGAC GTGACTGCGT TTGGTACACC GTCATCGCAC
AGCGACGACG CGTTCTACAC TCCAGCCACT CGCGGGACGC GTTTCGCGGA TTCGGTGCTT
AAATCATCTC GCAATCGCGA TTTGGAACAC GAGATCGTGA GTATGTCCAT TTGTGATATT
CTCATTCACA GCACTGCGGA GGATTCGAGC TTCACTACCG CTCTTGGTCT TCAAGAACGA
ATTGCGAGCG TGCGCGCCAC GCTCGGAGAG CGCGAGTCTC AACTCGAGTT CAACAGAATC
ACGGACGCTT TCGGTGTCGC GATTAAAGGG GCGATGCATA ATCCAGCACG ACTCGTGTTC
TTATGTGCTC AGCTTATCGC GCTGCGAATT TGCGTGGCCA CGATGGATGA CTTGGACACG
CGCGACGTGA TCGAACTCGA AGTCTTGGCT CGAAACGCCG CGTTTGAGTC GCTCTGGAAG
CACACTTGCA GTGCATTAGT AAACCCTGGC GAAGTGACGG AAACGCTCGC ACACTTCATG
AAATCCTTCT GTGGGCCTTC GCCGAATGGA GATGGTGAGA AGATAGGTCG CGCCTGGTCT
GCAATGTTTC AGCTCGCCAA GACGCGACTC GACATCATCG GTGGCGACGC CGACGACGCC
GGCTGCTCTT CGCAACTACT ATTGACGCAG CTCCGACAAG GCATCTTGAA GGAGATTATT
CTTGCACTTG ACAAGTCTGT ATTAGATGCA TTGATTCATC CGTCTGGCGA TGCCCTGGCA
AACCCGATGA TACCCGGTGG CGGCGCGTTG ACGTTTTCAG CGGGTGCCGA ATTGAAGCGA
GCAATTTCTG TACTCGCCAG CGTCGCCAAA GATCTCAACG TTGGCACGAG TACTGAATCG
ATCATCCCGA AACTCAGAGC CGTCGCAGAT GTGTGCATGA TTCCGAAGGA CGCATTGATT
GACGTCAAGC TTCGCACGGA TATCGTGTGC GGCAAACTCA CGGACGAGGA ACTTGCCAGC
GTCGTCTCGC GATTCCGCCC TGACGATTTC GCGCCCCAAC CCGTGGACCC GGACGTCATC
TCCGCCGTCG TCGACGCGGC GACGAATGGA AAGGGTGACA CGCCCCCCGC GATCGGACCT
TACACCCCAA TGAGCACGGA AGGCGCGCCG TGGATCGCCA ACTTGGCCCG AGCGCTCGCC
GCTTTCGACG GCGTTTTGCA GTCGCGAGCG CCCGGTCCCA GCGCGCACGC CACGCGTTGG
TCCCTAGTCG CCGACGCTCT GCCTTAAGAA AAGCGATGAA GCGTTGAACA AAGGCGCGTC
ATTCACCATA GCGACGCGTT ATTC
 
Protein sequence
MLKKLGSRKD RVVTFVVTVR TLEPWPTRAS GRPGQFAIGW QRGANKRGTT PVRAGERSDD 
GARATYAFDH TFEVEATVRR AGKTGHKEKT LTLYVLALPE DATRGREVTA AKVGACDVDL
AKYVDRTEDE TIMIDVECGE GVRRAVGTPK LSISVRAKEG GANAEGREGE ANAANASPLK
SPTASERGGE YQWASSRFQS EQASSSQADQ VEALTSMASM FKKRAAPVEE TSVVDDVVDE
GSEATANDEA DVEANTARAE LIVARAATPK SDEFSSTPDG TPALEQEDPE LTRARDELFG
APPKDAATSP RGDVDSDGFL LDSDLDTEGE ADEETPAEFT RPVEESPARD DSENAAEQAR
LAEEEARIRA EEDAAVARIE AERKAFEEEE RQLEEQARLE AQRAEEERVR VDEEARYARM
EAERAQAEEE ARRLAEEDAL FAENAEYQRR AEEEQRLRAE EEQRLRAEEE QRLRAEEEQR
LRAEEEQRLR AEEEQRLRAE EERRWAMEAE AERARIEEIE KARAHEAEAA RRAVEDEDVA
AAQIASIAQQ ERQQLAEEEA IRAAQEEEER QRLEDENLRT SEHEARLQEE RELQAEENAK
AAARGDIDVY AKAVITEGAS SVLFSDSADD VTAFGTPSSH SDDAFYTPAT RGTRFADSVL
KSSRNRDLEH EIVSMSICDI LIHSTAEDSS FTTALGLQER IASVRATLGE RESQLEFNRI
TDAFGVAIKG AMHNPARLVF LCAQLIALRI CVATMDDLDT RDVIELEVLA RNAAFESLWK
HTCSALVNPG EVTETLAHFM KSFCGPSPNG DGEKIGRAWS AMFQLAKTRL DIIGGDADDA
GCSSQLLLTQ LRQGILKEII LALDKSVLDA LIHPSGDALA NPMIPGGGAL TFSAGAELKR
AISVLASVAK DLNVGTSTES IIPKLRAVAD VCMIPKDALI DVKLRTDIVC GKLTDEELAS
VVSRFRPDDF APQPVDPDVI SAVVDAATNG KGDTPPAIGP YTPMSTEGAP WIANLARALA
AFDGVLQSRA PGPSAHATRW SLVADALP