Gene OSTLU_16944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16944 
Symbol 
ID5004244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp21942 
End bp24869 
Gene Length2928 bp 
Protein Length975 aa 
Translation table 
GC content52% 
IMG OID640419665 
Productpredicted protein 
Protein accessionXP_001420056 
Protein GI145351375 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0730673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAAGA AGAAGCGCCC GTCCAATCCC GGCGGCAGCG GGTCGAAGCG AAAGTTTTTC 
CGCGCCGATG GCGACGCGAG CGGTCCCAGC CCGAGCGGTG TGACGGTGAA GAGCGCGTTC
GAGACTGTCA GCCATCGACG AAAATTCGAC ATTCTCGGTC GCAAGGTGAA GGGCGAGCGA
GGTAACGTGC TGCAAGCGCG CACGGAGGCT ATTGAAAAAC GTCGGCGCAC GCTGCTCGTG
GAACACGAAG CGAACGGCAA GGCGAACGCA TTCTTGGACC GACGATTCGG TGAACACGAC
AGTGGCTTAA GCGTCGAAGA GAAAAATATT GGAAGATTGG CCAAGGCGCG GTTGCGTCAG
TATAAGAAAT CCAAAGCCTT CGCCCTGAAT GAAGACGATG ACGACGACGG CTTTAAAACG
CTCACGCATT TGGGACAGCC GCTCGGGGAG AGAGATTTAA CCGCGAAACC GTCGCACGGG
GACGAGGAAG ACGATGAAAA TCTAGACGAC GAAATCACGC GTAATTTCCA TTTCGGTGGA
GGCGAATTTG AACCAACTTT GAAGCGAGGC GATGGGGAGC ACGATGTGGA TGCGCCGGAA
CGCCGAAAGA GCAAAAAGGA TGTGATGGAG GAACTCATAG CCAAGAGTAA GTTCTACAAA
GCTGAGAAGC AGAAGCAGCG AGATGACGAT GGAGATATGT TGGATAAGTT GGACTCCGAT
TTTAAGGCGA TAAGTCAAGG TGGACTATTG TCGAGCGCGT TGAGAAAAGC TGTCGGGCAC
ATGAAGCCGA CCGCCGCGAA GGCTGCACAA ACCAAGGCTC CCGAGCAGAA GGATGAATAC
GACACGTGGG CCCGAACTTT AGCGTTCGAA AGACGTGGTC AGGCGGGTGA TAGAGCAAAG
ACGCAGGAAG AAGTTGAGGC GCAAGCAAAG CTCGCGCTAG AGCAAGCCGA ACGCAAGAGG
TTGAAGCGCA TGCGCGACGT CGGCTCTGAC GATGAAAGTT CCGACGACGA CGGCCCTCAG
GGAGGATACG CCGCTCGAAG GCGCAAGGTG AAAAAGGGTG ACATCGACGA AACAGACGAC
GCATTACACG GCAATAATCG CAAGCAGCAT GAAGGAGGTG AAGATTTGGA CGAGAATTTT
CAGCTCGAAA GTGATGACGA TGGAGAGAAC GAGGACGAGG GTTCCGAAGA GGATGTGTCG
GACGACGAAT CTGAATCAGA AGATGACGAC GCGCTCGATG ACGCAGCGCG TCTGCGCAAG
TCACTCAAGT CTGAGCTAAA AAATGTCGAT ACAGAATTAG ATCAAGGTAA GAATCGACTT
CGAAAGTTGG GTATTCTTCA AGATGGCGTC GAGGCTGAGG ATTCGGAGGA CGATGAGGAC
GACGACGATG AGGATGAAGA CGACGACGAT GAGGGCAGCG ACGATTCTCA GGAAGTAGAA
GATGCACATG CAGATGTTTT GCGAGAGATT GAAGACGAGG AAGACGTTGA AAAAGAAGAT
GCGTCGAACA CAGCGCGCAA ATCTTCCCAA AAGGAAGCGT CCACGGCGAA AGAGAAGAAG
TTTTCTACGC CGACGAGAAC GGACATCCCC TTCACTTTCC CCATGCCAGA GAGCGCCGAA
GATCTTAATT CTATATTAGG TGATCATAAC GCTGAAGATG CGTTTACGAT CATCACGCGA
ATCCGGGCCT GCAACGCGCC CACGCTCGCG GCGGAGAATA GAAAGCGAAT CCAGACTTTG
CTCGGTCTTC TGTTACAACG GTTTGAGATT TTAGCCGGTC AAGCGCCGTT GCCGGTAGAT
CATTTGGACG TTCTATCGAA ACACATTGTA GACTTGAGTA CACAAGTTCC ATTTTTCGCT
GCAACCGCAG CCAAGGCCCG CGTAGAAAAG ATGAGCACTC GGCTTCGTCA GGCGCTGCGT
GCCGGGGAGA CGGGATGGCC GCCTTCTCGT ACGGTTCTGC TCCTTTCTTT ATTCGCCGCC
ATCTTTCCTA CGACGGATAA GTCACACCCA GTTATGACTT CGGCGACACT TTATATTGGA
AACTTACTCG CGCATTGCGC GATCAGATCA GTGAGAGATG CAGCTTTGGC GGTCATCCTT
TCTACAATGG CAAGCGTTTA TTCGACTGGC GCCGAAAGGA TATTTCCTGA AGCCCTCACT
CTCATGAATG CTTTGATTCA TTGCGCATCG CGATCAAAGA CCAATTGGGC GGCGGGTTTG
TCAACACACC TAGTCGAACA AGTTGGCGGA CCCTGGTTAT CTTCAGCGCT TACGTCTGCG
ATGGAACCCA TGACGTTGCC AGAAATGTTG GATGGCATTT ACGCCGAAAA ACTCGAAGAG
AAGAAGTTAT CGGCCGCAAC ACTAAGGGCT GCTCTTTCTT GCTTAAGGCA ACTATCTAAA
CCAGTGATAA AGACTGCGTC AGCGTCTGAG ATTCTCTCAC CTGTTCGTGA TTCCGTGAAG
GCGCTTAGAA AATCGTTGAA AAAGTCGAAC GGTGGTTTAG CAGAGTTGTG CGACGAGCTC
GTCAAGGAGT TAGACGACGC TCTCGTGGGC GCGGTCAAAA CGCCTTTGGC GTACCACACC
AAGACGGCGG AGGCAATCAA AACGTTCAAT CCCATGTACG AGGAAGACGG CTATCAAAAA
GGTCGGGACT ACGATCCGAA TCGCGAACGA GCCGAGGCAA GAAAGCTCAA GAAGCAAGTC
AAACAAGAAA CTCGCGGCGC CATGCGCGAA TTGCGCAAGG ACAATCGATT TATGGCAGAT
GCTCGTTCGA AGGAACAGTT CCAAGCCGCG GAAGAACGCG GGGCTCGCCA GAAAGACATT
TTATCATTCT TGGAAAAGCA AGAAGCCGAT TTCAAATCTG GAGGACAAGG TGGCCAAATT
GTCAAGAACA AACGCCGCGT GTCGAAAGGC TCGAGACGAG CCTTCTAG
 
Protein sequence
MAKKKRPSNP GGSGSKRKFF RADGDASGPS PSGVTVKSAF ETVSHRRKFD ILGRKVKGER 
GNVLQARTEA IEKRRRTLLV EHEANGKANA FLDRRFGEHD SGLSVEEKNI GRLAKARLRQ
YKKSKAFALN EDDDDDGFKT LTHLGQPLGE RDLTAKPSHG DEEDDENLDD EITRNFHFGG
GEFEPTLKRG DGEHDVDAPE RRKSKKDVME ELIAKSKFYK AEKQKQRDDD GDMLDKLDSD
FKAISQGGLL SSALRKAVGH MKPTAAKAAQ TKAPEQKDEY DTWARTLAFE RRGQAGDRAK
TQEEVEAQAK LALEQAERKR LKRMRDVGSD DESSDDDGPQ GGYAARRRKV KKGDIDETDD
ALHGNNRKQH EGGEDLDENF QLESDDDGEN EDEGSEEDVS DDESESEDDD ALDDAARLRK
SLKSELKNVD TELDQGKNRL RKLGILQDGV EAEDSEDDED DDDEDEDDDD EGSDDSQEVE
DAHADVLREI EDEEDVEKED ASNTARKSSQ KEASTAKEKK FSTPTRTDIP FTFPMPESAE
DLNSILGDHN AEDAFTIITR IRACNAPTLA AENRKRIQTL LGLLLQRFEI LAGQAPLPVD
HLDVLSKHIV DLSTQVPFFA ATAAKARVEK MSTRLRQALR AGETGWPPSR TVLLLSLFAA
IFPTTDKSHP VMTSATLYIG NLLAHCAIRS VRDAALAVIL STMASVYSTG AERIFPEALT
LMNALIHCAS RSKTNWAAGL STHLVEQVGG PWLSSALTSA MEPMTLPEML DGIYAEKLEE
KKLSAATLRA ALSCLRQLSK PVIKTASASE ILSPVRDSVK ALRKSLKKSN GGLAELCDEL
VKELDDALVG AVKTPLAYHT KTAEAIKTFN PMYEEDGYQK GRDYDPNRER AEARKLKKQV
KQETRGAMRE LRKDNRFMAD ARSKEQFQAA EERGARQKDI LSFLEKQEAD FKSGGQGGQI
VKNKRRVSKG SRRAF