Gene OSTLU_30641 details


Gene Information

Locus tag: OSTLU_30641
Symbol:
ID: 5000571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 511280
End bp: 513131
Gene Length: 1852 bp
Protein Length: 614 aa
Translation table:
GC content: 55%
IMG OID: 640415992
Product: predicted protein
Protein accession: XP_001416969
Protein GI: 145344914
COG category:
COG ID:
TIGRFAM ID:
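
The Gene Length value agrees with the Start bp and End bp fields when both ends are counted: 513131 - 511280 + 1 = 1852 bp. A minimal Python sketch of that check follows. The function name `gene_length` is hypothetical, and 1-based inclusive coordinates are an assumption that the numbers in this record are consistent with, not a documented convention of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based coordinates with both ends included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
print(gene_length(511280, 513131))  # 1852
```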


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0313408
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.102836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGTTTT TTCGAAAGAA GACGTCGACG CGAGCGTCTT CGGACGAGGG GAGGGAGAAC 
GCGAGCGCGG AGGGGAGACG GTCGCTTCGG GACGGCGCGG ACGGGACGAC GTCGCCGAAG
GGCGCGCGGA GCGGGGAGCT GAGCGAGAAC GGGTCGACTC GGGCGGTGGG GACCGAGCGG
ATGTCGCAGT CGAGCGAATT GGATGATTAT GAGTCTCGCG ATACGCTGAC GGAGTTCGCG
ACGTCGAGCG CGAGCCCGCG GGTACCGCCG GGGCGAAAGA GTCTTCGAAG ACGGTCGCTC
GAACTGTACA GGCAAGCCGC CGCCGGCGCG ACGATGACAA ACTCGAAGAA AGCGTTGGTG
GATGCGGCGA TCGATGAGAT CATCGCGGAC GAGGAAACCG GTGAATTCGT TCCGGGGACG
TCGACGGCTG CGGAGAACTC GGTGTATCGC ACGATGTTTC ACACCGTTCG CCGTCTCGTG
GACCTCGAAG CTTCCAAGCG CGCTCACGGC TCAGATAGCG ATCCATTAGA TACGTTACAG
GCGATCGCCA ACGCGCTGAG TTTAGCGAAC GCGTTGATTT TACAGATTCA TGGAAGTTTG
GCGAGCAAAC AGGCGGCGGT GGATCTGCTC GATCCAGTCG CGAAAGGCTT CGTCGAACAG
TTTTTCACGG CGGACAACTC CTCGCTCATG CTCGCGCAAA ACATGGCGCA GTTGGCGATC
GAGATGCGAG GACCAAGAAA TCACATCCAG AGCCAAAATC ATTCTGCCGC TTCTTTACCT
TCGCGGGCGA GCAGTGAAGG CGTGCGAGAT GAGGACAGTC CGGGAGGGTC CATGAATACA
ATTTTCGCCG ATGTAGGCAG GAATCCGATG TACACGAGTT GGACGTCGTT CGACATCTAC
AAATTCGCCA GCGAGTGCGA GCGCGGACGT AAAGGTCCGC TTCAAACGCT TAGTATGGGT
TTGTTTGACC GTTTCAATTT GTTTCATGCT TTACCTTTGG ATCGCAATTC TGTCTCAAAC
TTTATTGCAG ACATTGAGAG GAAATATCGC GACAACGATT ATCACAATCG CGTTCACGCC
ACCGACGTGA CTCAAGCCGC GGCTTATCTC ATCGAGACAA GTTTAGAGTC CCAGATTGAG
CCTATACACA CTTTTGCAAT GCTTGTTGCC GCCATGTCTC ACGATGTGGG ACATCCTGGA
GTCAACAACA CATTTCTCGT CAACTGTAAA TCCGCGGAGG CGGAGCGTTG GAACGATGTG
AGCGTCAACG AGAATGGTCA CTTGTTCACA GCTTTTTCGT TGCTCAAGAA GCACGCGGTG
CTAGCGAAAT TTACAGATTC CGAGCAGTCG GACTTGAAGA AGTGGTTACA AAAGATGATC
ATGTACACCG ACATGGAGTT CCATGGCGAG CTGACGCAGC GAATGCTGAA GGAAATCGAG
GACGAACAAG ATGAGGAAAC GAATAGTATC AAACCGATTA AACAGTGGCA AAATATTTGG
GTACCGTTAG CATTCGCGCT ACATTGCGCA GATATTAGTA ATCCCGCCCG CCCGTACGAG
CTAGCGTTGG CTTGGGCGCA AGCCGTCACG GCTGAGTTCT ACAAGCAGGG AGATCGCCAA
CGCAAACTCG GGATGCGAGT AGAACCCTTC ATGGACAGAA GTCTGGCGGG ACCGGCTAGC
ACACAATCCA ACCAGCTAGG TTTTATCAAG TTGGTTGTGA AGCCTTCACT TTGTGTTCTC
GAGGCGTTCA TGCCCGCGGC TTCGCGACAT CTCTTGGATA CTTTGGAAGA GAACATCGCG
GCGTACGGCA ACGACGTCGC CTCCGCGGCT CACGCAGGGA GTTAAAGGTG TA
 
Protein sequence
MGFFRKKTST RASSDEGREN ASAEGRRSLR DGADGTTSPK GARSGELSEN GSTRAVGTER 
MSQSSELDDY ESRDTLTEFA TSSASPRVPP GRKSLRRRSL ELYRQAAAGA TMTNSKKALV
DAAIDEIIAD EETGEFVPGT STAAENSVYR TMFHTVRRLV DLEASKRAHG SDSDPLDTLQ
AIANALSLAN ALILQIHGSL ASKQAAVDLL DPVAKGFVEQ FFTADNSSLM LAQNMAQLAI
EMRGPRNHIQ SQNHSAASLP SRASSEGVRD EDSPGGSMNT IFADVGRNPM YTSWTSFDIY
KFASECERGR KGPLQTLSMG LFDRFNLFHA LPLDRNSVSN FIADIERKYR DNDYHNRVHA
TDVTQAAAYL IETSLESQIE PIHTFAMLVA AMSHDVGHPG VNNTFLVNCK SAEAERWNDV
SVNENGHLFT AFSLLKKHAV LAKFTDSEQS DLKKWLQKMI MYTDMEFHGE LTQRMLKEIE
DEQDEETNSI KPIKQWQNIW VPLAFALHCA DISNPARPYE LALAWAQAVT AEFYKQGDRQ
RKLGMRVEPF MDRSLAGPAS TQSNQLGFIK LVVKPSLCVL EAFMPAASRH LLDTLEENIA
AYGNDVASAA HAGS
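
The sequences in this record are printed in space-separated blocks of ten characters, so they must be joined before any length or composition check. The sketch below, in Python, shows one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and this is not necessarily how the database derived its reported value.

```python
def join_blocks(text: str) -> str:
    """Strip all whitespace so a block-formatted sequence becomes one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
example = "ATGGGGTTTT TTCGAAAGAA"
seq = join_blocks(example)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence text to `join_blocks` and then to `gc_fraction` gives a value that can be compared with the reported GC content.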