Gene OSTLU_29689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29689 
Symbol 
ID5006960 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp120451 
End bp121988 
Gene Length1538 bp 
Protein Length420 aa 
Translation table 
GC content64% 
IMG OID640422381 
Productpredicted protein 
Protein accessionXP_001422902 
Protein GI145357389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value0.0208694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00156333 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCGACC TGACGTCGAC GGTGCGACGC GACGCGACGC GAGCGCGACG CGACGCGCGA 
CGGGAGACGA CGGGAGACGA CGGGCGGCGA CGCGACGAGC GGACGGCGAA CGCGAACGCG
GCGACGCGGC GGCGGCAGGG TTTGCGCGAT CGCGACGGCG ACGCGGCGCG CGCGACGGCG
GCGGACGCGC GAGGACGCGG GCGAAGAGTA ACGCATCGCG AACGCGTCGC GGACGCGAAG
GGCCGACGCG CCGCGGTCGA GCGCGAGGAC TGACGGAGCG CGGGACGACG ACGCAGATCG
CGGCGCGGCT GGACGCGCAC CTGACGCTGC CGCTGCTGGA GTTCGCGCTC GCGGAGGGCA
CGCACGATGC GAAGTCGCTG CGAGAGGCGA AGCTGGCGAC GCTGGTGAAG ACGAAGATGT
GCGATTGGGC GGAGGAGGCG CGCGCGGAGG CGAGCGGGTC GGGATCGGCG GCGGAGGCGA
AGGCGAGGCG GGACGCGACG GTGGAGGCGC ACGCGGCGCT GGGGAAGGCG GCGGCGCGGG
CGGTGAAGTT CGCGAGCGAT GGGGCGCTGA TCAAGAACTT GCGACGGGAT AAGGCGGCGA
ACGCGAAGTT CGCGGAGGAT AATCACGGGG TGACGAGCGC GGACGTGGAC GCGCTGTACA
AGTTTGCCAA GTTTGAGTAC GAGTGCGGCG ATTACGAAAA CGCCTCGGAG CACTTGGGCG
CGGTGCAGTT GTTGAGCGCG GACAACGAGC GGTGCGAGAG CGCGCTGTGG GGGAAGTTCG
CGGCGGACAT TTTGTTGCGG AACTGGGGCG GGGCTCTGGA CGACATGAAT AGGTTGCGAG
ACGCGTTGGA GAGCAACGCG AGCACGAGCA ACCTCGTCAA GATGAAGCAG CGCGCGTGGT
TGTTGCATTA CGCCCTGTTC GTCTTCTTCA ACCACCCGAA CGGTCGCAAC TTGATCATCG
ACGTGTTGTT CCAGGAGCGA TACATGCAAG CGGTGCAACA AGAGGCGCCG CATTTGTTGC
GTTACCTCGC CGTCGCCATC GTCGCCAACA AGAAGCGCCG CAACATGCTC AAAGACTTAG
TGAAGATTAT CCAGAGCGAC GTGTACGACG ATCCCGCGCT CGACTTCGTC GTCGCGGCCT
TCGTCGACTA CGACTTCTCC AAGACGCAAG AGATGCTGAA GAAGTGCGAC GCGATGATTG
AAAAGGATTT CTTTTTAATC GGCTGCAAGG ACGCGTTTGA CGAAAACGCT CGACAGTACG
TCATCGAAAA CTACTGCAAG GTGAACAAGC GCATCGACAT CGCCAACTTG GCGCAAATGC
TCGGTATGCC CGCCGCCGAC GTCGAGGCCA CCATCGCGAC TCTCATCCGC GGCAGTAAGC
TCAACGCGCG AATCGATTCC GAAGCCGGCT TCGTGCACGT GCACGTCGAG AAGAAATCCG
TCAACGAGCA AATCATCGAA AAGACCAAAG CCTTGCTGTC CAAGACCACC GCCCTCACGC
AAGCCGTGTT GGCCAACACC CAGGCGCAGG CGTATTAA
 
Protein sequence
MRDLTSTIAA RLDAHLTLPL LEFALAEGTH DAKSLREAKL ATLVKTKMCD WAEEARAEAS 
GSGSAAEAKA RRDATVEAHA ALGKAAARAV KFASDGALIK NLRRDKAANA KFAEDNHGVT
SADVDALYKF AKFEYECGDY ENASEHLGAV QLLSADNERC ESALWGKFAA DILLRNWGGA
LDDMNRLRDA LESNASTSNL VKMKQRAWLL HYALFVFFNH PNGRNLIIDV LFQERYMQAV
QQEAPHLLRY LAVAIVANKK RRNMLKDLVK IIQSDVYDDP ALDFVVAAFV DYDFSKTQEM
LKKCDAMIEK DFFLIGCKDA FDENARQYVI ENYCKVNKRI DIANLAQMLG MPAADVEATI
ATLIRGSKLN ARIDSEAGFV HVHVEKKSVN EQIIEKTKAL LSKTTALTQA VLANTQAQAY