Gene OSTLU_38469 details


Gene Information

Locus tag: OSTLU_38469
Symbol:
ID: 5001841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 356287
End bp: 357438
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table:
GC content: 62%
IMG OID: 640417262
Product: predicted protein
Protein accession: XP_001417745
Protein GI: 145346541
COG category:
COG ID:
TIGRFAM ID:
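
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names gene_length and protein_length are hypothetical helpers for illustration, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 356287
END_BP = 357438


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Residue count for an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1152 383
```

Under these assumptions the computed values agree with the reported Gene Length (1152 bp) and Protein Length (383 aa).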


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.416523
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACG CCGACGCCGC GGACGGCGTC GAGGGCGACG CGCGCGAGCG CGAGGCGTTT 
CTGAAGATCG CGCGCGCCGT GCACGCGTAC GACGCCGACG CGGCGCGACT GCTCGAGCGA
TGGCGCGCGC GACTCGAGCG GGACGACCTG CCGAAACGCT ACGACGGCGC GCTCGACGGC
GCGCGACGGG ACGTGCGAAC GCTCGGTGCG AAAGCCAAGG CGTTAAACTA TGATTTCTTG
CGGTGCGCGC TGCACACGTT CGTGGATAAT GAGCGCGCGC CCGCGCATTT GCGCATTCCG
AGCGCGCGCG TCGCCGCGTG GTCGCGGGAC GAAGCGTTTC GCGCGGAGCG GGACGACGTG
GATAAGGTGC GGTACGTGCT GAAGAATGTG TGGCGAGATT GGTCGGAAGA GGGTGCGCGC
GAGCGGAAAC CGGTGTACGA TTTGATATTC TCGGCGTTGA GGGAGAAGTT GGGGGCGATC
GACGCGCGCG TCGGGAGCCC GGTTGGCGAG GCGCCGCGCG TGCTCGTGCC TGGATGCGGT
TTGGGACGAT TGGTGTTTGA GTTAGCCAAG CTCGGATACG ACGCGCAAGG GAATGAGTTT
AGTTACTACA TGTTGATGTT CTCTTCGTTT TTGCTGAACG CGACGAGCGA GGTTGGGGAA
TTTGGAATTT GTCCATGGAT GCATAGTCGA AGCAACCATC GCGAGGCGGC GGACATGTGG
CGGGAAACGC GCATCCCAGA TGAGGTTCCG GGCGACGCGA ATTTGCCACC AGGAGCGATG
ATGAGCATGG CCGCTGGGGA CTTCGCGGCG GTGTACGGAG AGGCGCGCGA AACCGGAATG
TGGGACGCCG TCGTGACGTG CTTCTTCATC GACACCGCGC ACAACATCGT AGAGTATTTA
GAGTGCATCG CCAACTGCCT ACGTCCTGGA GGATGTTGGG TGAATTTCGG GCCATTGCTT
TATCATTGGG AAGAGTACGT CGACGAACAG AGCGTCGAAC TGTCGCTCGA GGAAGTGCTC
GCCGCGGCGG AATCGTTCGG CTTGCGCGTC GAGCGCTCGG AATCGACCGC GCCAGTCGAC
TACACGAGCG ATCCACGCTC CATGCACAAG ACGACGTACT CGTGCGCGTT CATCGTCGCC
ACCAAAGTGT AA
 
Protein sequence
MVDADAADGV EGDAREREAF LKIARAVHAY DADAARLLER WRARLERDDL PKRYDGALDG 
ARRDVRTLGA KAKALNYDFL RCALHTFVDN ERAPAHLRIP SARVAAWSRD EAFRAERDDV
DKVRYVLKNV WRDWSEEGAR ERKPVYDLIF SALREKLGAI DARVGSPVGE APRVLVPGCG
LGRLVFELAK LGYDAQGNEF SYYMLMFSSF LLNATSEVGE FGICPWMHSR SNHREAADMW
RETRIPDEVP GDANLPPGAM MSMAAGDFAA VYGEARETGM WDAVVTCFFI DTAHNIVEYL
ECIANCLRPG GCWVNFGPLL YHWEEYVDEQ SVELSLEEVL AAAESFGLRV ERSESTAPVD
YTSDPRSMHK TTYSCAFIVA TKV
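
The record reports a GC content for the gene. As a rough sketch of how such a figure could be recomputed from the gene sequence block, the Python below strips the whitespace that separates the 10-base groups and counts G and C bases. The function name gc_content is a hypothetical helper, and the record does not state how the database rounds its value, so no particular rounding is assumed here.

```python
def gc_content(seq):
    """Return the GC percentage of a nucleotide sequence.

    Whitespace (spaces between 10-base groups, line breaks) is ignored,
    and the sequence is upper-cased before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence from this record, used as sample input.
# Paste the full gene sequence block to check the whole gene.
first_line = "ATGGTCGACG CCGACGCCGC GGACGGCGTC GAGGGCGACG CGCGCGAGCG CGAGGCGTTT"
print(f"{gc_content(first_line):.1f}% GC over {len(''.join(first_line.split()))} bases")
```

The same function accepts the multi-line sequence as a single string, since all whitespace is removed before counting.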