Gene OSTLU_16474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16474 
Symbol 
ID5003472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp387804 
End bp389543 
Gene Length1740 bp 
Protein Length579 aa 
Translation table 
GC content53% 
IMG OID640418893 
Productpredicted protein 
Protein accessionXP_001419367 
Protein GI145349905 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACGCG CGTCACGCGT CGCGCCAGCG ACGGTGACGT ACATCTCCTA TCTGCCCGGC 
GACGACGATC CTTTGGACGA GCGCGCTACG GAGGCGTGTC AAGCGCTAGT GGACGCCCTC
GATAGTGCGC TGCGAAAGAC GAGTGAAGCG TTTTGGGCGC ACGTGCAGAG AAACGGACGA
GGGCTCGGGA GGAGTCTGGA CACGTACTTG CAGTTTAAAA CGCGGCCATT CGAAACTCGA
GGTCGGGGGG CCGAACCGTC GACTTCAGTT ACGGAGGACG AGTTGGGAAG GAAGGTCTTC
CTCACGATGC TCAGACTGGT GAATGACGGC GCTCGAGAGC GAAGCTGCTT AACTCTCGAA
GAGCGGAGGC AGGCGGTTTT AAAGCATAAC CTGATCGACG TACCGAAATT GATTGACTTG
ACGGTGATTT ATGGAAGCGA TAACAGGGAT CTAGTGCACG AAGTGCTAGA AAATGCTGTG
AAACTGTTGC CGACTCTCGA AGACGATTTT GGGCGCACTG GATTGATGAT CGAAAAGAAC
TTGAAAGATA TGGCAGCGCG AGTGACTCCT GCGGCTGAAG CTAACGAGCT CCCACCAGAC
GGGTTGAGTG AATCGCTGTC ATACTTTCAC GACGTTGCTA TTTCACTTGT TTCACTCGTC
ACCGTGTATC CAGAAGCCGC AGAATGGTTG CGTCGTGATG GGAATCTGAC AGGTGCGCTC
GAAAAGATCC GAAGTATGGT ACTTTCTAGC TTCCAGTCCA TGGCAACTTG CGATAGTGAT
AGTTTAGATA CGGTGAGAGG TTCTTTAGGC GCGGCAATTT CCTTTCTGAG AAGTGCAGAT
GGAGGAGGAG GAGGAGGAGG AGGAGGAGGA GGAGATGGAG AACCCGTGGA TGCAGAGCGC
GATAAGTACG TAGGACTGGC TCCAGTGGTC GTGAACACGA TTGAATCTGT TAGAATCATC
CTACCCGATG TCGGCATTGG GTTTCTTAAG GCATGCGTAG ACCATTTTGG TCCCAACGCT
GAGTCAATTG TGCAGCATTT GTTTGAGGAG TCGCTTCCGC CTGATTTGCG GTCGCTTGAT
AGACAGCTGG GATGGCCGCC ACCCGCGTCA AAGTTGACCT GGCGGAGTCG TCCTCAGCCC
AGTATACCCG TTGGCAAGAG ATTGAACAAG GAAGCTGACC TCGAACTTTC AGTGGAAGAT
AAGAAGCGAG TGCTGAAGGC AGCGCGCAAT TTGGAATACG AGGATGAATA CGATGATTCA
TTCGACGATC TCCCAGTGCA AGTAGCGAAC GTCACGCTCG GTACGGACGA ATTAGAAGAT
GGAGACGGTC GCAAAGAACG TGCAAACGCA TCGAAGCGTG TCTTCTATGT GGCCGACGGC
AAAGTCTATC ACTCCCACAA AGCCGGCGCC GAGAAAATCT TCGCGCATAG CGCAGAGGAA
GCGTCAACGA TCGCCCTGGC CCAAGAAAAA ACGAAAAGAA ACGAGATAGA AGGACTCGGT
GCGGGTGGAA ATAAGTCACG GTTCGATTCA GCATTCACCC CGAGCACAAG CGCGGTACCT
TTCACGCCAA GGGGATTGTC GACAAGTGCT TCGGCCAAAG AACAACCCAG GGCAAACGGG
GGTGCCGGTC GCGGAGGAAG AGGGTCTGGC GGACGACAAA CGAGAGGACT GACTGCAAAA
GATTTTACAC ACAAGCATCA CAATCAAAAA GCAAAAGCAG CTAAGAAGGC GGCGTTGTAA
 
Protein sequence
MSRASRVAPA TVTYISYLPG DDDPLDERAT EACQALVDAL DSALRKTSEA FWAHVQRNGR 
GLGRSLDTYL QFKTRPFETR GRGAEPSTSV TEDELGRKVF LTMLRLVNDG ARERSCLTLE
ERRQAVLKHN LIDVPKLIDL TVIYGSDNRD LVHEVLENAV KLLPTLEDDF GRTGLMIEKN
LKDMAARVTP AAEANELPPD GLSESLSYFH DVAISLVSLV TVYPEAAEWL RRDGNLTGAL
EKIRSMVLSS FQSMATCDSD SLDTVRGSLG AAISFLRSAD GGGGGGGGGG GDGEPVDAER
DKYVGLAPVV VNTIESVRII LPDVGIGFLK ACVDHFGPNA ESIVQHLFEE SLPPDLRSLD
RQLGWPPPAS KLTWRSRPQP SIPVGKRLNK EADLELSVED KKRVLKAARN LEYEDEYDDS
FDDLPVQVAN VTLGTDELED GDGRKERANA SKRVFYVADG KVYHSHKAGA EKIFAHSAEE
ASTIALAQEK TKRNEIEGLG AGGNKSRFDS AFTPSTSAVP FTPRGLSTSA SAKEQPRANG
GAGRGGRGSG GRQTRGLTAK DFTHKHHNQK AKAAKKAAL