Gene OSTLU_24970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_24970 
Symbol 
ID5003129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp597613 
End bp599338 
Gene Length1726 bp 
Protein Length552 aa 
Translation table 
GC content59% 
IMG OID640418550 
Productpredicted protein 
Protein accessionXP_001419214 
Protein GI145349594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0280772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.584273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATGG ATCTCAGAGG AGTGCCGATG ATGTTGTTAG ATCGCGAGAG CACGTTGTTG 
GCGAGGAAGC GATCGCTGGA TAACGTGATC GCGGCCGATG CTGGGAGGGA GAAGGATATG
TGGCAGGTGC GAACGGATGA ACGCATGAAT GGATTGGGAA AGCAACCGAC GCTTGCTCCT
CCGACGTCGT TGACGTGCCT GAAGTCGGGA TCGTGTTTGC CGATCGGTGG GTACAGCGTG
GTCGCCACCG CGCCGCCGAT GTGGGGGCGC ACCGACGGGC CGCACGCGGG CAAGAGCGTG
ATTTTAGTCA TCGCGCGATT GGACGCGAAC GGGTTGTTTA GAGACGCGAC GCCCGCGGTG
AACGCGCGGA TGACTGGCCT CGTCGCGTTA ATGGCGGCCG CCAAGTCGAC GCAAAAGATG
TTCAAGGAGA TGGACGCGGA AGACGTCGCC CATCCCGTGG CGTTTCTGGC GCTCAGTGGT
GAAGATTTCG GTAAGCTCGG CTCTGAGCGA ATCGCGCGCG AGATGCTCGC CTCCACCGAA
ACGTCGCAGC TTCCCGGTTT GGCTGGGAAG AAAATTCGCG CCATCATAGA ACTTGGTCCG
CTCGGTTTCT CTGAATCCTT CGTGGGCGAA CGCGTGCCGA CGATTTACGT CCACGGCGCG
CACGAGGGGA AAATGTTGGA GCGGATTGAG AAAATCGCGA ACACGTTTGA GTTTGAAGAG
ACGGCGGCGT TGGGTGTACC CCGGATTCTT GAGACCGCGG CGCCGACATT CGAAGCGCTC
CTCGCGAATG AGTACGAAAC GGCGTATTTA TCCGAGGATC CCGACGCTCC AATCGACGAG
CTCAGTGGAA CGACGTTGGA CGCCGGTAGT TTTCGCGCCA TCGACGCCGA GCGCATGGAA
ACTGTCGTGC ACGTCATCGC GCGTCTCGTG CGCGCCCTCG CGACGAACGA CAAAGTCGAC
GCCCCGCGCT TGGATGTCGA GGGAGGTAAG CTCGCGGTGA AAGAGCTCGC AAAGTGTCTC
ACGAATGAAA ACTACGGTTT GGAGAGATGC GAACTCGGCA AGAAGTTTTT GTACGGCGAA
GAAGCGAGCG CGCCCGGACT CGGGGAATCT TTCGTCGACC CAGTCGCCTT GCCGTCTCGT
TACCCCGATG CACTGCAAGG ACTCTCTCGC GATATGCAGT CGCACGAAGA CAAGAACGCC
CTGGCAAGAT TCGTGTGGAA TTATCTCGCC GACGCCACGT CGAACACGGT TTCGCCCAAG
ATGTGCGAAG GAGACGGTTC TTGCGCCGAA AACACCGTAT GCGTCGGCCG CACGCCGATG
AGCGTCGGTG AGTGTCATGC TGCGACGTCG AAGTATATGT TAGCGCTTTC GACGCGATTA
GCTTTTGATC GTTCGACGGG TCTTTGGATC GTGAACGAGC CCAAAGACCC GTTCGAGCGC
GCGGCGCCGC TGTGGACGGA GAGCGACTGG TCGCCGGCGA TCGGTGCCAC GCTCGTCGCC
CCGGTGAAGT ACAACTTCTT CACGAGCGTG GATGCTTTTC TTCTGTACGG TGTCATCTGC
CTGATGCTCG TCGTGGCTGC GCAGTTTTGC TTCGATCGCG ACAAGAAACG CGGCGGCGCT
CGCGAGCGCG AAGCGCTTTT GCGAGGCGCT CAACCGTGAC GCGCGGCGCG AACAGAAAAC
GCGTGTAATA TTAGTAGTTG ACAATTAAAA CGACCACTGA AACGAA
 
Protein sequence
MGMDLRGVPM MLLDRESTLL ARKRSLDNVI AADAGREKDM WQVRTDERMN GLGKQPTLAP 
PTSLTCLKSG SCLPIGGYSV VATAPPMWGR TDGPHAGKSV ILVIARLDAN GLFRDATPAV
NARMTGLVAL MAAAKSTQKM FKEMDAEDVA HPVAFLALSG EDFGKLGSER IAREMLASTE
TSQLPGLAGK KIRAIIELGP LGFSESFVGE RVPTIYVHGA HEGKMLERIE KIANTFEFEE
TAALGVPRIL ETAAPTFEAL LANEYETAYL SEDPDAPIDE LSGTTLDAGS FRAIDAERME
TVVHVIARLV RALATNDKVD APRLDVEGGK LAVKELAKCL TNENYGLERC ELGKKFLYGE
EASAPGLGES FVDPVALPSR YPDALQGLSR DMQSHEDKNA LARFVWNYLA DATSNTVSPK
MCEGDGSCAE NTVCVGRTPM SVGECHAATS KYMLALSTRL AFDRSTGLWI VNEPKDPFER
AAPLWTESDW SPAIGATLVA PVKYNFFTSV DAFLLYGVIC LMLVVAAQFC FDRDKKRGGA
REREALLRGA QP