Gene OSTLU_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3943 
Symbol 
ID5005618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp126032 
End bp127552 
Gene Length1521 bp 
Protein Length507 aa 
Translation table 
GC content55% 
IMG OID640421039 
Productpredicted protein 
Protein accessionXP_001421580 
Protein GI145354625 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1499] NMD protein affecting ribosome stability and mRNA decay 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.673808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGCGTGCTGT GCTGCCTGTG CGCGATCCCG ATGCACCCGA ACCCGAGCGG CATGTGCGTC 
GAGTGCGTGC GCACGCAGGT TGACATCACC GAGGGCATCA GCAAAAACTG CGTCGTCACG
TATTGCTCGC AGTGCGAGCG CTACCTGCAG CCGCCCAAGC ACTGGATCCG CGCCGATCTC
GAGAGCAAAG AGCTGCTGAC GTTTTGCATC AAGCGCATCA AGGGATTGCA AAAGGTGAAG
CTCGTGGACG CGGGCTTCGT GTGGACGGAG CCGCACAGTA AACGACTGAA GACGAAGCTG
ACCATACAGA AGGAAGTGCT GAACGGAGCG ATACTGCAGC AGACGTTCGT GACGGAATTT
GTGGTGGAGT GGAGGATGTG CGATGCGTGC GCGCGGAGCG CGGCGAATAG TGATCAGTGG
AACGCGTGCG TGCAGGTGCG GCAAAAAGTG GAACATAAGC GGACGTTTTT GTTCCTCGAG
CAGATGATTT TGAAGCACGG CATGGAGCGA GACGCGATCG GGATCAAGAG TCAGCCGGAT
GGGTTAGATT TTTATTACGG TCATCGATCG CATGGGTTGA GGTTTGTAGA TTTTTTGGGC
AGCGTGGTGG CGACGCGGTC GAGGGGCGAC AAGCAGTTGG TGTCGCACGA CGCGAATAGT
AATACGTACA ATTACAGGTT TACTTTCTTC GTGGAGATCG TGCCGGTGTG CAAGGAAGAT
TTGGTGGTGA TTCCTTTCAA GCTTTCGAAG GAGTTTGGGA GCGTTGGGCC GGTGATGCTG
TGCACGCGGG TGTCGAATTC TTTGCAGTTT ACGGATCCGG TGACGATGCG ACAGATTTGG
ATCGATCAGG AAAAGTATTG GCGCCAACCG TTTCGCGCGG CGGCGACGGC GAAGCAGATG
ATAGAGTACG TCATCCTTGA CGTTGAAGTC GATCAATCCA CGAGACAAGG GAAGTTGTGC
ATGGCGGATG TCGAAGTCGC GCGCTCAAGC GATTTCGGCG TCAACGACAC GACGTTTTTC
GTCAAGACGC ACTTGGGGAA TATCTTACAA GCGGGCGACA CCGCGCTTGG ATACGATTTG
ACAAATTTGC AAATCGTCGA TCCAGAGATG GAGAAGTACT CGGGTAAACA TCAAGGCGTT
GTGCCGGATG TTTTCCTCGT GAAGAAGTCG TATGCCGAGA GTCGTCGACG AAGGCGCGAG
CGAGGCGTGG CTAGAAATTG GCGTTTACAG CGCATGCAAG TGGAAGAAGA CGAAGAAACG
CAACAAAGGT CTCGCGGGAG CGCTGATCGC ATGGCTCAGG ACGAAGAGTT GTTCTATCAA
GAGTTGGAAG AAGACGAGGA AACGCGAGCG CAAGTACAAA TTTTCAAGGA TGAAAACGCC
ATAAACGCCA ACGTCACCGC GGCGGCTGGC GACGACGACG ACGACGACGA CGACGCACCA
GAGGTGCCGA TCGAGGAGCT TCTGGATGAG TTGACTCTAT TGCAGCGCCA AGCTGACGAC
GAGGACGACG AGAACGATCT C
 
Protein sequence
SVLCCLCAIP MHPNPSGMCV ECVRTQVDIT EGISKNCVVT YCSQCERYLQ PPKHWIRADL 
ESKELLTFCI KRIKGLQKVK LVDAGFVWTE PHSKRLKTKL TIQKEVLNGA ILQQTFVTEF
VVEWRMCDAC ARSAANSDQW NACVQVRQKV EHKRTFLFLE QMILKHGMER DAIGIKSQPD
GLDFYYGHRS HGLRFVDFLG SVVATRSRGD KQLVSHDANS NTYNYRFTFF VEIVPVCKED
LVVIPFKLSK EFGSVGPVML CTRVSNSLQF TDPVTMRQIW IDQEKYWRQP FRAAATAKQM
IEYVILDVEV DQSTRQGKLC MADVEVARSS DFGVNDTTFF VKTHLGNILQ AGDTALGYDL
TNLQIVDPEM EKYSGKHQGV VPDVFLVKKS YAESRRRRRE RGVARNWRLQ RMQVEEDEET
QQRSRGSADR MAQDEELFYQ ELEEDEETRA QVQIFKDENA INANVTAAAG DDDDDDDDAP
EVPIEELLDE LTLLQRQADD EDDENDL