Gene OSTLU_40935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_40935 
Symbol 
ID5002336 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp390894 
End bp394400 
Gene Length3507 bp 
Protein Length926 aa 
Translation table 
GC content56% 
IMG OID640417757 
Productpredicted protein 
Protein accessionXP_001418236 
Protein GI145347569 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.240067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCAAA TCCGTCGCAA CAATTTGAAA ATTTCTGCCG ACGTCGATGA TTTTGCGGGG 
AGCTTTGACG TCGCGGGTCT GTTGCTCGAC GACTTAGAGT TTGCATCATT ACGTGGCAAA
GTAGAAGCGG CGAGTGCAAA AATTGATTTG CGTGATCGCG TCGGTGTTGG CAGCCTGCAG
TTGAAGCAGC CACGTCTGAG TGGTATTTCG GGGGATTCAC TGGAGGCCGA CGTCACGTGG
GCGGATCGCG TCGTCAGTCT TCAACGCGCG ACGTTGAAGC AGGCGAAGTC GCAATACGAT
GCCGACGGAG ACTATGCCTT GCCGGATGAT ATCTGGAACT CTTTGCCGAG CGAACGCGTC
GTAGTTGAGG TGGAAGAAGT TTCAGAAGTT GAAGAAGTCC TCCTGCCGGC GCCAGTCGAT
GACGGCGTCG CGCAGTCGTC GCTGCCGGAA GACGTGCAAG TGATCGAAAT CGTTCCGAAA
GACGCCAAAA ATAATACTGA GACGATCGTT GGTAAGCGAG TGAGATTTCC TCGTCCAGCG
ATTTTGGACA AAGTTGGTAA TTCCGTAAAG AGTCTCGCCA AAAAACTCGA CGCTTTGCGG
AAACCGGCTA CCACGACCAC CGGCGAAAAC ACTCGTTCGG GTGCCGATGT TGTGGCTACC
GAGCCCGTCG AGCCCGTTTC GAAGAAAGAT GGAGTGCCCG AACCGATCAT CGCACGAGCG
ACGAAGGATG TTGAGCAGCT CGAAACTGGC ACAGACACGG ACAGCGAAGG GACTGACGTG
GACAAGCCTT CGAGCGCGTC GACTGAGATC AAAAACGTTG AAAGCGTCGC CGACGCCGCC
GCCGTTGCTG AGACTGTTGC TGAGCAGACG AGTGAAAGCG AGAAAGAAAC CTCTCTGATG
GAGCCCGTGT TAGTCGCCGG GGAAGCGGAA GTAGCCCAAA AAGCGAAACC GAACGCCGCG
GCGAACGCAG TTCGCGGCGT GTTGCGGTCG GCTCGTAACG TCGTCAAATC AAACAAAGTC
AACGAGAAAG AGAAGGATGA AAAATCTCGT ATCACGCTTT CATCTTACGA GAACGAGTTT
AATTCGGACG TTTCTGGTGC GTGGCGCTTC CGACTCGCGG TGCCTGAGGC AGATATTGAA
GAGATGCTTC CAGTTCTTCG TGTCCTTACG GATTTGCGAA AAGGGGCGAC GCCCGAAGAG
TATGGCCGAG CCAAGCAAGC GTTCCTCGAA GGTGTCGAAA AAACGGGATA CGCGATCGTC
GACTTGGCGA GGCAAGTTGA TGAAGTTACG ACGAAGCAAA AGAGCGAAGC TCCAGTCGCC
GAGTCAGCTA CGTCGACGGA CGACACGAAG GAAGTTAGTG AGACGACGAA GACATTGCCC
GGTTTACAAG ATTTGAAGGG TGGTTGGCAC GGTATGATTC AAGCCACCGG CGGCGGCGCT
TTGGAGACGT TTGATTCGCC GCAACCGACG GAAACCGTCC TTTTCGACGT TGCCGGTAGC
GATTGGCAAT GGGGACCTTA CAAAGTTGAA CGCGTCGAAG CTCAAGGCGA AGCGAGTTCG
AGCGAAGGCG TCAAGTTGAC GAGCTTAGAA GTGTCTTCGG ACGCCGCGAG TCTTTCCGTG
TCGGGCGCTA TCGGTGGTTC TCGTCAAGAC GCGACGTTTG CCGTCAGAGA CCTCCCAGCG
CCCTTACTCG GCGCATTTGT TGGGCCGCTC ATGCCCGAGC AAGACTTGAG CGACTTCCCA
CGAATAGGTG GCGATTTCCT CGTGCAAGGT CATTTGGGCG GTTCGGTGAC GGCGCCCGAG
GGCGAGTTCC TGATGAGACT TCGAGACGGC AAGATCGGTA ATGTCAAGCT GAAGAGCGCC
GAGTTGAGTG CGGAGTTGAA CGAAGCTCGC CGTGCAGAGT TTGAGGGCGA GGCGGTGCCA
GCTGTAGGTT CTGGTCTCGT GCGCATAGCG GGCGTCGTAC CGCTTCCGGA AGCGAACGAT
CAGTCTCTCG CCGTCGACTG GCGTGTGCGA GAGCACGGCA TGACGCTGCT CACGGCATTC
GTCCCAGAAG TCGCCGAATG GCAGAGTGGA TCGGCCGATA TGTCGTTGCA CGTCAGAGGT
ACGCCTGCGG CGCCTGTTTA CGATGGCGTC ATGGAAATAA GAAAGGCACG GATCTTGTCT
CCTTTGCTCG CCCGCCCGAT TTATCCGGCC AATGCGACTC TGCGCATCCA GCGCAACACA
CTTTACGTCG ACGACATCGA AGCAAAGAGC GCCAAGGGCG TAGTGCGAAT CAAAGGAGCC
ATGCCGTTGT TGAAGCCAAG CCGCAGCTCC GGTGGCGAGA CGTGGGAAGG TCTCGTAGCT
CGCGCAGACA CGCAAGGAGG CGTAAAGATG ACGATCGACG GCCTCGATGT CCGCGCAAGA
AATGTGTACA ACGGACAGCT GAACGCCGCT ATGGTGGCGA AGGGAACTGT CACGGCCCCG
GAACTTAGCG GAGATGTGCG ATTCTCGAGA GGTACCGCGT TCGTGCAGCA GCAGCCACCG
AACGCGGACG AAATGCTCAA TCAAACGAAG TCTGGCGCGT TGTCGGGGGC GAAGAGAGAT
TCGCGCGGAG TGCTGGCTGG GATTTTAGAG CGAGCCGCGA GAGCAAATGA TCCGAACCAC
GGCGAAGCGG GCAATCAGAC TGAAAATGAA CTCATGAACG AGAAGAATCT CGAAAAGTTG
CAGAATTTGC GCTTACGAGG CCTTCATCTT TCTGTCGGAC CTGAGATGTC TGTCGTTTAC
CCGTTTGTGT TGAACTTTGG CGTCAGCGGC GAACTCACGC TCGATGGCGT GATTGATGCA
GGCCTTTTAC GACCGAACGG CTCGCTTCTG TTCGATCGCG GTGACGTCAA CTTGGTGGCG
ACACAAATCA GACTCGATCG AGACCACCCG AACAGAATTG TGTTCACACC AGAGCAAGGA
CTCGACCCCT ACGTCGACAT TTCCTTCCTC GGCACGGATC TTCGAGCTCT CATACAAGGC
CCGGCTTCAA GGTGGACGGA CAGCCTCACG TTAACGTCTT CTGCGCAGAC TACTCCCGGC
GAAGGTGACG CGACGCTGTC TCCAAGTGAA GCCGCTCGCA TTTTCGAAGG TCAGCTGGTG
GAATCGCTCC TCGAACAAGA CGGAAAAATC GCCTTTAGCA ACTTGGCGTC GACGACGTTG
GCGTCGCTCA TGCCAAAGAT TGAGGCCGGT GGCAACGTCG GCAAGGCGCG CTGGAGATTG
ACCGCGGCGC CATCGTTGCC GGGTTTGCTT TCACTCGACC CTGACTTGGA TCCGTTCAGC
AACACTGGAT CGTTTACCCT CGGCTCCGAA GCGGAGATTT CATTCGGCGA TTCGTTGCAA
GCGACGTTGA GTCGCAACCT CGACGCCGAC GAAATGCGTA CGGAGCTGTC GCTCATGTAC
AAACTTACCA GTAAGCTCCG AATGCAGTTA AAGAGCCTGT CGGCGTCGGC GACGAGAGTG
ATGTTCGAAT TCAGCACGAA AGATTAA
 
Protein sequence
MVQIRRNNLK ISADVDDFAG SFDVAGLLLD DLEFASLRGK VEAASAKIDL RDRVGVGSLQ 
LKQPRLSGIS GDSLEADVTW ADRVVSLQRA TLKQAKSQYD ADGDYALPDD IWNSLPSERD
EKSRITLSSY ENEFNSDVSG AWRFRLAVPE ADIEEMLPVL RVLTDLRKGA TPEEYGRAKQ
AFLEGVEKTG YAIVDLARQV DEVTTKQKSE APVAESATST DDTKEVSETT KTLPGLQDLK
GGWHGMIQAT GGGALETFDS PQPTETVLFD VAGSDWQWGP YKVERVEAQG EASSSEGVKL
TSLEVSSDAA SLSVSGAIGG SRQDATFAVR DLPAPLLGAF VGPLMPEQDL SDFPRIGGDF
LVQGHLGGSV TAPEGEFLMR LRDGKIGNVK LKSAELSAEL NEARRAEFEG EAVPAVGSGL
VRIAGVVPLP EANDQSLAVD WRVREHGMTL LTAFVPEVAE WQSGSADMSL HVRGTPAAPV
YDGVMEIRKA RILSPLLARP IYPANATLRI QRNTLYVDDI EAKSAKGVVR IKGAMPLLKP
SRSSGVKMTI DGLDVRARNV YNGQLNAAMV AKGTVTAPEL SGDVRFSRGT AFVQQQPPNA
DEMLNQTKSG ALSGAKRDSR GVLAGILERA ARANDPNHGE AGNQTENELM NEKNLEKLQN
LRLRGLHLSV GPEMSVVYPF VLNFGVSGEL TLDGVIDAGL LRPNGSLLFD RGDVNLVATQ
IRLDRDHPNR IVFTPEQGLD PYVDISFLGT DLRALIQGPA SRWTDSLTLT SSAQTTPGEG
DATLSPSEAA RIFEGQLVES LLEQDGKIAF SNLASTTLAS LMPKIEAGGN VGKARWRLTA
APSLPGLLSL DPDLDPFSNT GSFTLGSEAE ISFGDSLQAT LSRNLDADEM RTELSLMYKL
TSKLRMQLKS LSASATRVMF EFSTKD