Gene OSTLU_29678 details

Gene Information

Locus tag: OSTLU_29678
Symbol:
ID: 5006866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009375
Strand:
Start bp: 104439
End bp: 107904
Gene Length: 3466 bp
Protein Length: 1150 aa
Translation table:
GC content: 56%
IMG OID: 640422287
Product: predicted protein
Protein accession: XP_001422808
Protein GI: 145357198
COG category:
COG ID:
TIGRFAM ID:
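
The reported gene length can be reproduced from the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That convention is an assumption (the record does not state it), although it agrees with the values listed here. The sketch below uses hypothetical helper names, gene_length and coding_length, and also compares the gene length with the number of bases needed to encode the listed protein length plus one stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode protein_aa residues, optionally plus a stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


length = gene_length(104439, 107904)  # Start bp and End bp from the record
cds = coding_length(1150)             # Protein Length from the record
print(length, cds, length - cds)      # prints: 3466 3453 13
```

The 13-base difference matches the bases that follow the TAG stop codon at the end of the gene sequence listed in this record.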


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.00854714
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.00187759
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGACACGA ATTGGACGCC AGCGACGCCC GCGCGCGCGC GTGCGGACGC ACGAATCGCG 
TCGGATAAGG TGCTGTCGAC AAAGGCGTTC GAGGACGCGC GGGTGAAGGA TCACCGCGAC
GTGGAGCGGG AACGCGGGGT GTACCTGCCG CCGAGCGCGG AAGAGGCGGC GACGCACGCG
GACAGCGCGC AGCCGAGACT GCTGAGCGAG GATGGGAAAT TGGAAAAACT TAAATATTTT
GACTCCGAAG TAGACTTGGG ATTGGAGTTC GGGGCGGGGG TATCGGGATA CTTTTTCGTG
CTGCGGGCGT TCGCGTGCTT GTTCTTCGCG GCGTTCGCGT TGAATTTGCC CGCGATGTTC
ATTAATTACA CGAGCACGTA TTACGCGTCG CATTACGAAG CGGAGCCTGA AACCGCGACA
GCGCCGACGG CGGACGCGCT CAGTGGACGC CCGACGACGT CGTTTCACTC GTGGTTCGTC
GATTGGCACT GGGCGAATCC GATGAGCGCG AGCTTGGGGA CCGTGGCGCC TGACGATGTG
CCGCAACGAG CGTGGGTGCT CGCGGGGACA ACGAGTGGGG TGCTGCGGTG GAGCTCCAGT
AAGATAGATT TCATTCGATT TTCGGTGGTG ATGGATGTCG TGGTGTCCTT GATAATTATA
TTGAGCGTTC CAGTGATTAT GCTTGTGTTG ATGCGTACGG AGCGACGGGT CGAGAGAGGG
ACGGCGACGT TGAAGGATTA CACGGTATTA GTTACAGGAT TACCAAGCGA TGCGACCGAT
GAAGAAGTGC GATGCTTTTT CGCGCTCAGG TTTGGGACGG TAGCCGACGT CGTGCTCGTA
AAGACGGAGT GTATGCAAAT CAACGCGCAG CGACGGCGTC GGCGCTTGTT GGAGGATTAC
GACGAAGCCG AAGCTGCGCT CATCGCCGCA GGTAATCGCG GAGGTGACGG CACGAAGACG
GCTATAGAAA ACGAGATATT GGCTGTGGAT AGGAAATTGA AGCGCAGAAG AGCGCGAACG
CGGGCGAAAC AGTGTAGTGC TTTCATCACG TTTGAAACCG AAGGCTCAAA GATTGATTGT
ATATTGCGCA ACACGCGCAG TATAATGTCG TACATCTTTG CGTTTCCTAG GAAAGAGCGT
TTCCGAGGAA AGCGGAAGTA CAGAGTGCGA GATGCTCCCG AACCTGAGGA CGTACGATTC
GAGAATCTGA ATCTCTCGAA TCGCTCATGG CGTCGACTCG TCGTGCTATT CTCGTGCTCG
GCCGTCGTGC TCTTGTGTTA CGGCTTTTTG AAGATGCTCG TGGATGACAA GGAGAAGCTG
TGGGAGAACG CAGACATGAT GGTCACTACG CTTGCGGACG ATGTTGGCAT TGTCGTCGCC
CACGGTAATC CGGTTGAGCA ATTCGAGACG CATAAGAATC AGTTCAGGAC AGCATGCAGA
GCTCGCTTGG ACCAATGCGG CGTGGCTTTT TCCAAGGACA AAACGTACGT TGGCATGCCT
TGGGGGGCGC CAATTTACGC TTTCTACGAC TACCCCAACG CCACACTCTT AGATCGTCGT
TACGCTCAGC AGGATGCGGT TCGTGATTTG ACGAATTGCG CCCAAGACAC CAACCGTTGT
CCGGGAGGTC CGACGATGCA AAACTGTCAC GCGTGCTACT GCGCCAGTCT GAAGTACGGC
TTAACTTCGG AGGTTGTTCG GGCATACAAC AAAGCCATTC GCCACTCGTG CGCGAAATAC
GTGAACCTCG GTCCAGGAGA GTATTACAAC TGGTTGTGGG TGTCGTTTTG CATTACGCTC
ATGAACGTCC TCCTCGAATG GATTGTGCCC CTCCTCGTCG CCGCAGAACG TTTGCGCACG
CGCAGTGCCA CGAAGGTGCT CAAGACGAAA ATAATCTTCT GGGTGCGATA TTTGAACGTC
GCGGTTATTT ATGGCTTGCT CAACGCCAAT TTCTATCACA TTGGCAGGTA TTTCCCGCTC
ATCAAGCAAA TGTTTGGGCT CAAAGGCGAG TACGCAGATT TCACGAGCGA ATGGTTCAAC
GACGTCGGCT TGGTGTTGTT TTTCGCAATC ATGATGAGCG TGACGATACG TATTTTGGCT
CGAGTACTTG TAGACATCAT CACGCATGTC GGTCGAAAAT TCTCCGTGGC GTACTGTCAC
ACGCAAGCAA AGCTGAACAA GGCGTTTGAA GGGCCATCGT TTGACACCGG CGCCAAGTGC
GGCGACGTTT GCTTTACGAT TATGGCGGCG ATGACCTTTT CGAGCGGTAT GCCGCTGATA
TACTTGGTTT TGTCGATGTA CTTCGTGTTG GTGTACCTGT ACGATTACCG CCTCCTGTTG
AAAGTGTGCA AGTTGCCCGA AAGATCGAAG AGCACGCTTC CCATGACGGC GGCGAAGGTG
CTTTTCATCT CAGTCTCGAT TCACGCTCTC ATCGGTCTTT GGATGTTCTC GTACCACTGG
ACACCTGATT TGGCGAAACC GACGAAAGAC TTTGAACATT CGAGTAAGAA CAATGCTCCT
CCGCTTGCGT ATGAGATTGG GGGTGAAATA TTAAACCCGC CGCACGACAA CGGCGCGCTC
ACGGCCATCG TGCAAAGCAC CGGCGTCGCG ACGACGTACT TTCATCAGTA TCGCGACGCG
ACAGTCGGGG CGTTGTCGGC GGGAGATTTC GTCGCTCCAC CGCCACGCGT ACAGCTGCGT
TTCGCAGAAC GACCGTTCAG CGAAGCGGGG ATGCCGTTCA TGGGTATGTT CTTCGCCCTT
TTGGGCGTCA TGGGTTTGTG GCAAATCGCA GTCGCGTGGC ATAACTGGGG CAAGTCGCGT
CGAGACATCG CGCGCTCGTG GAAAAACTTA CCTCAGTATC ACGAAGCCAT CATGACCGGG
TTAATCGTCG GTTCGGAGAC GTACCGTCCC GAATATCAGC CCGATTACGC GTTCTTGTTC
GACAAGAGCA CTGTCGCGGC GGCCAAAATG AAACTAGGCT CGTACGCGGG AGGCCCCGTG
CGCGGCGACT CGGTGAACTC GCGAGACGCG TGGTCGCTCG GACAAGGCGA CGACGATGAT
ATCGCGGAGC TGATTCATCA TCGTTATGAT AACGAACGCG AGTACGACGG TGGCGCGCCT
TGGGTGCGTA AGACCGGTTC AAAGATATAC GGCGGCGACG CCGCGACCGC GAAGCGAGGC
TCTCGCGGCA CGCGCCAGCG CGACGGCGGA CATTACGGCG TACCAGTGGT CGATGTTCGC
GCTTTGGGGG TTGGAAGCGA CTACAATCAC GTCGAAAACA ACGCGTTCGG GCACTCCGAC
GCGTTCGTCG CGGATGATCA TGAGTGGGAC TCCGCCTCCG ACGCCGACAC GCTCGACGAC
GACGCGTCGA CGCAGCGAAG TCAATCCTTT AGCGATTACG AGAACGATGA TCAACGATTC
GACGGCGCCA GGCGACCGGC ATGGCTCGAC TAGGACGAAA CGTACA
 
Protein sequence
MDTNWTPATP ARARADARIA SDKVLSTKAF EDARVKDHRD VERERGVYLP PSAEEAATHA 
DSAQPRLLSE DGKLEKLKYF DSEVDLGLEF GAGVSGYFFV LRAFACLFFA AFALNLPAMF
INYTSTYYAS HYEAEPETAT APTADALSGR PTTSFHSWFV DWHWANPMSA SLGTVAPDDV
PQRAWVLAGT TSGVLRWSSS KIDFIRFSVV MDVVVSLIII LSVPVIMLVL MRTERRVERG
TATLKDYTVL VTGLPSDATD EEVRCFFALR FGTVADVVLV KTECMQINAQ RRRRRLLEDY
DEAEAALIAA GNRGGDGTKT AIENEILAVD RKLKRRRART RAKQCSAFIT FETEGSKIDC
ILRNTRSIMS YIFAFPRKER FRGKRKYRVR DAPEPEDVRF ENLNLSNRSW RRLVVLFSCS
AVVLLCYGFL KMLVDDKEKL WENADMMVTT LADDVGIVVA HGNPVEQFET HKNQFRTACR
ARLDQCGVAF SKDKTYVGMP WGAPIYAFYD YPNATLLDRR YAQQDAVRDL TNCAQDTNRC
PGGPTMQNCH ACYCASLKYG LTSEVVRAYN KAIRHSCAKY VNLGPGEYYN WLWVSFCITL
MNVLLEWIVP LLVAAERLRT RSATKVLKTK IIFWVRYLNV AVIYGLLNAN FYHIGRYFPL
IKQMFGLKGE YADFTSEWFN DVGLVLFFAI MMSVTIRILA RVLVDIITHV GRKFSVAYCH
TQAKLNKAFE GPSFDTGAKC GDVCFTIMAA MTFSSGMPLI YLVLSMYFVL VYLYDYRLLL
KVCKLPERSK STLPMTAAKV LFISVSIHAL IGLWMFSYHW TPDLAKPTKD FEHSSKNNAP
PLAYEIGGEI LNPPHDNGAL TAIVQSTGVA TTYFHQYRDA TVGALSAGDF VAPPPRVQLR
FAERPFSEAG MPFMGMFFAL LGVMGLWQIA VAWHNWGKSR RDIARSWKNL PQYHEAIMTG
LIVGSETYRP EYQPDYAFLF DKSTVAAAKM KLGSYAGGPV RGDSVNSRDA WSLGQGDDDD
IAELIHHRYD NEREYDGGAP WVRKTGSKIY GGDAATAKRG SRGTRQRDGG HYGVPVVDVR
ALGVGSDYNH VENNAFGHSD AFVADDHEWD SASDADTLDD DASTQRSQSF SDYENDDQRF
DGARRPAWLD
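
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The following is a minimal sketch, not the database's own method. The function names (clean_sequence, gc_fraction, translate) are hypothetical. The translation assumes the standard genetic code, because the Translation table field of this record is blank.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (assumed; the record leaves the table blank).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T are reported as 'X'.
    """
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGGACACGA ATTGGACGCC AGCGACGCCC GCGCGCGCGC GTGCGGACGC ACGAATCGCG"
print(translate(clean_sequence(first_line)))  # prints: MDTNWTPATPARARADARIA
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Applying gc_fraction to the full cleaned gene sequence gives a value that can be compared with the GC content field.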