Gene OSTLU_86314 details


Gene Information

Locus tag: OSTLU_86314
Symbol:
ID: 4999386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009355
Strand:
Start bp: 1067041
End bp: 1070166
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table:
GC content: 60%
IMG OID: 640414807
Product: predicted protein
Protein accession: XP_001416019
Protein GI: 145341857
COG category:
COG ID:
TIGRFAM ID:
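The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information record above.
START_BP = 1067041
END_BP = 1070166
GENE_LENGTH_BP = 3126
PROTEIN_LENGTH_AA = 1041


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints "3126 1041"
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.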


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAGCG ACGGCGATCT CGCGAACGAT GAGAACGCCG ACGCGGTCGC GCCGAAGAAA 
CGCGCCGAAG CGATCGCGAA CGATGACGGT GACGCGTTGA ATCCGTTGGT GCCGCTGCGC
GCGCTCGACG ATGAGACGCT CGCGAGCGCG CGCGCGACGC GGGAGAGCGA CGCGGACGCG
CGGACGACGG TGGAGGATGA CGTGCGCGCG GAGGAGTTCG CGTTCGCGGA GGCGCTGATG
CGGGTGAAAC GGGAAGGCGC GGGCGCGGGG GGGGCGGCGG CGATGTTCGA AGACGCGTGT
CGGTTGCGAG CGCGGGAACT GCGGACGCGA GCGGAGGCGC GAAGACGGCG AGCGCCGAGC
GTGGCGAAGC GTGAGGTGGG CGAGGCCGAG GCGCTGGAGC GGGAGGCGCA CTCGTGGTCG
TTGATTTATC ACTTGCTCGG CGACGGGGCG ACGGTGGAGC GGGAGAGCGC GGAAAAGGAA
ACTGAAGTGC TGCGCGCGAC GCCGAGGGAG GAGGGCGGGA CGAGAGGAGA CTTTCTTCCG
CCGCCGCTTC GCAGTCGACT GCGATGCGCG TCTCGAGACG AAGAGCGAGA TCCGGTGACG
TTTAGATTGA ATAGAATAAT CGCGTGGTTG GAGGCCAACT CGGCTTCAGC GCTTCGCAGA
GCGGAGCTGG ATGGCACGGC GTACGATGGA AGGTTTTTGC GAGATGAGTG CGATTGGCGC
GCAACCGCGG ATGCCATCGA CGCGTCGGCT AAGTGTGATC CGGATGGTAA TCCGCTGTCG
ACGTCGCTGG ACCCGGACGG GCCCATGCGC ACGAAATCGG CGCTGCACCC GTCCAACGCG
GACGCGGAGG TTCGATTGTG TAAACGGTTG TGGAAGCTCA TTCGCGCTGG GAGCGTGCAA
GAGGCGCGCG ATTTGTGCTC CAAAGTCGGT CAGCACTGGC GCGCCGCCTC GCTTGGCGGT
GCCTCGGGCT GGGGACCGGC TCCAGTCGGC AGCACTGCTG ACGAAGAGCT CGAGCGAGAC
ATTCGTAAGC TCTTGGCGCT TCGCGACGAG GACGCGCTCG CGGCACAAAA TGAAGTTGAC
CTCAACGACG ATGCTACTGC GGCCGAGTGC GACGGTATCG GCACTGCGCG TCGCGCGCTG
TGGAAATGGA CCTGCATGGT AGCCGCACGT CACATCGACA AAGCCGGCAA GCTTTCGCAA
ACGCCCGCGG CCAAGTATGA GGCGAGTGTG TACGGGGCGC TATGCGGTGA TTTACAAACG
ATGCTCGCCG TTTGCGAGGG TGATTGGGAG TCCACGGCGT GGGCTTACAC CCGAGCGCTG
TTCGATCTTC GGGTCGATGC CGTCGTGAAC ACGGGCAAGG TGCTCGACGA CGTGTCAAAC
TTTGAACCTG GCGAAGTCGT GCGAGATCCG ACTGAGCTAG AGACGACGGA TGACGCTGTG
GATCGTTTGG GCGAACCACG ATGGCCAACG CGAGACGTCA TCAACGCGAC GCCAAAGACA
GTGGAAGAAA TTTTGCTAGT CAAGATGCCC GAACGTTTTC CCGACGCCGA CGCGCATCGA
ACGGTGCAGA CGCACTTGAT TCTCGGTAAG ATGAAGGAGT TATTGTTGGA CCATATGATG
CGATGGATCT TCCCTGAGGA CGAGCTTGAT TCAAATGTCG AACGTGTAAG TTCCGAGCCG
CTCGACATTG GTCTCACCCG CTTTACCGCG CATGCGCTAC TGTTTTTAGA GTCGTTGTTG
CCGGAAGGTG GTGGATTATC TCCCGGAGGC GAGCTTTACT TTCACTTGAA TAAGGTGCTC
AACTTGTACG TTGTGCACTT GATCGCAAAC AAGCGTTACG CGTTGGTGCC AGCGTACGTG
GTGCACTTGA GACACCCTCT GTTGATTGAA ACGTACGCCA ATTTCTTGGA TCTCCTGGCG
CCCGCCGTGC TTTCTCGCAA GACGCTGTGT TACGCCGAGG CGGCGCTTTG GATGGAAATA
GAGGGCCCGG GCGGGTGGCG AGAAATCGTC ACGAGAGCCT TAAGCGACTC CACGAGTCTC
GTGAACGTTC ACCGAGGACC CGAGTATCGA CGTTTGATGC TCCAGTGGGC GTGCGTTACG
AGTGAGACGT ACCCAGAAGC GGTGAAACAC GCGTGTTTGC TGCTCCGCCA ACTCATGTGC
CAGCGAACGT CGATCGAGGT CTTTGCGAAT GACGCCCCAG TGGATGGTGA ACTCCGTGCT
CGCGTCATCC TACTCGAGGA GCTTCCAGAA GTCGCGCAAG AGGAAGCGAG AGCGAACGGA
GCCGCGGCCG CTGCGGCTGA ACTTGCCGAC TGGGCGAGAT ACCTCGCCGC CACGGAGGCT
ATTTCGCAAT GGAAGCAAGT CTGGAGCGTG AACGAATCGA GACGACTAGA CGTAGCGGCA
CAAAACACTC GCCCGTACGC TCCAAATTCA GCGGGGGAAG TGACTGAAGA CGAGCTGGCG
AAGGCACGCG ACGCCATCGA CGCCGTCGTG GCGCTTCTTC GGTCGGAGAA TTGGCTCGAC
GACGAAGCGT TGTACGACGA CATGGAACAA ACGGGCGACG CGACGCTACG GGTTGTCGCG
ATTCCCGTCG CCAAAGCTTT CGACGACCCA TTGACTTCGG CGAATATGAG TATTGAGCAA
ATCGCCCGAG ATCTCGAGAC GTTGCTTGGC TCAAAGTTCG CTCAAGGCGT CGTTGAGGTG
AGCGCGACTG TAGGCGTCAC GCCCGGCGAG TGTCCTGCGC GCGTGGAAGG TGAGTACGGT
CAAGTCGTCG TGCAGATCTC CACCGAATGC AACGATGAAG ATCGAGCGTC GCTATACCAA
GACGTCTCGC TGGCGATGGC CGACTGCGTC AAGGGTGATC TCCCAGGACA AGAAGTCACG
CTCGACGTAC AATCAGTCGG TGGCAGTAGC GAGACGTTGG TACATGCGTT ATGTCGGGCA
ATCTGCGTGC CTTCGCTCGT GATCCAAGCG GCGCAAGTCG AAGCCGCCAC GCGCACTGGA
ACGACACAAA TAATTGAAAT GACCGCCGAC CCGAAGTTTG GTGTGCACAA GTATTTCGCC
CCGACGGAAC TCCGATGGTT GTTGGAACTC GGACGCGAGA TCGGTTTGAC AATTTTGGAC
AAATAG
 
Protein sequence
MPSDGDLAND ENADAVAPKK RAEAIANDDG DALNPLVPLR ALDDETLASA RATRESDADA 
RTTVEDDVRA EEFAFAEALM RVKREGAGAG GAAAMFEDAC RLRARELRTR AEARRRRAPS
VAKREVGEAE ALEREAHSWS LIYHLLGDGA TVERESAEKE TEVLRATPRE EGGTRGDFLP
PPLRSRLRCA SRDEERDPVT FRLNRIIAWL EANSASALRR AELDGTAYDG RFLRDECDWR
ATADAIDASA KCDPDGNPLS TSLDPDGPMR TKSALHPSNA DAEVRLCKRL WKLIRAGSVQ
EARDLCSKVG QHWRAASLGG ASGWGPAPVG STADEELERD IRKLLALRDE DALAAQNEVD
LNDDATAAEC DGIGTARRAL WKWTCMVAAR HIDKAGKLSQ TPAAKYEASV YGALCGDLQT
MLAVCEGDWE STAWAYTRAL FDLRVDAVVN TGKVLDDVSN FEPGEVVRDP TELETTDDAV
DRLGEPRWPT RDVINATPKT VEEILLVKMP ERFPDADAHR TVQTHLILGK MKELLLDHMM
RWIFPEDELD SNVERVSSEP LDIGLTRFTA HALLFLESLL PEGGGLSPGG ELYFHLNKVL
NLYVVHLIAN KRYALVPAYV VHLRHPLLIE TYANFLDLLA PAVLSRKTLC YAEAALWMEI
EGPGGWREIV TRALSDSTSL VNVHRGPEYR RLMLQWACVT SETYPEAVKH ACLLLRQLMC
QRTSIEVFAN DAPVDGELRA RVILLEELPE VAQEEARANG AAAAAAELAD WARYLAATEA
ISQWKQVWSV NESRRLDVAA QNTRPYAPNS AGEVTEDELA KARDAIDAVV ALLRSENWLD
DEALYDDMEQ TGDATLRVVA IPVAKAFDDP LTSANMSIEQ IARDLETLLG SKFAQGVVEV
SATVGVTPGE CPARVEGEYG QVVVQISTEC NDEDRASLYQ DVSLAMADCV KGDLPGQEVT
LDVQSVGGSS ETLVHALCRA ICVPSLVIQA AQVEAATRTG TTQIIEMTAD PKFGVHKYFA
PTELRWLLEL GREIGLTILD K
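
The gene and protein sequences above are printed in space-separated blocks of ten, so they need cleaning before any comparison. The following is a minimal sketch that strips the block spacing and translates the first and last codons of the gene sequence. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field of the record is blank. The helper names `clean` and `translate` are hypothetical.

```python
# Standard genetic code (assumed), laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
head = "ATGCCGAGCG ACGGCGATCT CGCGAACGAT"
# Last three blocks of the gene sequence; the final 18 bases are in frame.
tail = clean("TCGGTTTGAC AATTTTGGAC AAATAG")[-18:]

print(translate(head))  # prints "MPSDGDLAND"
print(translate(tail))  # prints "TILDK*"
```

Both results match the listed protein sequence, which starts with MPSDGDLAND and ends with TILD K. The full sequences can be checked the same way by passing the complete gene sequence to `translate` and comparing the result, minus the trailing `*`, with `clean` applied to the protein sequence.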