Gene OSTLU_33254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_33254 
Symbol 
ID5003141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp630253 
End bp634503 
Gene Length4251 bp 
Protein Length1376 aa 
Translation table 
GC content61% 
IMG OID640418562 
Productpredicted protein 
Protein accessionXP_001419442 
Protein GI145350062 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.218113 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACACCGCGC TCGTCGTCAA AAAAAGTCAG AAAGTTAAGG ATGGCATCGC GCGGTGCCGA 
ACGCGCGCCA CTGCTCGCGG GCGCGTCGCG CGCCGCCGCC GACGACGTCG AAGGAGGCGC
CCAACACCCC GCGATGATCC GGCCGAATCA TCGCTCGAAG GCGTGGACGA CGAAGCTCGC
GATAGGCGCG ACGCTCGTCG CCGTCGGCGC GCTCATCGGT GGCCTGACGA CGTCCCGCAA
CGGCGCGAGC GGGTCGTCGA CGTCGCTGTC GTCGTGGTTG GCGGGCTTCG CGGCGCCGTC
GAGCGCGGAG GCTCGCGAGG GCGCCGGGCG ACCGGGACGC GATGTGAAAA TCACGCTCTT
GACGGCGTGT TCGCCGCACG ATATGATTCG AATGAATCCA GGGACGTGGC ATCACAGAGT
CGGGGCGAAG GTGACGACGA AGGATCACGA TAACGAGATG GCGTTTAGGA TGGCGCACCA
CTTGCGCGAA GATCCGCTCA GGTGCGGGAC GTACACCGGT ACGGTACGAT TGCGCGCGGG
TGTGCAGATG AACTTCTGGC TGTATCCGAT CAAGAACGAT GCGAACGAAA CGTTGGTGTA
CGATGAACAG TTTGAATTCG CGCAGAGCGA CTCGGGATGC CGCGACGAAG GCAAAGGAGG
CGGCGGCGAT CCCAACTGGG ATCGTTGCAG TCCAAAGTTC TCGCCCGCGC CTCTCGCGTT
GGTGGGCACC GGTACCAACG ATGGGAATTG CGTGCGGCGA TTCGATGTGT ATTACGACGA
ACCCGATCCT GGGCACGAGA CGTTTTTCAA TCGTATTTAC GACGGGCGTC AGACCACTTT
CGTGTGGGGG TCGTGCGGGA ACGCGTGCGC CGACTACCAG CCGCCGCAAT GTCCGGTGTG
CGCGATCAAC GAGTTCGTGC AAGGACACAG GTGTACGCCG TGTCCGCCCG GTTCGACGCA
CTACGCTGGA GCGCAACCCG CGGGCCCGGA CTCACAGTGC GCTCCCATTC GATGCGGTCC
GAACGAACTC GTGAAAAATC ACGAGTGCGC GGCGTGCCCG CCCGGCACGA CGAACGATCA
AGGCGACAGC TGTATGGAAG CCGATAGTTC CTGCGATCAC ACGTTGTGTG GAATCCATCA
AAAAGTTTCC AACCACGTGT GCGTCCCATG CGAACCTGGC AAAGAAAACG CCGCGGGCGA
CGACGCGTTC GGCGACGACA CTGAGTGCAC CGACATCGTT TGTGGGTCCA ACCAGTACGT
CAGAAACAAC ATCTGCCGAA ACTGTCCAGC GGGAACAACT CGACCGATGG GAGACATCGC
GTCGGCGGAA GACACGACGT GTATCCCGAC GTATTGTGGC GAACACGAAC GAGTCTCGTA
CCACGTCTGT CGCGCCTGCG CGGCTGGGAC GGAGAACGCT GCGGGCGACG ACGCCTCCGG
GCAAAACACT TATTGTGAAG CGATCATGTG CGGAAGTAAC GAACGGGTTC AGAACCACTT
GTGCGTCGCG TGTCCGGCCG GTTCAACATC GCCCTCTGGC GCAAACGCCG CCGGTGGCAA
CACTGCGTGC ACGCCGACGC TCTGCACCGA ACACATGCGA GTGGTTGGTA ACGCGTGCGT
GCCTTGCCCG GCTGGCTCGG AGAACGCCGC GGGTGATGAC GCAACGGGTG ATGACACTAC
GTGTGATTAC GGTATATGCG AAGAAAATCA GCATATCAAA AATCACGTCT GCACCGACTG
TGCGCCGGGG ACGCATCGAA CCGCTGGCGA TATCACGACG AGCGCCGACA CAACCTGCCC
CGCAATCCTC TGCGGCGTCA ATCAAAGAGT CTTAGGCAAC CAATGCGTGG GTTGTCCGCT
TGGAACGACG AACGACGCCG GCGACGACGC TTCGGGCAGC GATACCGAGT GCGATCCCAT
CATTTGCGGG TTGAATCAGC ACGTCATATC TCACGCGTGC GTCGCGTGCC TTGACGGCTC
TGCGAACGAA GCCGGCGATA ACGCCGCTTT AGGCAACACG GATTGCGACG TCCAAATGTG
CGGCACGAAT GAAGCCGTGG AGTCAGCCGT GTGTCATAAG TGTCCCGACG ACGCATCTCA
CGAAGCTGGA GATTCCCCGA CCGGAGGCGA TACCTCGTGC GATGACCTCT ACTGCCACGT
TAACATGAAA GTTCTCAGTG GTGCGTGCGA GGCGTGCCCA CCCGGGACGG TGAACGCCAA
TAACGATCCG GTGAGCGGTG GTGATACGAC GTGCGATCCG GTGCTTTGTC CCGTCCACAT
GCGAGTTCAA GACAACGCGT GCGTGCCGTG CGACGCTGGC ATGGAAAACG CTGCGGGCGA
TGACGCGAGC GGGAAAGACA CGACATGTGA CATCGTGCTC TGCGAAGAAA ACCATCGCGT
GTCTGGCGGT GCGTGCGTGC CGTGCGAGCC GGGTGAGTCG AATGAAGCCG GTGACTACAT
GCTCGGCGCG GACACCACGT GCGAAGCGAT CCTTTGCGGC GTCAACCAAA AAGTCTCAGG
CAATGAATGC GTATCGTGCG AAGCTGGGAC GACGAACGAC GCTCAAGACG ATGCAAGCGG
CGATGATACC ACATGCGACC CCGTCGTGTG CATCGCGAAT CAATACGTGA CGGAAAATAC
GTGCGCGTCG TGCGCTGCGG GAACGACGGC GCCGCCGGGC TCTCTCGCGA ACGGCGACGA
CACGACGTGT GCGGCGACGC TTTGCGCGAC AAACGAACGC GTCGACGAGA ATCACGCGTG
TGTACCGTGC GACCCGGGGA TGACCAGCGC CTCCGGCGCC GACGCCAGCG GCGCGCCGAC
AACGTGCAAA GCTATTCAGT GCCTCGCCAA TCAGCGCGTG CAAGATCACG CGTGCGTATC
GTGCGATCCG GGCTTCACCG TCGCTCCGGG CGGCGACGCC TCGGGTGATG ATACGACGTG
TCAGCTCGAC AAGAACGCGG CGTGCCAAAC GAATCAGCAC GTCGTGGACG GCGCGTGCGT
CCCCTGCCCC CCGGGAACCA CCAATCGCCC CGGCGACTAT TACACCGGCG AGGACACGAC
GTGCGACGCC GTTTTGTGTG GCGCACACGA GTACGTCTCT TCGCACGTGT GCACGCCGTG
TGCGTCCGGC CGAGAAGCTC TCGGTGGTGC GGATGCGAGC GGTGATGATA CTGTGTGCGG
CGATCCTATT CCCGCCACCG CCACCAAAGT GACGCCGGAA TCGGCGCAGG ACACGTACAC
CGCGCCGCTT TCGAAATCCG GGAGTGGCAA GGTGACGATG TCGATCCCCG GCGACCCGGG
CGCCCCAGTT GGAGAGTTTT CCCTCATCGC AACGGCGTAC GGCGCCGAAA TCAAACAAGA
CGTTCGTCAA GTGTCGTGGT ACGCGTGGAG GGCGGCATCT CGCAGCCTGA ACGTCCACTC
TCAGTGGGAC AGGAGCAAAA GTCAACCATG GCTACAAAAC GCGAACGTTG CGTATCTCTT
CACCTTGAGC GAGCTCGTCG GCCAAGGAAT CGTAGTGGGT ATGGGCGACC ACAATACCAG
GCTCCATAGT CTACCATGGT ACTCGGGCAG TTCGATGAAG ACGCTGCCCG CGACGCAGTC
GTACAGCAAA AAAGGCTTGG CGACTGGAGC GCATCCCGTC ATTTTGATCA ACACTTACAC
GCCCAACAAC TGGCCGGATA CGTTTGTGTG CAACGTGTTC AAACTCAGCA CGAAAGGTGG
GACGTTCAGA GTGCGCGTCG GACGCGCGGA TAGCCACGGT CATGGTTGGG GCGATAACGA
CATTCGCGTC GTGTACGCGG CGTTCAAGCC AAACGCCCCG TCGTTGACGA ATGACATTCT
CGCCGCCGGC GAGATTCAAC TTCCGAAGCC TCTCCCGATC GGCAAAAAGA CGTTTTCAAA
CGTGACGCTC GAGCACAACT TGAACAGGCT CGACTATCAA TGCCTTGCGA CGGTTCGAAT
CGACACCGAC AAGGACCCCA CTCGCTCCAT CGCTGTCGAA TGTCAAGCCG GAACCGCGAA
CACCATCACC TTTATGGCTG CGGATTTGGA ACTTCGCGCT TGGGACACAT ACGGGCCGGA
TGTGTTCGTG TCGTACATGC TCATCCCACT CGTCGGTGCG CCGGTAATTA AACCCGATCA
CTTTTCGGGC GTTCCGACGA TCGTCGGTTG AGAGCGCTTT AATTTCAGAG TTTTAATTAC
AGACCCTAAA GGTCGAAATC GCGCAGAGCT TCGAGCGCCA GGGCGTAGTC T
 
Protein sequence
MASRGAERAP LLAGASRAAA DDVEGGAQHP AMIRPNHRSK AWTTKLAIGA TLVAVGALIG 
GLTTSRNGAS GSSTSLSSWL AGFAAPSSAE AREGAGRPGR DVKITLLTAC SPHDMIRMNP
GTWHHRVGAK VTTKDHDNEM AFRMAHHLRE DPLRCGTYTG TVRLRAGVQM NFWLYPIKND
ANETLVYDEQ FEFAQSDSGC RDEGKGGGGD PNWDRCSPKF SPAPLALVGT GTNDGNCVRR
FDVYYDEPDP GHETFFNRIY DGRQTTFVWG SCGNACADYQ PPQCPVCAIN EFVQGHRCTP
CPPGSTHYAG AQPAGPDSQC APIRCGPNEL VKNHECAACP PGTTNDQGDS CMEADSSCDH
TLCGIHQKVS NHVCVPCEPG KENAAGDDAF GDDTECTDIV CGSNQYVRNN ICRNCPAGTT
RPMGDIASAE DTTCIPTYCG EHERVSYHVC RACAAGTENA AGDDASGQNT YCEAIMCGSN
ERVQNHLCVA CPAGSTSPSG ANAAGGNTAC TPTLCTEHMR VVGNACVPCP AGSENAAGDD
ATGDDTTCDY GICEENQHIK NHVCTDCAPG THRTAGDITT SADTTCPAIL CGVNQRVLGN
QCVGCPLGTT NDAGDDASGS DTECDPIICG LNQHVISHAC VACLDGSANE AGDNAALGNT
DCDVQMCGTN EAVESAVCHK CPDDASHEAG DSPTGGDTSC DDLYCHVNMK VLSGACEACP
PGTVNANNDP VSGGDTTCDP VLCPVHMRVQ DNACVPCDAG MENAAGDDAS GKDTTCDIVL
CEENHRVSGG ACVPCEPGES NEAGDYMLGA DTTCEAILCG VNQKVSGNEC VSCEAGTTND
AQDDASGDDT TCDPVVCIAN QYVTENTCAS CAAGTTAPPG SLANGDDTTC AATLCATNER
VDENHACVPC DPGMTSASGA DASGAPTTCK AIQCLANQRV QDHACVSCDP GFTVAPGGDA
SGDDTTCQLD KNAACQTNQH VVDGACVPCP PGTTNRPGDY YTGEDTTCDA VLCGAHEYVS
SHVCTPCASG REALGGADAS GDDTVCGDPI PATATKVTPE SAQDTYTAPL SKSGSGKVTM
SIPGDPGAPV GEFSLIATAY GAEIKQDVRQ VSWYAWRAAS RSLNVHSQWD RSKSQPWLQN
ANVAYLFTLS ELVGQGIVVG MGDHNTRLHS LPWYSGSSMK TLPATQSYSK KGLATGAHPV
ILINTYTPNN WPDTFVCNVF KLSTKGGTFR VRVGRADSHG HGWGDNDIRV VYAAFKPNAP
SLTNDILAAG EIQLPKPLPI GKKTFSNVTL EHNLNRLDYQ CLATVRIDTD KDPTRSIAVE
CQAGTANTIT FMAADLELRA WDTYGPDVFV SYMLIPLVGA PVIKPDHFSG VPTIVG