Gene OSTLU_25083 details

Gene Information

Locus tag: OSTLU_25083
Symbol:
ID: 5003848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 583512
End bp: 585794
Gene Length: 2283 bp
Protein Length: 740 aa
Translation table:
GC content: 60%
IMG OID: 640419269
Product: predicted protein
Protein accession: XP_001419639
Protein GI: 145350494
COG category:
COG ID:
TIGRFAM ID:
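
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming Start bp and End bp are 1-based inclusive positions (the record does not state this). The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP, END_BP = 583512, 585794
GENE_LENGTH_BP = 2283
PROTEIN_LENGTH_AA = 740

length = span_length(START_BP, END_BP)
print(length == GENE_LENGTH_BP)  # True: coordinates agree with Gene Length

# Bases accounted for by the protein plus one stop codon.
coding_bp = (PROTEIN_LENGTH_AA + 1) * 3
print(length - coding_bp)  # 60: bases in the span beyond protein + stop
```

Under that assumption the span is 2283 bp, matching Gene Length. The 740 aa protein plus one stop codon accounts for 2223 bp, so 60 bp of the listed span are not explained by the protein alone.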


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.156442
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.477507
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCGT GCGCGATGGG AAACGATGCC GCGGAGCCGT ACGCGAGCGC CGCGGGACGC 
GCGACGCAGG AGGTGCGGAA AGCGAGCGAC GCGACGCGCG GGGGACGGGG AGAGATCAGA
GGAGAGACGC CGGATTTGCC GCTGTCTTAC GCTCGAAACG GCGGGACGTG CTCGACGAGC
GATGACTCGA GGCCGAGTTC GAGGATGGCG ATGGGCAAGG ATGGGAATTC GCGCGCGAGC
GGGGACGGGA CGACGTCGGG AATGGAGGAA GAGCGCAGAC GAGCGAGAGA GGCGACTCGC
GCGTGGTTCT CGGAGCGCGC GGCGAACGGA GATCTGGAGC TCATGCGCGC GGAGTTACAC
TCGATGGGAC AGAAGGAGTT ACAGCGCATG TTTGTCGAGA TGTTTGACCG CGCGACGACG
AGCAACAATA ATCAGTGGTT GCGAAGACGA ATCGCCAACG GGTTGGGCTT GGAGGACGTC
GCCGAGCACG TCGTGAGCTC GCAAGCCAAG TTGGCTTCGG CGACGAACGA ACGCGTTCCA
TCGAAGAGAC AAAGTGCGCG TCGCAATGCG GTGGAAACCG CGGCAGTAAC ACCGGCGGCA
CAGGCGACGC CGAACACGCG GGGCGCGGAG TCCCCGGACG AGGCGGCGGA GTTCACCCCT
GATGGCGTTC GCAAGTCTCG TCGAGCGGTG AAGCCGAAGG CGATATTTGA TTCCGACGCG
TTTCCTTCCG CGGCCAAGGT GAGTGAGGAG CGTAGGCGAG AACGAAAGCG ACAAGAGGCA
TTGGAGCAGG CGGCGACGTG CGCGAGTGGT GTAACGACGG ATCATCACGG CGGCAAGGCC
GCCGTAGGAC GCCGCGTGCG AGTGTACTGG CCTCTCGAAG GCAAGTTCTT TGCGGGTGTC
GTCACTGCGT ACAACGCGCG CACTGGCTTG CACCACATCG ACTACGACGA CGGCGACAAG
GAAGAAATCA AGCTGGCGAC GCCCGAACAG CCTCGCAAGT TTGAGGCCAT CGAGCACCCG
GCGTTGGCGT CGACGCCTTC TCCGGCGAGT GACTGGCCTG CACTTACGAA TACTGTCGAT
TCATCGATAT CGTTGTCTAT GCCGCAGCCG GGACGACCTG CTGATATTCT CAGTACGCTC
CCGTCAAGCT GGCCGGCGGT GGGATCGCTT GTGTGGGGTC GGGTGCGCGG TCACGGCTGG
TGGCCCGGAG CGGTGCACGA CAAGGACGCG AGTCACGACA TGCAAGAGAT CAGTTTCTTC
GACAACAGTA GGGCTCGACT TCACCGCCAC GACTTGTTGC CCTTTCAACA GTATTACATG
GTGCTGCGCG ACGCGAAAAA GACGCACGCG TACGCCGAGG CGGTTTCTCG AGCGGCGGAA
ACGTATGAAA GCCGACGGCA GCGAACGGTG AAGCGACGTT CGAAGAAGGA AGAGCAATCA
AACGTCGAAG AGTCGAGTGA GCCGCCGAGA ATATGGCATT TCGAACGCGA GGGGGCAAAG
AAGTCGCACA AGCGGTCGAA AGACGTTGAC GTCGACGACG CAGGCGCGGC GAAGCGAGGC
AAAGTGAACG AACACTCGAC GACGGTATTC GGTTCGATTG AAGCTCCGAA GACTCTGAAC
GATCTAAATA AGAGCTTGGA AGAGATGAAG GCCAAGATGT TACCGCTCGC GAAAGAGAGC
CGAAAGACGC TCAACAAACA TCTGGTAGAC ACCGCCAGAA AAGTGAAAAC GGAGGCGGCT
GACGACGACG AAGAGACACT GATCGCCGCC GATGAACGTG TCGTGGTGCT CGACGAGTTG
GCGTCCATCG AATCCTTGAT TGCGTGGAGC GAAAACAAAG CTGCGAAGTC GCCCGCCAAG
ACTCCCGATT TGTCCTTCAT GACAAAACGC GGCGATGACA TTTCACCCTC GGACCCGAAA
GGGCTTCTAC GACAACCGAG TGAAAACTTG TTGTGCTTGG GTGAGATTAG TGAGTTTTTC
GACGGTACCG CGCGAGCGGA TCCCTTGGGA ATCGACGATC CAATTGTCGC CGATTTGGGC
GCGGACGAAG TCGCCAAGAT GACCGCGCCA TCCACGCCCG AGCAACACGG CATCGAAGGC
GAGGGCAGTC CGTTCGCAAA AAGCAACATG AGTGACTCGT GCACCACGCT TCACGCCTCT
TACGGTGACA AAAAGATTAA CGTCTCAGAG ACGCCGGTGA CGAAAGGAGC GCTCTCCGCC
TGAGCGTAGA GGCGATAGAT TACACAACAT TTTCCACCAC CGTCCTCTTC GGTGGGTCCA
CTT
 
Protein sequence
MPACAMGNDA AEPYASAAGR ATQEVRKASD ATRGGRGEIR GETPDLPLSY ARNGGTCSTS 
DDSRPSSRMA MGKDGNSRAS GDGTTSGMEE ERRRAREATR AWFSERAANG DLELMRAELH
SMGQKELQRM FVEMFDRATT SNNNQWLRRR IANGLGLEDV AEHVVSSQAK LASATNERVP
SKRQSARRNA VETAAVTPAA QATPNTRGAE SPDEAAEFTP DGVRKSRRAV KPKAIFDSDA
FPSAAKVSEE RRRERKRQEA LEQAATCASG VTTDHHGGKA AVGRRVRVYW PLEGKFFAGV
VTAYNARTGL HHIDYDDGDK EEIKLATPEQ PRKFEAIEHP ALASTPSPAS DWPALTNTVD
SSISLSMPQP GRPADILSTL PSSWPAVGSL VWGRVRGHGW WPGAVHDKDA SHDMQEISFF
DNSRARLHRH DLLPFQQYYM VLRDAKKTHA YAEAVSRAAE TYESRRQRTV KRRSKKEEQS
NVEESSEPPR IWHFEREGAK KSHKRSKDVD VDDAGAAKRG KVNEHSTTVF GSIEAPKTLN
DLNKSLEEMK AKMLPLAKES RKTLNKHLVD TARKVKTEAA DDDEETLIAA DERVVVLDEL
ASIESLIAWS ENKAAKSPAK TPDLSFMTKR GDDISPSDPK GLLRQPSENL LCLGEISEFF
DGTARADPLG IDDPIVADLG ADEVAKMTAP STPEQHGIEG EGSPFAKSNM SDSCTTLHAS
YGDKKINVSE TPVTKGALSA
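
The relationship between the two listings above can be checked by translating the gene sequence and comparing it with the protein sequence. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field of this record is blank. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first three 10-base blocks of the gene sequence are used here; pasting the full listing works the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1). This is an assumption,
# because the record's "Translation table" field is blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the 10-character display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGCCGGCGT GCGCGATGGG AAACGATGCC"
print(translate(first_blocks))  # MPACAMGNDA
```

The printed peptide matches the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.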