Gene OSTLU_16194 details

Gene Information

Locus tag: OSTLU_16194
Symbol:
ID: 5002939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009361
Strand:
Start bp: 521112
End bp: 523562
Gene Length: 2451 bp
Protein Length: 816 aa
Translation table:
GC content: 59%
IMG OID: 640418360
Product: predicted protein
Protein accession: XP_001418731
Protein GI: 145348594
COG category:
COG ID:
TIGRFAM ID:
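
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields.
START_BP = 521112
END_BP = 523562
GENE_LENGTH_BP = 2451
PROTEIN_LENGTH_AA = 816


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))            # 2451
print(expected_protein_length(GENE_LENGTH_BP))  # 816
```

Both results agree with the listed Gene Length (2451 bp) and Protein Length (816 aa).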


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.84941
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCGG CGATGCGGGA CGCGAAGGCG TCGCTGCGTG ACTTTTCGAG CGACGATTTC 
GATCGCGTGC GGTGGATTAA CGAGCAAACG CGCGCGCGAA TCGGCGACGC GGGAGGGAGC
GGGCGCGACG CGACGGCGCG GGATGGATCG GCGGTGGTGG CGCGCGCGGT CGGAGAGGGG
ACGCGAGGGA CGAGCGCGTC GCCGTTGGAA CGATTTTTGG CGGATTTGGA ACTGCAGCTG
CAGTTGCTCG GCGAAGATCT GTCGATGTCG CTGGAAGAGC GCTCGCGCGA GGGTGTCGCG
CGCGTGCCCA AGGCGGTGAA GGAGATAGAA GTCGTCGAGG GGCGGGTGAA GCGTCTGCAC
GAGGAAGTGC GAGGGATATT GGATCGATTG GACGAGGTGG AGTCGGAGTC GCGCGCGAGC
GTGGAAGCGC TGCGACAGCT CGACGCGGCG AAACAACGCA TGGAGAGCGC GCGAGAGACG
CTGCAGGAGG CGAATGGACT AGCAGATTTG ATGGCGAGTG TGGACGGGAT CTTTGCGAGC
GGGAATATTC GTAACATGTC CGAGTCGTTG GCGCGGATGA AGCGCGGTTT AGCCGTCGTG
GGCGACGTGC CAGAATTCGC CGACGGGCAA GACAAGGTGA ACGCGTTCGA GCACAAGCTC
GAGGCGATCG TCAGACCGGC CTTGATCACG GCGTTGGAGT CACAAAACAG CACGGCGGCT
CGCGAACATC GCGACGTGTT GCGCACTACC GGCCGCGGTT CGGCGTTGGA GAGCATTTAT
GCGGACACGC GCGTGACGTC TCGCATGCTC AAGCAGTGGA AGTCGCGCGA GCGCGATGCG
ACGACGACTG ACGCTGGGAG GCTGGACGCT TCGGTGCATA TGATCGATGA ATTCTTGAAG
TATTGCGCCA CCGCTCTCAA AGAGGAGATA TCTTGGTGCT CGAGTACGTT CCCCGAAGAC
GCCGTGCTTT TGATTCCGTT GTCTTGGTGT TCGCTTCACA CCACGCTCGA AACGTCCATC
ACGGAGAAGC TCAGCTCGCT AACGCTCGAG CAACTCGTCT CGGCTCGAAA TTCGTTCCAA
ACGTACGTCG AAGACGTTGG GATAGCGTTT ACAAAACTCG CCGGCGACGC TGCAGCGCAC
GCGAACGCCG ACGCCGTCAA GGGCGCCATC GGCGACGCGT TGATGGCGAT TGTCGAGCCT
TTCATCGCGG TTGAGCAGCG TTTCGGCCAG CTCGCGCTTG CAGACATACG AACTACTTTA
GATTCTTCGG TAAACATCCC TGAGGCACAG TCTATTGCGA CGTCGGACGA CTTGACCGCA
GTTATACAAA GCGTGCTCGC CACGCTGCCG AAAGCCATCG ACGTCTTCGG CGTCGTCATA
GACAGATGCG AGGCGATTAC CGCTGGCGTT GAGACGATGA CTTGCATTCA TGCGATTGAA
AGTGGAATGG AACACTACGT CGATTTAGTC GCGCTCGTCC TTCGTGACTT GAGAAACGCG
GCCGGTTTGA TTGACTCGAC CGCGACGAAC ATGAACTCAA ACACGACTTC CGCGGGCGAA
GAATTCATCA GAGGATCGCT CAGTATGCTG GACATGATCA ACGCCATCCC GAACGCACTG
TTAGATTTCG AGTCCGATCT ACGTGCAAAG CTTCTGAAGC TTCGAGCGAC GCTGCGACCG
GCTTTGGATG TTACTTTGGA GCTTGGTGAC GGACCAAAAA GCCGTACGCT CCTCGGTTTG
AGCATCGCGG CGCACGCGGC GCGATCACGA AAGTTGGCCA CGTTCCTTGA CAAGGTGGCG
GATGCCGCGG TGAAGCACTC GCTCGATGGA TCCATAATTC CGGTCGGTGC TGAGCACATG
AACGCTCTGA CTAGATCACT CGAAAAATTC GTGTACGACA CTCTCCTCGG GCGGGTGTCG
CTCGAACTCA AGGGAATCAG CACGTCTGAG GTTTGGAGCG CCAAACCTGC CGAGAGCGCG
TATAAACTCC CGACGTTTAG CGCTTATCCG CAAGAGCGAA TGACGAATGC AGGTGAATAT
CTGCTTTCAT TGCCGCAGCA TCTGGATAAC ATGCACGACG ACGACTTGGC TCGCGCGACG
TCGCTTTCGG GCGACTCGAA TGCGGCGCAA GAGCCGGCGA CTTCCGAGGC GTGGATTGCG
AAAATCGCCG AAGCGAGTGC TGAGTTGCTC TTGAAAGAAG TGCGTGCCAT CGCGTCGCTC
ACCGACCAAG GCGCTGCGCA GTTGTCGGCG GATTTGGAGT ATTTTTCGAA CATAGTTGCA
GCGCTCTCAC TCGCGCCACC CAGCGCGCTC ATCGCCTGGT ACAAGTGCGC GAGCGCGCCT
CGCGACGAAT ACGAAGATTT TGCACGCGCT GCGACTTCAG AGGGCATCGA CGTGCGCGTC
GTGCGAGCCG CCGCGGCGAC GCGAGGGATT AAGTTAAGTA GTTCTTTGTA A
 
Protein sequence
MTSAMRDAKA SLRDFSSDDF DRVRWINEQT RARIGDAGGS GRDATARDGS AVVARAVGEG 
TRGTSASPLE RFLADLELQL QLLGEDLSMS LEERSREGVA RVPKAVKEIE VVEGRVKRLH
EEVRGILDRL DEVESESRAS VEALRQLDAA KQRMESARET LQEANGLADL MASVDGIFAS
GNIRNMSESL ARMKRGLAVV GDVPEFADGQ DKVNAFEHKL EAIVRPALIT ALESQNSTAA
REHRDVLRTT GRGSALESIY ADTRVTSRML KQWKSRERDA TTTDAGRLDA SVHMIDEFLK
YCATALKEEI SWCSSTFPED AVLLIPLSWC SLHTTLETSI TEKLSSLTLE QLVSARNSFQ
TYVEDVGIAF TKLAGDAAAH ANADAVKGAI GDALMAIVEP FIAVEQRFGQ LALADIRTTL
DSSVNIPEAQ SIATSDDLTA VIQSVLATLP KAIDVFGVVI DRCEAITAGV ETMTCIHAIE
SGMEHYVDLV ALVLRDLRNA AGLIDSTATN MNSNTTSAGE EFIRGSLSML DMINAIPNAL
LDFESDLRAK LLKLRATLRP ALDVTLELGD GPKSRTLLGL SIAAHAARSR KLATFLDKVA
DAAVKHSLDG SIIPVGAEHM NALTRSLEKF VYDTLLGRVS LELKGISTSE VWSAKPAESA
YKLPTFSAYP QERMTNAGEY LLSLPQHLDN MHDDDLARAT SLSGDSNAAQ EPATSEAWIA
KIAEASAELL LKEVRAIASL TDQGAAQLSA DLEYFSNIVA ALSLAPPSAL IAWYKCASAP
RDEYEDFARA ATSEGIDVRV VRAAAATRGI KLSSSL
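
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field of this record is empty. The `translate` helper is illustrative and is not part of the source database.

```python
# Standard genetic code (assumed, since the record gives no translation table),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base column spacing used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from above.
first_line = "ATGACGTCGG CGATGCGGGA CGCGAAGGCG TCGCTGCGTG ACTTTTCGAG CGACGATTTC"
print(translate(first_line))  # MTSAMRDAKASLRDFSSDDF
```

The output matches the first 20 residues of the protein sequence listed above.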