Gene OSTLU_3730 details


Gene Information

Locus tag: OSTLU_3730
Symbol:
ID: 5005459
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 509277
End bp: 510500
Gene Length: 1224 bp
Protein Length: 408 aa
Translation table:
GC content: 55%
IMG OID: 640420880
Product: predicted protein
Protein accession: XP_001421302
Protein GI: 145354036
COG category:
COG ID:
TIGRFAM ID:
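
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention); the constant and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 509277
END_BP = 510500
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 408


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def whole_codons(length_bp: int) -> int:
    """Number of complete 3-base codons that fit in a sequence of this length."""
    return length_bp // 3


# The coordinate span matches the reported gene length.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
# The gene length is a whole number of codons, one per reported residue.
print(GENE_LENGTH_BP % 3 == 0)                              # True
print(whole_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Because 1224 bp is exactly 3 x 408, every codon in the reported span corresponds to one residue of the listed protein.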


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0269126
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00961002
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATTGCGGGCT CCATCGGGCC TGAAGCTGGC GCGCGCTTCG GCTCGGCCAT TTGCGTAAGA 
TTAGAACAAG AGCAAATGGA TGGAAAGGCG AGATCATGTA GCAACATAGT GAACGTTCTC
GCGAGATTGT ACACGTGCGG GCTGTTTCCG TCGGCGTGTT GTTACGGTTT TCTAGTCACC
TTGGCCAAGA GTTTGAGTGA GCTCGACTCT ACGCTCATGC TCACCTTGCT TCGCATCGCA
GGGAACCGCT TGCGTTCAGA AGATCCAGTG GGGATGAAAG AGTTCATCTT GGCTTTGCAA
GCACGCGTGG CTGAGTTACA GAAGGAGCGC GGCGACGGCG AAGACGGCGG CCAACTCTCA
AAGCGCGCAC GCTTGATGCT CGAGATGGTC ATCGATTTGA AAAACAACAA GAAGCGAGAC
AGCGCACAAG ATGTCGGTAA AGATCAGTGG GGCTTCCCTG TCGCGCTCAG CAAGTGGTTG
CGGGGGACAA ACGTGGGCGA GGCTACCGTG GCGCTTCGCG CATTGACGTA CGAAAAGCTC
ATCAAAACTG AGAGTCAGAA AGGCCAGTGG TGGTTACCCG ACGCCGCAGG AACTGCAGAG
TGGTTCGCGG CGCGTGCAGC GCAAGGTGCG ATCACCGAAC AAGCTGGGAA GACACGCGAG
GGTGGTGAGT TGCTTCAACT TGCGAAGAAG ATGCGAATGA ACACAGAAAC GCGTCGAGCC
ATTTTTTGCG TCGTCATGGG CGCAGATGAT TTTGCCGATG CTCTCGAGCG TTTACTGCGC
TTACCACTCG CTGACAAGCA AGATCGTGAA ATTCCTCGCG TTCTGCTCGA GTGCTGTCTT
CAAGAAAAGG CGTACAATCC GTACTACGAG GTTCTCGCCA GCAAATTGTG CGAGCGTCAG
CGCTCGCATC GGCTGACGTT TCAGCTCTGC ATTTGGGATC AGCTCAAAGA GATCGACGAT
CCGTCATCTT CGGTCAGACG CATATCGAAC ATGGCGCGAT TTTTTGCCGG ATTGGTGCTT
TCAGGTGCGC TCGCACCGAC TGCGCTCAAG GCTTTGGAAT TCGGCGTCGA CATCGCGCCG
CGCGTCGCGC TCCATCACAA GCTCTTCCTC CAAACTGTCC TAGACGATCG CTCGCGAACG
TCTGCCGCAG ATAGCCTCTT CCAAAGGATT GCTGTCCACC CAGAACTCAT GTCCGCCAAG
GCTGGCTTCT TACGACTTCT TCGT
 
Protein sequence
IAGSIGPEAG ARFGSAICVR LEQEQMDGKA RSCSNIVNVL ARLYTCGLFP SACCYGFLVT 
LAKSLSELDS TLMLTLLRIA GNRLRSEDPV GMKEFILALQ ARVAELQKER GDGEDGGQLS
KRARLMLEMV IDLKNNKKRD SAQDVGKDQW GFPVALSKWL RGTNVGEATV ALRALTYEKL
IKTESQKGQW WLPDAAGTAE WFAARAAQGA ITEQAGKTRE GGELLQLAKK MRMNTETRRA
IFCVVMGADD FADALERLLR LPLADKQDRE IPRVLLECCL QEKAYNPYYE VLASKLCERQ
RSHRLTFQLC IWDQLKEIDD PSSSVRRISN MARFFAGLVL SGALAPTALK ALEFGVDIAP
RVALHHKLFL QTVLDDRSRT SAADSLFQRI AVHPELMSAK AGFLRLLR
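
The gene and protein sequences above can be compared programmatically. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in this record is blank, and assumes translation starts at the first base of the gene sequence. The helper names `clean`, `gc_percent` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard genetic code (ASSUMED; the record leaves "Translation table" blank).
# Codons are enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons starting at the first base; '*' marks a stop."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATTGCGGGCT CCATCGGGCC TGAAGCTGGC"
print(translate(first_blocks))   # IAGSIGPEAG
print(gc_percent("ATTGCGGGCT"))  # 60.0
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` gives a way to compare the result against the reported GC content and protein sequence.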