Gene OSTLU_17964 details

Gene Information

Locus tag: OSTLU_17964
Symbol: SDG3520
ID: 5005552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 19820
End bp: 20970
Gene Length: 1151 bp
Protein Length: 384 aa
Translation table:
GC content: 64%
IMG OID: 640420973
Product: predicted protein
Protein accession: XP_001421367
Protein GI: 145354174
COG category:
COG ID:
TIGRFAM ID:
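
The coordinates and lengths in this record can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length; the function names `gene_length` and `codon_split` are illustrative and not part of any database API.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 19820
END_BP = 20970


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def codon_split(length_bp: int) -> tuple:
    """Return (number of complete codons, leftover bases)."""
    return divmod(length_bp, 3)


length = gene_length(START_BP, END_BP)
full_codons, leftover = codon_split(length)
print(length, full_codons, leftover)  # 1151 383 2
```

The result matches the listed gene length of 1151 bp. The 2 leftover bases, together with the listed protein length of 384 aa, would be consistent with a final codon that is incomplete in the nucleotide sequence, although the record itself does not state this.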


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACGGCGT GCGACGTCGC CGTCGCGCGC GCGCGCGAAC GTCGAGGCGG AGGTCGAGGC 
GTCTTCGCGA CGCGCGCGTT CGAGCGAGGC GAGTGCGTGA TGATCGAGCG CGCGCTCGTG
GGAGCGCAGC ACGAGGAAAA CGTGAAACAC GCGCGCGCGT GCGAAAATTG CCATCGATAC
GTCGGGACGG TGTCGAGCGC GGTCGGACGA AGGCTGCTGG AAAAGTACGC GAACGCGGCG
CCGCCGAAAC CGACGACGCG CGAGGATTTG GTGAAGCTGG CGAGCGGTGA GGCGACGCTT
CCGGGCGCGG ATGCGTTCGA TGGACCGCGT GAAGTTGGAT GTTTGGGCGC GTGCGCGCGA
AACGTGTACT GCTCCGAGGC GTGCGCGTCC GAGGCGTGGC GCGAGAGAGA GTCGCTCATG
TGTCCGGGAG AGAAGGGGAC GGCGACGAAT AAGCGGGCGT TGGATGAATT TTACGCGCAC
GCGAGGGAGA CGAACGATAT ATTTATTTTA GCGGCCAAGG CGGTGGCGAC GATGTGCGCG
CGAGCGTCGC GGGCGAGCGC GAGAGATCGA GACGACGGGT CGAGCGGAAA GGAGATCGAG
CGGGACGCTT CGGCGGCGGA AGACTTTGCG CGCCTGCCGT TCGCCGTCGT CGCCAACGCG
CCTTGGTGGG AAAGCGTGGC GACGCCGCAC GACTGCGAAG ACGAGCGCGC GGAAATGGAA
TTCCGCACGA CGTTGCGAAC GCTCGCGCAA GACTCTTTGG ACTTGCTTCG ATCGGCTTGG
GGCGAAACGG CGAACGCTTG GCCGCGATTC TTCACGCTCG AGACGTACGG CAGACTCATC
GGAGCGTTTG AACTGAACAA CCTCGAGCTC GTCGTGGAGA GCCCGGTTGA GAATTACTTT
CTCGCGATCG ACGCGGCGCC AGATGGTGAA GAGAAGCGAG CGGCGATGCG CGTCACGCAG
CCACTGCTCG ACGCCTTGGA TACGGAGTAC GACATTCCGC TCCTCGGCAG CGCGTTGTTC
TCCGTACAAT CCGGATTCAA TCACGACTGC GACCCGAACT GTGAGCCGAT GAAGGGAGAG
GAAGACATCG ACGGCGCGTG CGTCATCATC GCGCGGCGCG ATATCGCAGC CGGGGAAGAG
TTGACGATCT C
 
Protein sequence
MTACDVAVAR ARERRGGGRG VFATRAFERG ECVMIERALV GAQHEENVKH ARACENCHRY 
VGTVSSAVGR RLLEKYANAA PPKPTTREDL VKLASGEATL PGADAFDGPR EVGCLGACAR
NVYCSEACAS EAWRERESLM CPGEKGTATN KRALDEFYAH ARETNDIFIL AAKAVATMCA
RASRASARDR DDGSSGKEIE RDASAAEDFA RLPFAVVANA PWWESVATPH DCEDERAEME
FRTTLRTLAQ DSLDLLRSAW GETANAWPRF FTLETYGRLI GAFELNNLEL VVESPVENYF
LAIDAAPDGE EKRAAMRVTQ PLLDALDTEY DIPLLGSALF SVQSGFNHDC DPNCEPMKGE
EDIDGACVII ARRDIAAGEE LTIS
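
Both sequences above are printed in space-separated blocks of 10 characters, 60 per line, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, and the GC fraction computed this way may not match the database's 64% figure exactly if that value was rounded or derived differently.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Final lines of the two sequences, copied from the record.
last_gene_line = "TTGACGATCT C"
last_protein_line = "EDIDGACVII ARRDIAAGEE LTIS"

print(len(join_blocks(last_gene_line)))     # 11
print(len(join_blocks(last_protein_line)))  # 24
```

With 19 full gene lines of 60 bases plus 11, and 6 full protein lines of 60 residues plus 24, the totals come to 1151 bp and 384 aa, in agreement with the lengths listed under Gene Information. Passing the complete joined gene sequence to `gc_fraction` would give a value to compare against the listed GC content.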