Gene OSTLU_17601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17601 
Symbol 
ID5004766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp280411 
End bp281670 
Gene Length1260 bp 
Protein Length419 aa 
Translation table 
GC content57% 
IMG OID640420187 
Productpredicted protein 
Protein accessionXP_001420808 
Protein GI145352974 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.125061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCGT CATCCTCCGT GCTCGAACCC GAGGACGGCG AACCGTTCGC GCGCGTCCAT 
CGAGCGCACT ACCGCGGTTT CGTCGTGGAC GCGCCCTCGG TGCTTCCCGC GTCGCTTCAC
GACGACGTCG AACGCGCGTT CGACGACATG CGCAGCCGAG GCGAATTCAC GCACGACGTC
GTTTCCGCCG GAAATAAAGT GTCCACGACG TACGTGCGGC GCTGCCTGCT CGGTGAGGAC
GGAATGACGT ATCATTATCA GAAGTTACGC CTGTTTGCGC AACCGTGGCG CGGACGGGAG
GCGTACGAAG TGGTGAGACG GTTGAACGAG ACGCTGACGA GGAGCGCGCG AGAGCGATGC
GAAAAGCTCG GCGGAGCGTT CGCGGAGAGT GAGTGCGAAT ACAACGTGAC GCTGATTAAT
TACATGGAAA CAGAGGGTGA GAGCGAAATA GAGCTACGGA ATGAGGAGAA GTTCGATCTC
GGGACGACGT CGGTGAGTTG GCACAGCGAT TCGTCGTTGC GAGAAAACTC CACCGTGGCG
GTGTATCACA CATACGAGGC GCCAGAGAGA AAAGACTGGC GAGTGGCGCT GAGGGCGTTG
AACGCGGAGT GCGAGGTGCT TTGCGTGCCT TTGGAGGATA AAGCGACGTA TTACATGTGT
GGTGAGTTCA ACGCCACTCA CCATCACGCC GTGCTCACCG GCTCGAGTGC GAGATATTCT
TCGACACATC GAGTCGCCGT GGTGGCCAAG GATACGTTCC AGTACATCAA AAGGCGGTGT
ATCGATGCGC TCGCAATTGT ACCAGACTTA GAGCGCGAGA ATAAACCTTT GGATGCGAAA
CAGATACAGT TCTTAGCCGA CGTCCATCGC GAGGTTGAAT TTCAATGGAT TAGAATGTTT
CACCTGCAAG GCGAAGCACA CGCGGCGTGG CACGATACGT ATTGGACGAG AAAAATCGCC
GAGCTCACCG AGGCGTGGGA TCGCATGGAG GCTTGCTTTC GAGTGATTTT GTCCAAGCTC
AAGCGTTCAT CTCGCTCGCC CGAGTCGCCG CCTCGAGCGT ACGCGATGCT GCTTTACGCG
CTGAAAACCG TCAAGGAGTT GCGAGACGAG TACACGAAGC GAACCAAGGC GTCGGCGTAC
GCGTCGCTAC CGCCGTCGCA ACGTCCAGTT GATCTGCCCG CGTACGACAA TACTTCTCCC
CTTCCGTTCG AGCTCAAACC CGTGATTTAC TTTCTCGAGG AGGAGCAAAA CAAAGTGTGA
 
Protein sequence
MSPSSSVLEP EDGEPFARVH RAHYRGFVVD APSVLPASLH DDVERAFDDM RSRGEFTHDV 
VSAGNKVSTT YVRRCLLGED GMTYHYQKLR LFAQPWRGRE AYEVVRRLNE TLTRSARERC
EKLGGAFAES ECEYNVTLIN YMETEGESEI ELRNEEKFDL GTTSVSWHSD SSLRENSTVA
VYHTYEAPER KDWRVALRAL NAECEVLCVP LEDKATYYMC GEFNATHHHA VLTGSSARYS
STHRVAVVAK DTFQYIKRRC IDALAIVPDL ERENKPLDAK QIQFLADVHR EVEFQWIRMF
HLQGEAHAAW HDTYWTRKIA ELTEAWDRME ACFRVILSKL KRSSRSPESP PRAYAMLLYA
LKTVKELRDE YTKRTKASAY ASLPPSQRPV DLPAYDNTSP LPFELKPVIY FLEEEQNKV