Gene OSTLU_17947 details

Gene Information

Locus tag: OSTLU_17947
Symbol:
ID: 5004943
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009367
Strand:
Start bp: 515227
End bp: 516356
Gene Length: 1130 bp
Protein Length: 322 aa
Translation table:
GC content: 60%
IMG OID: 640420364
Product: predicted protein
Protein accession: XP_001421018
Protein GI: 145353434
COG category:
COG ID:
TIGRFAM ID:
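
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon (an assumption, though it is consistent with the reported Gene Length) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 515227
end_bp = 516356
protein_length_aa = 322

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: 3 per residue plus one stop codon.
coding_bp = 3 * protein_length_aa + 3

# Assumption: the rest of the gene span is non-coding (for example introns).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp)  # 1130
print(coding_bp)       # 969
print(non_coding_bp)   # 161
```

The 161 bp difference fits the record's statement that the gene is spliced, although the record does not list exon boundaries.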


Plasmid Coverage information

Num covering plasmid clones: 67
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 80
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCCAC CGCTGGTACC GTTCGTGGTG TGCGGCACGA AGAAAGGTTC GGTGCCGTTC 
GCGCGCGTGA AGACGAAATA CGGCAAGACG CACACGAGAA TTGACAACTG CTCGGGCGAT
GTTTCGCTGC TGTTGTCGAC GCTCACGCGC GCGCTCGGAT GTGGTGGAAG CGTGTGCGAT
GATAAGAAAT CCGTCGTCAT CGCGTTCGAC GAATCTTTCG ATCGCATCGT GAAAGTACTG
AAGTCGTTCG ACCCGCGGTG CGTACGCGGC GTGCGCGGTG CCGAGACGAG CGTTCCCGTC
TCGAGAGCGA GCGCGAGCGC GCCGACGAGA GAGCCGAGGA CAAAGCCGCG ACACGGTGCG
ACGGGGAGTG GGAAAGGTGT GAGTGGTGCT GCGGGGCGTG CCGAACCGAT CGTCATAACG
AAGCGAGGCG CGCATGAGAG CGCTAAACAG TCCATCGACG GATACCGCTT GCTCTTGAGC
AGGTGGCCGT ACTGGGACGG CGACGTTTCG AGAATGTACG ACATGTATCA CAGACATCGA
AGGCTTAATG ACGATGTGGT GATGTCGTTC GATGCAGACT CGATTCTGCA ATCCAGTGCG
ACGGACACGA ACGCGATGAG CGACTTCGAC CGCGCGGCGC AATGCGCTTC GACGGAGGAT
GCGCTTCGGA CGCTTGGCAT GTTAGCTATG CCAAGCGAGT TTCGTCAATC GCGGCTCGAA
CGACAGCGCG AGGCCGCGAA GAGGAAAGAA AGCGTCGCAC GTGTGTCGAG TGGGCCCAAA
ACGTCTTCCG CGGTTGTCGA CATTGATCCG TTTGCTGAAT ACTTGCAGCA CGATCGTGGG
TTTGCGCCGG CGCGCGTTTC GTCAAAGCCC GCCGGCGTAA GTAGTCCACC GCCGTCGATG
AGGACGGCGA AGCCCGCGCG AAGACCAACT GGAGGTTCTT ACACGAATGC TCGGAAGCAC
GGTGCTTCGA CGTCGGTGTC GAGGCTCGCA CGTGTTCCAT CGTCCGAGTC CGATGACGAC
ATTGTGCCGG CGCGCAGGCG CCGAGACTTC GACGTTGGTT TCAGGCGCAC GGCGAGCGGA
AAGTTTACAT TTGGCGGAGA GGATCGAACG CAGACGCGCG TCGCATTTGA
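
The GC content reported in the Gene Information section can be recomputed from the gene sequence as printed. The following is a minimal sketch, assuming the sequence is pasted as text with the spaces and line breaks shown. `gc_fraction` is a hypothetical helper name, and only the first line of the sequence is used here for brevity.

```python
def gc_fraction(sequence_text: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above; paste all lines to check the full gene.
first_line = "ATGTCGCCAC CGCTGGTACC GTTCGTGGTG TGCGGCACGA AGAAAGGTTC GGTGCCGTTC"
print(f"{gc_fraction(first_line):.0%}")
```

Stripping whitespace before counting keeps the 10-base grouping of the printed sequence from affecting the result.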
 
Protein sequence
MSPPLVPFVV CGTKKGSVPF ARVKTKYGKT HTRIDNCSGD VSLLLSTLTR ALGCGGSVCD 
DKKSVVIAFD ESFDRIVKVL KSFDPRCVRG VRGAETSVPV SRASASAPTR EPRTKPRHGA
TGSGKGVSGA AGRAEPIVIT KRGAHESAKQ SIDGYRLLLS RWPYWDGDVS RMYDMYHRHR
RLNDDVVMSF DADSILQSSA TDTNAMSDFD RAAQCASTED ALRTLGMLAM PSEFRQSRLE
RQREAAKRKE SVARVSSGPK TSSAVVDIDP FAEYLQHDRG FAPARVSSKP AGAPRLRRWF
QAHGERKVYI WRRGSNADAR RI