Gene OSTLU_17597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_17597 
Symbol 
ID5004764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp273782 
End bp274902 
Gene Length1121 bp 
Protein Length374 aa 
Translation table 
GC content60% 
IMG OID640420185 
Productpredicted protein 
Protein accessionXP_001420646 
Protein GI145352639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.210971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACGA CCGCGATCCC GTCGCGAACC GCGGCGCGCG CGCGCGGGTG GACGAGAGCG 
CGAGAGGGCG CGCGCAACGC GACCATCGTG CGCGCTAACC CACCCGGAAG TGCGCGACGC
GAAGGCGTCG CGCGACGCGG CGACGACGCG GCGACGACGC GCGCGATCGC GGTGGATGAA
AGAAAGTCAT CATCATCGAC GGACGCCGTT CGCGAGTGCG CCGTGGGGGA CGCGCTGCGG
GTGCGATTGC CCGGGGAGAC TCGAGCGAGT TATCTAGAGG TGGAGGCGAC GAGCGCGAAC
CGCGCGCGCG GAGCGTTTCG AGGCGTCGAT GCGGTGCTCG AGGTGTGCGA AGAGACGAAT
AGAGTGTTGT TGAGTGAAAT TGTCGATGGG AACTTTGGAA AGGATGGGAA GAGGAAGTCA
ACGCGATTGT TGCGGGAAAG GTTTACGCGC TCGTACGACG GTTTGACGAC GACGCTGGAG
ACGCTGCGAA GGTCGGGAGA ATATCCGGCG CCTGGGTTTT CGGGCGCGAT CGAGTGCGAA
GAGGTGTCGA GACTTTGTAG TTTGGAAGAG TTACGCGCGC GAGTTTCGGA AGATGCGAGC
TTTTGGGCCG ACGGAAGTCC GAAAGATTTC CAAGGCTTGT CCGCGGACGA GCACGCAACG
CTCAACGCCG GGGAGATTTT GGCGTTTCAG TGTGGATGCA GACCGATGTG CATGGTGCAG
CTGTGGACTG GTTGGGAAGA CGATTTGGGT CGACGACAAA GCATTGACAT GCCTTTTGTC
GCCCGATTGC TGAAGGAAAT CACCGAAGAC GATGAAATTG GTGTCATCAC CGTCGCGCCT
CCAGGGGCAT CAGAAAAGCT CGGAATCACC GCGCTGTTGT ATCCTCGAAA GTCGCCGTAT
AAAGAGCGCG CGAAGCTGCT CGCCTCTTTC GGCGCCCAAG CAGCCATCGT GGCGGGTTCA
GCGTACTATC AAACGCTGAT AGGACGGTGC CTCGGGTACA AGGAGGAGAA CATCAATGCG
CACGTGCAGC AATACAACAA GGGCGTCGGC GTCTCGAAGC AAATCAGCGA CTTGGTCGAA
GAAGAATTGG CCGGGCTTAG CGCAGTTCCC GCGTCAAAAC G
 
Protein sequence
MQTTAIPSRT AARARGWTRA REGARNATIV RANPPGSARR EGVARRGDDA ATTRAIAVDE 
RKSSSSTDAV RECAVGDALR VRLPGETRAS YLEVEATSAN RARGAFRGVD AVLEVCEETN
RVLLSEIVDG NFGKDGKRKS TRLLRERFTR SYDGLTTTLE TLRRSGEYPA PGFSGAIECE
EVSRLCSLEE LRARVSEDAS FWADGSPKDF QGLSADEHAT LNAGEILAFQ CGCRPMCMVQ
LWTGWEDDLG RRQSIDMPFV ARLLKEITED DEIGVITVAP PGASEKLGIT ALLYPRKSPY
KERAKLLASF GAQAAIVAGS AYYQTLIGRC LGYKEENINA HVQQYNKGVG VSKQISDLVE
EELAGLSAVP ASKR