Gene OSTLU_94166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94166 
Symbol 
ID5006990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009375 
Strand
Start bp232655 
End bp233731 
Gene Length1077 bp 
Protein Length358 aa 
Translation table 
GC content60% 
IMG OID640422411 
Productpredicted protein 
Protein accessionXP_001422846 
Protein GI145357276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value0.128481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00106938 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGGCGAGGC GCGCGCAGTG GCGGTCGTTC GCGCACGCGA AGTGCGACGT CGACGACGCC 
CTCGCGACGA CGTTCGCGCT CGCGAAATCG GTCGAGGTGG CGGTGCCGCG AGGACGGCGC
GACGACGAGG ACGCGATCGC GCGCGCGCTC GACGACGACG CGAATGCGAT CGTCGGGACA
CATTTCGAGG CGGAGATGTC GATCGCGGAC GCGCTGGACC CGCACTTCGT GCACGAACAC
GTGCGACGGT GGACGGCGGC GAAGACGAAG ACGCGCGACG CGGGCGCGAC GTTTGGGCTG
ACGACGATCG ACGCCGAGGT AGATTCGGAC GATTGTTTTT GCGTGACGCC GAAGGGACGA
TTCGTTGCGT CGCTCATGGC GGACGCAAAG GAAAGTTTGG GGATGCAAAC GACGGAGGAC
GCGCGCGGGA AGGCGCTCGC GAGCGTGAAT TTGACGCGAG AAGAGTTCAA GGTGGGGAAT
TGGTTTCACG ATCGAATCGG GGCGTGTGCG CGGGCGTTTG AGCAGCGAAC GGGAAAGTCA
AAGGTATTGT GCGCGTACAT GGTGAGTGGT GCGCACGAGG AGGTCAAGTT TCCACGCACA
GTGACGGCGG CGAGGCGCGC AGAGAGCGTC AAGACTTCGT CGCAAGTTCG CGTCAATTTG
GTGAGTCAAA GTGCCGTGTT ACTCGCGTCT GCGAGTCCAC CGATGAGAGG CGAGGAAAAG
GACATCGATG TTGATGATTT AGATCGGATT TTAGAATTTT GTGGGCGCAT AAGTCTAGGA
GACGCCTACA TGGAGGGCGC CGAGGACGAC CGAGGCGACA TCGTCGAGAC GCGTCGCTGG
CGAGGTTTGA TGTTGTATCC AGATACTCGA CGCGTGATTG ATATCGCGCG AAGGATCGTG
AACGATGGGC TCGCACCGTG GGCCGTCGTC ACTGTTTGGG CTTTCGCACA CATGCCGAGC
GAACTCACAG GTGAGCCGAG AAAGAAAATT GAGCGCGACT CATGCGCGCC AAAGATGGGC
GCCGTGTTCA CCATCGTCTT GTACCCCGAC GACAAGTATT TGAAATTCGT AGCGTAG
 
Protein sequence
MARRAQWRSF AHAKCDVDDA LATTFALAKS VEVAVPRGRR DDEDAIARAL DDDANAIVGT 
HFEAEMSIAD ALDPHFVHEH VRRWTAAKTK TRDAGATFGL TTIDAEVDSD DCFCVTPKGR
FVASLMADAK ESLGMQTTED ARGKALASVN LTREEFKVGN WFHDRIGACA RAFEQRTGKS
KVLCAYMVSG AHEEVKFPRT VTAARRAESV KTSSQVRVNL VSQSAVLLAS ASPPMRGEEK
DIDVDDLDRI LEFCGRISLG DAYMEGAEDD RGDIVETRRW RGLMLYPDTR RVIDIARRIV
NDGLAPWAVV TVWAFAHMPS ELTGEPRKKI ERDSCAPKMG AVFTIVLYPD DKYLKFVA