Gene OSTLU_26666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_26666 
Symbol 
ID5004791 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009366 
Strand
Start bp16353 
End bp17511 
Gene Length1159 bp 
Protein Length325 aa 
Translation table 
GC content59% 
IMG OID640420212 
Productpredicted protein 
Protein accessionXP_001420725 
Protein GI145352802 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTCCA ACATCCCGCG AGAATTTCAG AAGAAATATA CCTGGGATGC GATCGTGTTC 
TCAGATGGCG CCACGTACGT CGCATCGACG CGAACGCGGC GCGAACTCGT TCGCTCGCGC
GCGCGGAAAC CCTCGAGGCG ATTGTTCGCG ATTGTAGCGC GAAATCGCCG CGCGCGACGT
CGATGGCGCG CGAAATCTCG CACGAAACGT ACGACTGATA TTTTAACCAT CGCTTGCGCG
TGTCTTTCGA CGCAGTTACG AAGGCCTCGT GAGTAACTAC GGGCAGTGTG AAAAGTTGGG
CGTGTTTCAG TACGTCGACG GCGACCGATA CGAAGGACAA TACTCCGAAG GTATGATGCA
CGGCTACGGA GTGTACACGT GGGGTTTGGA TGACTCCACG TATTACGGAC ATTGGCAGAA
TAACTCCCAG AACGGGTGCG GGGTGAAGCT CTACGGTTCG GGCGCGGTCG AGGTGGGCGA
GTGGAAGGAT GACCAATATC TCGGCGAATA CACGGGACGT TGTGGGGAGG ATGAGCAAAA
CAGAGCAATG ATGCACGCGA TGGAAGTGGC GACGCGCGCG CGTTTGTTCA CGGGTAAGCC
CGATGGCGAA GTCGTGGTGT TGGAGAACAT CTCGAACCCG GATACGGCGG AGAGTCACCA
CCCGGTGGTG TACGATCGCG GCACCGAGTG GCAAATGCCC GGGTACAAGG GCGAGCAGTT
CGAGCCTCCG GCCGATCTCG AGCAAACGCA ACCGAAGGTT TTCGCGCAAA TGCAGCGATT
CAACCAACTT TGGGAGCGCG CTTGGAGATA CTACAACATC GACGTCCCGG AGGGTGAGAA
TGACCAAAAG TTGCAAGAGC TCAAGTTCTT GACCGAGGCA CCGGCGACGC TTCGTACCGT
GGACGAAGAC TACGATGAGT ACGAGGATGA AGACGACGAG GATGAAGACG AGGACGAACA
ACCGAGCTCT CGATCCCGTC GCGCTGGACC GGCCGCGATG TCCCTTAGCT TCCGCAACGC
GAACCCGATC TCCACCGCGT TCGCGCGCTT GGGCAAGAGC AAGCACGCGT TGCGCAAGAA
CCCGCTCAAG GCTGCGGCGG GTGCTTTCGA AGCCGCGCGC GAGCGATTCA TGGGCGCGAC
GAAAGTCACC ATGGCTTGA
 
Protein sequence
MGSNIPREFQ KKYTWDAIVF SDGATYEGLV SNYGQCEKLG VFQYVDGDRY EGQYSEGMMH 
GYGVYTWGLD DSTYYGHWQN NSQNGCGVKL YGSGAVEVGE WKDDQYLGEY TGRCGEDEQN
RAMMHAMEVA TRARLFTGKP DGEVVVLENI SNPDTAESHH PVVYDRGTEW QMPGYKGEQF
EPPADLEQTQ PKVFAQMQRF NQLWERAWRY YNIDVPEGEN DQKLQELKFL TEAPATLRTV
DEDYDEYEDE DDEDEDEDEQ PSSRSRRAGP AAMSLSFRNA NPISTAFARL GKSKHALRKN
PLKAAAGAFE AARERFMGAT KVTMA