Gene OSTLU_29894 details

Gene Information

Locus tag: OSTLU_29894
Symbol:
ID: 5000533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 145623
End bp: 146720
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table:
GC content: 63%
IMG OID: 640415954
Product: predicted protein
Protein accession: XP_001416083
Protein GI: 145341994
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
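
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 145623
end_bp = 146720
gene_length_bp = 1098
protein_length_aa = 365

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# which does not contribute a residue to the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the reported gene length (1098 bp) and protein length (365 aa) agree with the coordinates.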


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0723329
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTACG ACGGCGCGAG CGACGCGGAG GAAATCGCGC GCGCGGTGAA GACGTTCGAC 
GCGATCAACG CGCTGCGACG AGCGACGGAG GAGGACGCGG GGGAGGGGGA GGACGCGAGA
CGCGCGAGAA AGCGCGCGGC GGCGCTGGAA CGCGCGGCGG AGCGCGCGAA GCGCGCGAAG
GCGAAGCGAC AGGGCGAGGG ATTCGATGGG AATAAAAATG TGAAGTCGAC GGGGGCGTAC
GTGACGGGGC TGCCGTCGGA CGCGACGGAG GAGGAGTTGG GGGAGGCGTT TAAAAAGTGC
GGCGTGGTGA AGCTGGACGC GAAGACGGGA CGAGCGAGGG TGAAGGTGTA CAGGGATGCG
GACGGGAAGG TGAAGGGAGA TGGGTTGGTG GTGTTTTTGA AGGCGCCGAG CGTGGATTTA
GCGATCGCGC TGTTGGATCA GACGGAGTTG AGGCTTGGGG ACGCGACGAC GAGGATGACG
GTGACGGCGG CCAAGTTTGA GGCCAAGGCG CGGGGGGACG ACGAAGGTGG GGGGGCGAAA
GTCGCGGCGA AAGCGAGCGG TGGCGGCGCG CGAATGACGA AGGCCGATCG CAAACGCGCG
GCCGCTCTTC TGAAAAGGCA AGAGGCGGAG GCGTTGGGAT GGGCGGGTTT CGACGACGAC
GTCGACGCGA AGAAGCTCAT CGTCGTCTTG CGGCGGATGT TTACTTTAGA AGAGATGTAC
GCCGACGCAA ATTTGCGTAA AGAGCTCGAA GAAGACGTTA TGGAGGAAGC GCAGCGTACG
TGCGGGCCGG TGATGAGCGT GAAGACGTAC ACGACGTCGC AAGATGGAAC GATGACGATT
CGCTTCAAAT CTCTCGAAGC CGTCGAAGCG TGCGTCAAGG CGTGGAACGG TCGCTGGTTT
GACGGTAGAC AAATAGAAGC CTCGATGTGG GACGGAAAGA GTAAGTTTGT GAGCCAACGT
GACGAGAGCG AGGCGGCGCA ACGTGCGCGG TTAGACGCGT ACGCCGCCGA ACTCGGCGGC
GGCTCGGACG CCGAAGACGC CGAAGACGAC GACGACGACG TCGACGACGA CGACGAACAT
TCCGACGACG AACAATAG
 
Protein sequence
MTYDGASDAE EIARAVKTFD AINALRRATE EDAGEGEDAR RARKRAAALE RAAERAKRAK 
AKRQGEGFDG NKNVKSTGAY VTGLPSDATE EELGEAFKKC GVVKLDAKTG RARVKVYRDA
DGKVKGDGLV VFLKAPSVDL AIALLDQTEL RLGDATTRMT VTAAKFEAKA RGDDEGGGAK
VAAKASGGGA RMTKADRKRA AALLKRQEAE ALGWAGFDDD VDAKKLIVVL RRMFTLEEMY
ADANLRKELE EDVMEEAQRT CGPVMSVKTY TTSQDGTMTI RFKSLEAVEA CVKAWNGRWF
DGRQIEASMW DGKSKFVSQR DESEAAQRAR LDAYAAELGG GSDAEDAEDD DDDVDDDDEH
SDDEQ
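
The sequences above are printed in blocks of ten characters, so they need to be joined before lengths or base composition can be compared with the Gene Length, Protein Length and GC content fields. A minimal sketch of that clean-up follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above; paste the full block
# to check the whole record.
first_line = "ATGACGTACG ACGGCGCGAG CGACGCGGAG GAAATCGCGC GCGCGGTGAA GACGTTCGAC"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Applied to the full gene and protein blocks, `len(clean_sequence(...))` gives the values to compare against the reported 1098 bp and 365 aa.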