Gene OSTLU_25031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_25031 
Symbol 
ID5003647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp226551 
End bp227979 
Gene Length1429 bp 
Protein Length474 aa 
Translation table 
GC content60% 
IMG OID640419068 
Productpredicted protein 
Protein accessionXP_001419530 
Protein GI145350258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.102558 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCG GTGCGGAGGA CGCCGCGTGC GCCGCGACGC GCGTGCGCGG TGAACTTAAC 
GCCTCGTGGC CGCGCGGAAG AACGCCCGCG AGCGGGTCGA CGCCGCTCGT CGCGCTCGCG
AAGAATATGT CCTCGGTGCT TCAACATTTT GAAGCGGTGA AGACGACGGA TGCCGACGAC
GAGACGACGC AGGTGGCTTC GCTGCGCGCG TTTAGCGAGC GAGAGGTCGT GGTGGATGTG
TACGATCGAT CGGGCGGGGC GACGCGAGCG GTGCGTGAGA AGTTGGGCGC GGTGGGCGGG
ACGCGGGTGC GGGAAGCCAG AGACGCGAGC GGCGACGGCG ACGTCGGGTT GGTCGTCGTC
GACGGGCAGG AAGAAGAGAG TTTCGAAGCG ATCGGCAAAA TTTTGAAGCA GTCAGGCGTG
GCGTCGACGC GAGTGATCGT GCTCGTGTAT GGGGCGGCGG TGGGGACATT GGAGGAGGCG
CGAGCGCGAG CGGCGCGAGA GACGCGAGCG CCGATTGAAA ACGTCTTTGC GACGTTTGGG
ATTTTAGACG AGTCGACGAG CGAGCGGTTG GATTCGTTGT TGAAGAATCC TCCGCTAAGA
TCGACGTTGG AGACGTCCGC TGAAGTGAGA GTGTCTGTGC CAGAACCGGT GTCGGCGTTC
GCCTTGGACG ACGAAGACGA AGACGAAGAC GAAGGTTTGA CTTCGATCGC GTTGTCGGAC
GCGAAGGTCG AGCGTGAAGA ATGGGAAACG ATCGTTCGTG AACACGCGGA GCGATGCTTC
GAAGATGCCG TCGACGCCAC TATGGAGCAG ACGCGACAGT TTTGCTTGCG ACGCGCGGAA
AAGTTGGCTT TGCAGTACGA ACCGCTGCGT AGTTCTTTAT ATCAACGGAT TAGAGGCGAG
CGTTCGAAGA CAAAAATCGT GGAGCCGTCG ATACAGGCTC GCGTGGCGCC GCCATCGCAA
TACGTCGAAG AAATGACGAC GACACGTCCG TATAAATTGA ATTCCGCGGC GTCGCAACCA
CTCGGTATCG ATATGATCAA GCTTCAATCA GTGGCGATTT CTGGTGTCGA GCAAGGCGCC
GTCGTTGCGT CGCAGGCGGC TGAAAAGGTT GGCTCGTTCC TCAACTGGAT CGTAAGCGAG
AATGAAGAGA GCGAGGACGA GCGTAGACGA CGAGTTGAGC ACGAGCGCGT CTGGCACGAA
AGGCGCCAAA AGTTGTGGGA TGCCGAGCGC GCGCAACGTG TGGAGACGCA AAACCGAGGC
AAACACTCAC CGACGGCGTC CTTCGTCGAA ACGACGTCCA AGCTCGTAGA CATCATAGAC
GGTTCGCTTG GCGGCGAATC TCAGCTCGAT CAAGCCAACG CGAGGATTCG CGCGCTCGAG
GCTGCCCTCG CGCGATACGA CCCGTCGCAC GAGCTGTTGC GCTAAACTA
 
Protein sequence
MPGGAEDAAC AATRVRGELN ASWPRGRTPA SGSTPLVALA KNMSSVLQHF EAVKTTDADD 
ETTQVASLRA FSEREVVVDV YDRSGGATRA VREKLGAVGG TRVREARDAS GDGDVGLVVV
DGQEEESFEA IGKILKQSGV ASTRVIVLVY GAAVGTLEEA RARAARETRA PIENVFATFG
ILDESTSERL DSLLKNPPLR STLETSAEVR VSVPEPVSAF ALDDEDEDED EGLTSIALSD
AKVEREEWET IVREHAERCF EDAVDATMEQ TRQFCLRRAE KLALQYEPLR SSLYQRIRGE
RSKTKIVEPS IQARVAPPSQ YVEEMTTTRP YKLNSAASQP LGIDMIKLQS VAISGVEQGA
VVASQAAEKV GSFLNWIVSE NEESEDERRR RVEHERVWHE RRQKLWDAER AQRVETQNRG
KHSPTASFVE TTSKLVDIID GSLGGESQLD QANARIRALE AALARYDPSH ELLR