Gene OSTLU_94311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_94311 
Symbol 
ID5001834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp411164 
End bp412384 
Gene Length1221 bp 
Protein Length406 aa 
Translation table 
GC content54% 
IMG OID640417255 
Productpredicted protein 
Protein accessionXP_001417761 
Protein GI145346574 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.320101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGGG TTCTGAGCGA TCGCTCGAGC GTGTTCGACT TTAAGATGTT TAAGCTCTTC 
ACGTTCGTGT TGAACGTGAC GCTGAGCACG TTTGAGATCC CAAGGCAGTT GATTTTGATT
CCTCTCATCG TGGGCAGTCG ATTGGCAATG GTGCCGGAGT GGCTGCTGAG CGCGTTCGAG
TATGATTCGT GGGAGATGCC GAGAGCGAAA AAGAAACGCC CCAAGTTAAG CACCGATTTT
AAGTTCCCGA CGCTTCGTTG GAATGCTTTG AAGACGTGGG AGAACGCGAT GCAAGCGGGG
AGAGAGAGTT TACGCCGCGT CGAGTCGCAA GTGCGCACCG AGCCGGGTGA GACGCCGGAG
ATGAGAGCGG TGATTGATAA ACTTATCGAG GAAGGTTTGG CGTACGACAA GGCGTGCGAT
AGCCAGAAGT CGCAGGCTGC GTTTGAGCGC GCGCTCGAGC TAAGACCGAA AGATCCGGTC
GTGATGATTT CTTTGAGTAA GGAACTCAGC GACCGCGTCT TTGATCACGA AATTTTCCAC
AACAAGCCGC AGGCGAGACA GCTTGCGTCG AGAGCCGCTG ATTTAGCATC GGAGGCGATC
GAACTCGCAC CTGAGAACGC ACAGTGCTAC ATCGCGCTGG CCGTGGCGAA CGCGCGGTTG
AGCATGTTCA GCGACGCGCG GCAAAAGGTG GAACTCACGC ACTCGATTAA AGGAAATTTA
ATGAAGGCGT TGGAAATCGA GCCCGAGAGC GATTACGCCT ATCACGTGCT GGCCAGATTC
GAGCACACCA TGGCGCACAT CGGTGGATTG ATGCGGTATT TAATCAAGAC AATATACGGC
GCCATCGAAC CGGCGACGAT CGAGCGGGCT GAGGAGTACT TCAGGCGAGC GATTGAAATC
AATCCAAAGC GCCTGATTCA CGGCGTTGAG CTCGCCAAGC TTCTGTACGA AACGAAGCGA
TATGACGAGT GCAAAGAATT GTTGGTGCCA TCGATCGAGC TAGAAATCGA AGACATCAAC
AGCGTGCGGA CGAAGAAGGA CGGCGAAGCG TTACTTAAGA AGTTGACCAA CAAACTCAAC
AGGACGCCGA GTCGTCTCTC GATGTCTCGC ACACCGTCCA AGCATCGTCT TCCGCGTACC
TCGTCCTCGT CGCACACTCC GATGTCGCCG TTGACACCGA TCTCACGCAA TAACTCCAAG
GGCAGCGGGC TCTTTGATTA G
 
Protein sequence
MVGVLSDRSS VFDFKMFKLF TFVLNVTLST FEIPRQLILI PLIVGSRLAM VPEWLLSAFE 
YDSWEMPRAK KKRPKLSTDF KFPTLRWNAL KTWENAMQAG RESLRRVESQ VRTEPGETPE
MRAVIDKLIE EGLAYDKACD SQKSQAAFER ALELRPKDPV VMISLSKELS DRVFDHEIFH
NKPQARQLAS RAADLASEAI ELAPENAQCY IALAVANARL SMFSDARQKV ELTHSIKGNL
MKALEIEPES DYAYHVLARF EHTMAHIGGL MRYLIKTIYG AIEPATIERA EEYFRRAIEI
NPKRLIHGVE LAKLLYETKR YDECKELLVP SIELEIEDIN SVRTKKDGEA LLKKLTNKLN
RTPSRLSMSR TPSKHRLPRT SSSSHTPMSP LTPISRNNSK GSGLFD