Gene OSTLU_86921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_86921 
Symbol 
ID5001378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp146950 
End bp149307 
Gene Length2358 bp 
Protein Length785 aa 
Translation table 
GC content58% 
IMG OID640416799 
Productpredicted protein 
Protein accessionXP_001417435 
Protein GI145345896 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5096] Vesicle coat complex, various subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGTCGA CCGCGAAATC GCCGTCGAAA CGCGCGGATG ACGTCGCGGA GCTGTCGAAA 
GCGTTGAAAC TGTTGAGCTC CAAACAATCG GACGACGACC GCGCGAACGC GCAGCGCAGA
CACGTCCTCC GACGCGTCCT GAACCTGCTC ACGATCGCGG TCGACGTGTC TAAACTGTTC
CCAGACGTCG TGCTGAACGC GCACACCGTG GACGTGGCGT GTAAGAAGTT GATTTACGCG
TACATTTGCC ACCACGCGCG GCGCGAACGG GAACTGGCGA CGCTGGCGGT GAACGCGCTG
CAAAAAGATT GCGCGAGCGC GAATGAAACG ATACGGGGAT TGGCGATACG AAGCATCGCG
GGATTAGGGG TGGATGATTT GATCGAGTAC GCGACGGCGT GCGTGATGGC GGGGTTGAGA
GACGCCGGAG GGTATCCGCG AGCGGTGGCG GCGATGGGGG CGCTCAAGGT GTACGATTTG
AATCCAAGCG CGGTGCGGGA GACTGGGATA CTGGATGCGC TGCGAGAAAT GCTGGTGAAC
GATACCGACG CGGGAGTGGT GGGGAATTGT TTGATCGTGT TGAGGGAGAT TGACGGGATC
GAGTCGTTGG CGACGAAGCC CATCGTGTAC GCTTTGATTA ATAGAATTAA ATCGTTTAGC
GAGTGGAACC AGGCGTTGAT TTTGGAGTTA GTGGGGGCGT ATGAGATTCA GAACAAAGAT
GAGACTTTTG ACATCATGAA CGCGCTCGAG TCGAGGCTGA GCGCGCCGAA TTCCGCCATC
GTTTTGGGCA CGGTGAAGGT TTTTTTGAAC ATTACGCTCG AGATGCCAGA CGTGCATCAA
CAGGTGCTGG AGCGCATCAA GGCGCCGTTG TTCACGCTAG CGAACGGCGG AACAGTGGAA
ACGAGCTACG CGGTGTGGGC TCACGTGCGG TTGTTGGTGA AACGCGCCCC CATCTTGTTC
TCGACTGATT ACAAGAACTT TTATTTCCGC GGGAGCGATT CAGGGGCGGT GAAGAGTTTG
AAGCTTTCGA TGCTCGTAGC CGTCGCGGAT GCGCAAAACA CATACGACAT CGTCACCGAA
CTCACTGAAT ACGTCACCGA CGCGGATATC GGTATTGCGC GCGCCGCCGT GCGCGCGGTG
GGGGAAATCG CACTCTCGGC GGCGGATGAT TTGGAAGGCA TCGTCGACCG CTTGTTGCAA
TACTTTGATT TAGACATCGA GCACGTGACA GCGGAAACGA TCATTTCGGT GGTAAATGTC
TTGCGCAAGC GTCCAAAGTA CGCGGTACAG TGCGTTCAGG CGATCAAGAA CATCGATCTG
ATTGACGTCG TCCCGTCTCG CGCGCGCGGC GCCTTGGTGT GGATGTATGG CGAGTACGGC
GAAGATATTC CGCTCGCGCC GTACTTTATC GAGCCTGTAC TCACAAACTT TGGCGATGAG
CCGAGCGCCA ATGTGCGATC ACAACTGCTG TCGAGCGCGA TGAAGCTCTT CTTTAAGCGC
GCACCGGAGA TGCAAGCGAT GCTCGGCGCC GCCTTGCTTG CGGGCTCGTG CGACACAAAC
CAGGAAGTGC GCGACTTGGC CAGTTTGTAC TATCGCCTGT TAGAGCGCGA TGTGCGCGCG
GCGGAGAAAG TAGTGAACTC GCGTGATAAG TCATCGCCGA TTTACACCTT CAAAGAAACC
GTGATAGAGG ATGAGACGTT CGACAAGGTG TTCAACGAAT TCAACACATT GTCCGTGCTG
TACGAACGTC CAGAAGTGAA ATTTGTGGAC CCGGACGCGT TCACTCGTCG CGCGCGCGTG
GATGCCGACG AAATGGACGA CATCGCCGCG GGCGGTGGTG GCTCATTGAT CGACCACTCG
ATGGATATGA TTGACCTGGG GGACACCGAC GAGGAAAAGG CGTCCGCCAG CGGCGGTTCG
GCGGTGGATT TGCTCTCGCT CCTCGACGTA GACGTACCGG CGGCGCCCCA ATCTGTCGAT
GCGCCGCCGG CAGTATTCGC GCTCGATCCA CTACCAGCGC TCGATTCGAC GAGCTTTCAA
ATGAAGTGGA CGTCCGCCGC AGTGGTGGCG ACGGGATTGC AAGCGACGCT GCGATCGAAC
GCGCTCGCGA GCGCGGCGCA AGTTACGCAA CATCTCGCAC CTCGAGGCGT CGCCACCATG
GCGAGCGGTG GACCGTCGAA TGCGATGAAG TTTTATTTCT ACGCCATCGA TAATGGAAAC
CGCGAAATCT TTCTAGTGGA AGCGCTCATC GACGCCACGG CGCGTGCGGC GACGTTTACG
GCAAAGTGCG ACGGCCGCGG GACGAATTTC GTCGCGTTTC AACGTCTCTT CGCCGAAGCG
TTGTCATCGA TTCCTTAG
 
Protein sequence
MPSTAKSPSK RADDVAELSK ALKLLSSKQS DDDRANAQRR HVLRRVLNLL TIAVDVSKLF 
PDVVLNAHTV DVACKKLIYA YICHHARRER ELATLAVNAL QKDCASANET IRGLAIRSIA
GLGVDDLIEY ATACVMAGLR DAGGYPRAVA AMGALKVYDL NPSAVRETGI LDALREMLVN
DTDAGVVGNC LIVLREIDGI ESLATKPIVY ALINRIKSFS EWNQALILEL VGAYEIQNKD
ETFDIMNALE SRLSAPNSAI VLGTVKVFLN ITLEMPDVHQ QVLERIKAPL FTLANGGTVE
TSYAVWAHVR LLVKRAPILF STDYKNFYFR GSDSGAVKSL KLSMLVAVAD AQNTYDIVTE
LTEYVTDADI GIARAAVRAV GEIALSAADD LEGIVDRLLQ YFDLDIEHVT AETIISVVNV
LRKRPKYAVQ CVQAIKNIDL IDVVPSRARG ALVWMYGEYG EDIPLAPYFI EPVLTNFGDE
PSANVRSQLL SSAMKLFFKR APEMQAMLGA ALLAGSCDTN QEVRDLASLY YRLLERDVRA
AEKVVNSRDK SSPIYTFKET VIEDETFDKV FNEFNTLSVL YERPEVKFVD PDAFTRRARV
DADEMDDIAA GGGGSLIDHS MDMIDLGDTD EEKASASGGS AVDLLSLLDV DVPAAPQSVD
APPAVFALDP LPALDSTSFQ MKWTSAAVVA TGLQATLRSN ALASAAQVTQ HLAPRGVATM
ASGGPSNAMK FYFYAIDNGN REIFLVEALI DATARAATFT AKCDGRGTNF VAFQRLFAEA
LSSIP