Gene OSTLU_49385 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_49385 
Symbol 
ID5001302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp463300 
End bp466125 
Gene Length2826 bp 
Protein Length936 aa 
Translation table 
GC content55% 
IMG OID640416723 
Productpredicted protein 
Protein accessionXP_001417252 
Protein GI145345513 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5096] Vesicle coat complex, various subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0497486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGG TGAATCAGGT CGTCGAGCGC GGGTGCTCGA TGCTCGTGCA TTTCGATCGC 
GCGACGAGCG CGGTGGAGCT GAAGGAGGCG CTCGAGACGG GGAACGCGGA AGAGAAGGCG
GACGCGATGA AGAAGGTGAT CTCGCTGCTG CTGAGCGGGG AGGCGATACC GCAGGTGTTC
ATCACGATCG TGCGGTACGT GCTGCCGAGC GACGATCACA CGGTGCAAAA GCTTTTGTTG
CTGTACATGG AGATGATTGA GAAATGCGGG GCGGATGGGA AGATTTTACC GGAGATGATT
TTGTTGTGTC AAAACTTGCG AAACAATTTG CAACACCCGA ATGAGTTTTT GAGAGGGTGC
ACGCTGCGGT TCTTGTGCCG AATCACGGAG CCGGACCTGT TGGAGCCGCT GATACCTTCG
ATCGTGCAAA ATTTGGAGCA CAGGCACTCG TACGTGCGGA GAAACGCGGT GATGGCGATC
AATAAGATTT ACGACTTGCC CGACGGCGAG CACTTGATTC CCGACGCGCC GGAAATCATC
GAGTCATTTT TGATGAGCGG GGAAAACGAT TTGGGGACGC GTAGGAACGC GTTCTTGATG
TTGTACACGC ACGCCCAAGA GCGCGCGGTG AACTATTTGA TGAACAATCT GGAATCCGTG
TCAAACTGGG GGGACATTTT GCAAACCGTC GTGCTCGACC TGATTCGCAA GGTTTGCCGC
TCGGATCCGA CGCAAAAGGG CAAGTACATC AAGGTCATCT TGATGCTGCT CGGTACGAAC
AACGCATCGG TGGTGTATGA GTGTGCGAAC ACCCTCGTGG CGCTTTCGAA CGCGCCGACC
GCGATCAAGG CGGCGGCCAA CTGCTACTGC CAACTCCTCG TGAACCAAAG CGACAACAAC
GTCAAGCTCA TCGTGCTCGA TCGCTTGACT GACTTGAAGA AGGACAACAA AGAATTGTTG
CAAGCGATGA TCATGGATAT CTTACGCGCG ATCTCCTCGC CGAACATCGA CATCAAGCGC
AAGACGCTCG ATTTAGTGCT CGACTTGATC ACGCCTCGCA ACATCGACGA CGTCACGAGC
ATGTTAAAGA CGGAGGTGAT CAAGTCTCAA TCTGAGAACA CCAGTGAGAC TGGAGAGTAC
CGACAGTTGC TCGTCCAAGC GATTCACAAA TGTGCGTTGA AGTTCCCCGA AGTCGCGGGG
TCGGTAATTT ACTTGCTCAT GGATTTCCTG AGCGACGCGA ACAGCGGGAG CTCGGCGGAC
GTCGCATACT TTGTTCGTGA GATCGCCTTC ACGAACAAGT CGCTGCGTCC GGGGATTATC
GAGCACCTGT TAGATTTGTT CTCCACCATT CGTAGCAGCC GAGTGTGCGC GACAGCGCTG
TGGATCATCG GTGAGTTTAG CACGACGCAG GCGGAGCAAG AGGCGGCGCT CGAAGTCATT
CGTATGAGCC TCGGTCCGGC ACCGCTCGTC GATGGCCCGG ACGGTGAAGA AGAGGACGAA
GACACCACGG AAACGACGAC GCGCCCGGCC GTGTTGGCGG ATGGAACGTA TGCGACGCAA
GCGGCGTACT CGACTTCGGC GGCGATTTCT CAAGTGCCGA ACTTGCGCGA AATGTTGCTG
AAGGGTGACT CCTTCCTTTC GGCGGTGATT GCGAGCACGT TGACCAAGCT TGCGCTCAGA
GTGATAGGAT CTGGTTCGGT TCCTCAAGCG CAAAAGAACG CGACACAAGC CGAGTGCATG
TTATACATTG TGAGCATGTT GCGCTTGGGA ACGAGTGGCA AAGTGCCGAT CGAAATGGAT
AGAGACTCTA AGGCTCGGTT GGAGTTATGC TTCCACGTCA TCGGTCATCC GGAAGAGGCG
GATACCGACG TCTGGTTGAA ATCGTGCGGA GAATCTTTCG CTTTGATGAT TGAAGAAAAG
CTTAGACGCG AATCTCAGGC GAGCGCGAAC TCCGACGCCG CACCAGTCGC GCAGGCTGAC
GATTTGATTG ATTTCCATCA TCTCAAGTCG CGCAAGGGTA TGACGCAACT TGAGATTGAA
GATGCCGTCG CGACCGATCT CGCCCGAGCC ACTGGTTTCA TGGACTCGGT CAAGAAGAAC
GGACGTAGCC TCGACCGCGT GATGCAACTC ACCGGTTTGA GCGACACAGT CTACGCGGAG
ACGTACGTCA CCGTGCACCA GTACGACATC ACGCTTGACG TGACGATGAT CAATAGAACG
GACGAGCCGC TGCAAAACGT CATGTTGGAA CTTTCCACCA TGGGTGATTT GAAGCTTGTT
GAGCGTCCTC AACCATTTTC GTTACCACCG TTCGGTTCTC ACAACCTGAG AGCGAGCATC
AAGGTGAGCT CGACCGAAAC GGGAGTCATC TTCGGCAACA TCGTGTACGA GACCGCTCGC
TCCGATCGTA ATGTCATCGT GTTGAACGAC GTGCACATTG ATATCATGGA TTACATCATC
CCCGCGACGT GCAGCGACAC GGTGTTTAGA AGCATGTGGG CTGAATTCGA GTGGGAGAAC
AAGGTTGCCG TGAGCACAAA CATCACCGAC GTTCGCAAGT ATTTGGATCA TATCGTGACT
AGCACGAATA TGAAGTGCCT CACTCCGCCG AGCGCGCTCG ATGGCGAATG CGGGTTTCTG
GCCGCCAACT TGTACGCCAA GTCCGTGTTC GGCGAAGACG CGCTGGTAAA CGTTTCGATC
GAGAGCAACG ACGGTGAGAT CAGTGGCTTC ATCCGAATCC GCTCGAAGAC GCAAGGCATC
GCGCTTTCGC TCGGCGATAA GATCACGCTG AAGCAATCGA TCGAAATCTA GACGTGTTTA
ATCAAA
 
Protein sequence
MATVNQVVER GCSMLVHFDR ATSAVELKEA LETGNAEEKA DAMKKVISLL LSGEAIPQVF 
ITIVRYVLPS DDHTVQKLLL LYMEMIEKCG ADGKILPEMI LLCQNLRNNL QHPNEFLRGC
TLRFLCRITE PDLLEPLIPS IVQNLEHRHS YVRRNAVMAI NKIYDLPDGE HLIPDAPEII
ESFLMSGEND LGTRRNAFLM LYTHAQERAV NYLMNNLESV SNWGDILQTV VLDLIRKVCR
SDPTQKGKYI KVILMLLGTN NASVVYECAN TLVALSNAPT AIKAAANCYC QLLVNQSDNN
VKLIVLDRLT DLKKDNKELL QAMIMDILRA ISSPNIDIKR KTLDLVLDLI TPRNIDDVTS
MLKTEVIKSQ SENTSETGEY RQLLVQAIHK CALKFPEVAG SVIYLLMDFL SDANSGSSAD
VAYFVREIAF TNKSLRPGII EHLLDLFSTI RSSRVCATAL WIIGEFSTTQ AEQEAALEVI
RMSLGPAPLV DGPDGEEEDE DTTETTTRPA VLADGTYATQ AAYSTSAAIS QVPNLREMLL
KGDSFLSAVI ASTLTKLALR VIGSGSVPQA QKNATQAECM LYIVSMLRLG TSGKVPIEMD
RDSKARLELC FHVIGHPEEA DTDVWLKSCG ESFALMIEEK LRRESQASAN SDAAPVAQAD
DLIDFHHLKS RKGMTQLEIE DAVATDLARA TGFMDSVKKN GRSLDRVMQL TGLSDTVYAE
TYVTVHQYDI TLDVTMINRT DEPLQNVMLE LSTMGDLKLV ERPQPFSLPP FGSHNLRASI
KVSSTETGVI FGNIVYETAR SDRNVIVLND VHIDIMDYII PATCSDTVFR SMWAEFEWEN
KVAVSTNITD VRKYLDHIVT STNMKCLTPP SALDGECGFL AANLYAKSVF GEDALVNVSI
ESNDGEISGF IRIRSKTQGI ALSLGDKITL KQSIEI