Gene OSTLU_44386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_44386 
Symbol 
ID5004404 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp466615 
End bp468948 
Gene Length2334 bp 
Protein Length777 aa 
Translation table 
GC content56% 
IMG OID640419825 
Productpredicted protein 
Protein accessionXP_001420517 
Protein GI145352359 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5028] Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.704659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGGCG CGGCGAGCGC GACGGGGGCG ACGTCGAGGA TAGATCCGTC GCAGATTCCG 
CGACCGCAGT ACGCGAGTCA CGAGGAAAGG GTGACGTATA ACACGAGAAG CGAGATGGAT
CAGGCCACGC ATCCGCCTTC GGCGACGTTG GAGTACGTCG GATGCGACTT GGGGAGCGCC
AACCCGAGAT ACATGCGATC GACGCTGAGC AGCTTACCGA ACACGGGAGA CTTGCTGACG
ACGAGCGGAA TGCCGCTGAG CATCATGACT CGTCCTTTAG CGCTACCGCA TCCCGAGGAA
GCGCCGATTC ATTTGATAGA TAACGGCAAG ACGGGTCCGA TTCGATGCGG TCGGTGCAAG
GCGTACATGA ACCCGTACAT GCGCTTCCTG GACCACTTGA GATTTGAATG CAACTTTTGC
TATTTCGTCA CCGAAGTTCC GCACGAGTAC ATGTGTAACT TAGGAGCGAA CGGTAAGCGC
ACGGATTGGA CGGAGCGACA AGAGCTGTGT AGAGGGACGG TTGAGTACGT GGCGCCGCAA
GAGTACATGG TGCGACCGCC CATGGCGCCG ACGTATCTAT TCTTAATCGA GGTCACGTCC
CAAGCCATTC ACAGCGGTGT CACCACGAGC GCGTGCGAAG TCATCATGCG AACGTTAGAT
AGCGTTCCCA AAGATGCTCA GGTGGGGATC GCCACCGTTG ACTCAGCGAT TCACTTCTAT
CACCTCAAGG ATGGAGCCGA GAAACCGTCG ATGCTCATCG TTCCCGACGT GGAAGACTCG
TACGCGCCGC TGAAGAGTGG TCTCGTCGTC TCCTTGGCCA AAAATAGAGA AGCGATTGAA
AGTCTACTAA AGATGATTCC GGAGACGTTC TCGTCGACCG CACCTGGTCC GAATGCGTCG
ACGGCGGCCA TCAAGGCGGG AATTGAGTGT TTGAAACCGA CGGGCGGCAA GGTGATGGCT
TTCATGGCGA CAATCCCGAA CGTCGGCCTT GGCAAGCTCG AAGCTCGCAC GGGCACTGCC
GGGCAGCGCA CTGGGAACAT CGAAAAGGAG CCATTGAAGT GTATGGCGCC GGCTGACAAG
GCGTATCACA CCATGGCTAC GTACGCCGCG GAGCATCAAG TTTGCATCGA TCTCTTCCTC
TGCGTGTCCT CGGCGGTGGA TGTCGCCACG CTCGGAGTGC TACCGCGCTT GACCGGAGGT
TCGTTATACA GATACCCAGG GTTCAACGTC CAGCAAGACT TCGCCCAACT GCACAACGAC
TTGAGGTGGA ACTTCATCCG CCCGCAAGGA CTCGAAGCCG TGATGCGCGT TCGAGCGAGT
TCGGGTCTGG GCGTTCAAGA TTATAATGGA TTCTTCTGCA AACGAACGAT GACGGACATA
GATTTACCGG CTATCGATAG CGACAAGACA ATTGCGGTGA CGCTGAGATA CGAAGATAAG
TTGGTGGACG GTAGAGAGGC GTACGTGCAG TGCGCGCTCT TGTACTCAAC GACGTCGATG
GAACGTCGAA TTCGAGTGCA CACAATCGCT CTTCCGATTA CATCTGTTCT CGGTGCGCTG
TTTCGGTCGG CAGATCTGGA TGCACAGAGC GACTGGGCTG TTCGTAAAGC GGCGAACGCC
TTACTTTCGG GGAACGGCAC GCTCGCCTCG GCGAAAGACG CATCTTTGCA ACAGTGCATC
GCGACGTTGT TTGCATATCG TCGATTCTGC GCAAGCAACA ACAGCAGCGG CCAACTCATC
TTACCCGAAG GTTTGAAGAC GCTTCCGCTT TACACTTTGG GTTTGCACAA ATCATACGGC
TTACGCTCGG ACGCGTCGCC GGACGATCGA GCGGCATGGC TCTACCGAGC TTTGCACGCG
CCGCCCGAGC TAGCGACGCC CGCCGTGTAC CCGCGGCTCT TCTCCATCCA CGACTTGCCT
CAAGATTCTT CGTTCCCGCC CATTCCTCCG TGCATGTGGC TAAGTTCAGA GAAACTCAAC
CAAGATGGCG CGTACTTGTT GGAAGATGGC CAAGAAATCT TAATCTGGAT CGGTCGACAG
CTTCCTGTCG AGACGTTGCG AGATTTATTT GGCACCGAAA ACGTCGACGA CATCGTCTCT
ACGCGAGCGA CGATTCCGAA TCTCGACACT CCGGCTTCGA AGGCGTTGAA TAGTTTTATT
AACGCCATTC GTAAGCAACG CGGCGCTTTC ATGCGGGCTC GCATCCTCAA ACGCGGGGAC
ACGCTCGAGG CATTGTTTTA CAACCGACTT AGCGAGGACC GCAGTCCGGC GGGCATGAGC
TACGTCGAGT TCTTATGTCA TTGCCATCGA TTGATCATGA ACAAGTCAAA CTAG
 
Protein sequence
MPGAASATGA TSRIDPSQIP RPQYASHEER VTYNTRSEMD QATHPPSATL EYVGCDLGSA 
NPRYMRSTLS SLPNTGDLLT TSGMPLSIMT RPLALPHPEE APIHLIDNGK TGPIRCGRCK
AYMNPYMRFL DHLRFECNFC YFVTEVPHEY MCNLGANGKR TDWTERQELC RGTVEYVAPQ
EYMVRPPMAP TYLFLIEVTS QAIHSGVTTS ACEVIMRTLD SVPKDAQVGI ATVDSAIHFY
HLKDGAEKPS MLIVPDVEDS YAPLKSGLVV SLAKNREAIE SLLKMIPETF SSTAPGPNAS
TAAIKAGIEC LKPTGGKVMA FMATIPNVGL GKLEARTGTA GQRTGNIEKE PLKCMAPADK
AYHTMATYAA EHQVCIDLFL CVSSAVDVAT LGVLPRLTGG SLYRYPGFNV QQDFAQLHND
LRWNFIRPQG LEAVMRVRAS SGLGVQDYNG FFCKRTMTDI DLPAIDSDKT IAVTLRYEDK
LVDGREAYVQ CALLYSTTSM ERRIRVHTIA LPITSVLGAL FRSADLDAQS DWAVRKAANA
LLSGNGTLAS AKDASLQQCI ATLFAYRRFC ASNNSSGQLI LPEGLKTLPL YTLGLHKSYG
LRSDASPDDR AAWLYRALHA PPELATPAVY PRLFSIHDLP QDSSFPPIPP CMWLSSEKLN
QDGAYLLEDG QEILIWIGRQ LPVETLRDLF GTENVDDIVS TRATIPNLDT PASKALNSFI
NAIRKQRGAF MRARILKRGD TLEALFYNRL SEDRSPAGMS YVEFLCHCHR LIMNKSN