Gene OSTLU_88822 details


Gene Information

Locus tag: OSTLU_88822
Symbol:
ID: 5004877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009366
Strand:
Start bp: 391947
End bp: 394271
Gene Length: 2325 bp
Protein Length: 774 aa
Translation table:
GC content: 60%
IMG OID: 640420298
Product: predicted protein
Protein accession: XP_001420678
Protein GI: 145352705
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5028] Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3
TIGRFAM ID:
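
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the unspliced CDS ends in a stop codon; the record does not state either convention explicitly. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(391947, 394271)
print(length)                           # 2325
print(expected_protein_length(length))  # 774
```

Under these assumptions both values match the Gene Length and Protein Length fields of the record.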


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.355252
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00889443
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAACTTTG TGGAGGATTT CGCGTCTCTG AACCTCGGTC CGGGCGGTGG TCCCGGTGGG 
GAGGCTGGAA TGGATCCCGC GACGTTTCCT CGACCGGGCG ACGACGAAGC GCGTCCGAAC
TTGGAGCTTT CGTGCGATCC CAAGTACATG CGTCTGACGT GCGGGGCGCT GCCGTCGAGC
CCGAGCTTGA AGACGAGGTT CGCCATGCCG CTGGGATGCA TCGTCCAGCC GCTCAGACCT
GGAGATGAGA CGACGGTGAA GACGGCGCAT TTCGGGAGCT CGGGTATCGT GCGATGCCGC
AGATGTCGAA CTTACATAAA CCCTTTCGTG CAATTCACAG ACGGCGGGCG GCGATTTAGG
TGCAACGTGT GCGCGCTGCC CAACGAAGTG CCGGTGGATT ACTTTTGCAC GCTCGACGCG
AACGGGGTGC GTCGAGACAT CGCCGAGCGC CCTGAGTTGA ACAGTGGAAC GGTTGAATTT
TTAGCGAGCC AAGAGTACAT GGTGCGACCG CCGATGCCGC CGTCCTACTT CTTCGCCTTG
GACGTGTCGC ACACGGCGGT GAATAGCGGC TTTTTGAAAC AAACGGTGGA GGTGATTCGG
GACTCCCTCG ACGTCATGTC GAAGAAGAGC GAGCGCACGC GGGTCGGATT CTTGACGTAC
GATTCGACGT TACACTTTTA TAGCCTGAAG GCGAATCAGT CTCAGCCGCA AATGATGGTA
GTCGCCGAGC TCGACGACCC GTTTTGTCCG ATGCCAGACG ACTTGCTGGT GAATCTCGCC
GAGTCGCGCG CGGTGATTGA TGCATTTTTA GACATGGTGT GCGATACGTA CGCGCAGACG
CAAAACATGG AAAGTGCCAT GGGCCCGGCG ATTCAAGCCG CGTTCTTAGC TATGTCTCAC
ATCGGTGGTA AGCTTCTCGT GTTCCAATCC TGCCTGCCCA CGCTCGGCGC GGGACGCATG
ATCAACCGCG ACGACACGCG AGCGAGCACG GATAGCACGA AGGAACACCT GCTTCGCGGC
CCGGTCGATG GTTTCTTCAA GAAGACCTCG GCAGAGTGCT CACGACATCA GATTTGTATC
GATTTGTACA CCATCGCGGC GCCGTTCTCT GATTTGGCCT CCATGGCGGT GTTGTGCAAG
TTCACCGGAG GCGAGTTGCG ACATTACCCC GGTTTTACGC CGGACAAGGA TGGGGTAAAG
TACGCAAAGG AGCTGAAAAA TAATCTCACG CGCTTCACCG CGTGGGAAGC CGTGTGTCGG
GTGCGATGCA GTCGAGGATT TAGAATCTGC GCCTTCAACG GGCACTTCTT CATTCGATCG
ATGGACTTGC TCGCGCTCCC GGCGACGGAT GGCGACAAGG CGTACGGCGT GCACATCGCG
CACGACGAAG TGGTTCCGAG CACGAACATT TCGTACTTGC AGTGTGCGCT ACTGTACACC
TCCGCAGAAG GAGAACGCAG AATCCGAGTG CACACGATGG CGGTTCCGGT GGTGACAGAC
ATAGCAGAGA TGTACCGCGC CGTGGACTGC GGCGCCATGG GCGCGTTCAT GGCGCGTTTG
GGCGCCGAGC GCACGCTCAC GGTGCGATTG CAAGATGCGC GCGAGGCGGT GATGACCAAG
GTTGTCGCCA CGTTGCGTGA GTTCAAGTTG CTCAACACGC AAGCGTCGAG AGCGTTCAAT
AGGCTCATCT TCCCCGAGAG CATGAAGTTA CTTCCGTTGT GGATCTTCGC CGCGAGCAAG
AGCACGGCGA TGCGAGGCGG CCCGCGAGAC GTCCCCGTCG ACGCGCGGAT CGCCGCTGTG
TACGACTTCA TGTCTGCCTC GACGGAAGAA ATCTTAAAGC TGCTGTATCC CACGATGCAC
GCCTTGCACA CGATGCCCGA GGAAGCGGGT ACGAAGGACG AATACGGCAG AGTGATTTTG
CCGCCGCGCA CCGTCCTCGC GGGCGAGCGC ATCGACGCTC GCGGCGCCTA CCTCGTCGAC
GACGGTCGTC GCCTGCTTTT ATGGCTCGGA AAGATGCTCG ACCCACAGTT CGTCGCCGCT
TTGTTCGGCC CTAGCGGTCC TCCGAGCGCG GATGTGGACT GCAACCTCCC TCATCTGGAC
ACCGACGTCT CGCGTCGCGC GCGCGCCGTC GTCGACGACA TACGCGCCGA GGCGTCCCGC
GCGCGTCATC TCGCCCTCAC CGTCGTCATC CAAGGCCATC CGAGTGAGAC GCAATTATTC
CCTTACCTCA TCGAAGATCG AGGCGCGGCC AACGTGCCCG GCGCGTCGTC CTACGGCGAG
TTCCTAGTGC AGCTTCACAG GCAAGTCTCC GCCGCGCAGC GGTGA
 
Protein sequence
MNFVEDFASL NLGPGGGPGG EAGMDPATFP RPGDDEARPN LELSCDPKYM RLTCGALPSS 
PSLKTRFAMP LGCIVQPLRP GDETTVKTAH FGSSGIVRCR RCRTYINPFV QFTDGGRRFR
CNVCALPNEV PVDYFCTLDA NGVRRDIAER PELNSGTVEF LASQEYMVRP PMPPSYFFAL
DVSHTAVNSG FLKQTVEVIR DSLDVMSKKS ERTRVGFLTY DSTLHFYSLK ANQSQPQMMV
VAELDDPFCP MPDDLLVNLA ESRAVIDAFL DMVCDTYAQT QNMESAMGPA IQAAFLAMSH
IGGKLLVFQS CLPTLGAGRM INRDDTRAST DSTKEHLLRG PVDGFFKKTS AECSRHQICI
DLYTIAAPFS DLASMAVLCK FTGGELRHYP GFTPDKDGVK YAKELKNNLT RFTAWEAVCR
VRCSRGFRIC AFNGHFFIRS MDLLALPATD GDKAYGVHIA HDEVVPSTNI SYLQCALLYT
SAEGERRIRV HTMAVPVVTD IAEMYRAVDC GAMGAFMARL GAERTLTVRL QDAREAVMTK
VVATLREFKL LNTQASRAFN RLIFPESMKL LPLWIFAASK STAMRGGPRD VPVDARIAAV
YDFMSASTEE ILKLLYPTMH ALHTMPEEAG TKDEYGRVIL PPRTVLAGER IDARGAYLVD
DGRRLLLWLG KMLDPQFVAA LFGPSGPPSA DVDCNLPHLD TDVSRRARAV VDDIRAEASR
ARHLALTVVI QGHPSETQLF PYLIEDRGAA NVPGASSYGE FLVQLHRQVS AAQR
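
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up together with a GC fraction calculation; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGAACTTTG TGGAGGATTT")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.35
```

Passing the full gene sequence through `clean_sequence` should yield a string whose length can be compared with the Gene Length field, and `gc_fraction` on that string can be compared with the reported GC content.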