Gene OSTLU_4555 details


Gene Information

Locus tag: OSTLU_4555
Symbol:
ID: 5000641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 839408
End bp: 840469
Gene Length: 1062 bp
Protein Length: 354 aa
Translation table:
GC content: 59%
IMG OID: 640416062
Product: predicted protein
Protein accession: XP_001417069
Protein GI: 145345117
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5043] Vacuolar protein sorting-associated protein
TIGRFAM ID:
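
The coordinate and length fields in this record can be cross-checked against one another. The short Python sketch below does that, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 839408
end_bp = 840469
gene_length_bp = 1062
protein_length_aa = 354

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
span = end_bp - start_bp + 1

# Number of complete codons in the annotated gene length.
codons, leftover = divmod(gene_length_bp, 3)

print(span == gene_length_bp)       # True
print(codons == protein_length_aa)  # True
print(leftover)                     # 0
```

Under that assumption the fields agree: the 1062 bp divide into exactly 354 codons, which suggests the annotated span does not include a stop codon.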


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCGAGC AGTACCTCGC CGACGCCCTG AATAAGGCGC TCGGCGCGTA CTGCGACGGC 
ATCGACGGCG AGAAACTGCG CGTGTCCGCG TGGAACGGCG ACGTCGAATT GCGAAACGTT
CGTCTGAAAA AAACGGCGCT GTCGACGCTG CGCGCGCCGG TGACGGTCGA CGCGGGATGC
GTCGGGTCGC TGCGGTTGAA GGTGCCGTGG ATGAACCTCG GACGCGAACC GGTGGTGGTG
GAGATCGATC GAGTGTTCGT GCTGGCGTCG AGGGTGACGA TGGAGGAGGC GGCGGCGACG
GCGGACGAGA CGCGAGACGA GGAGGAAGAC GCGGCGGAGA AGAAGAAACG AATCGATGAG
GGAGAGCGAG ATTGGTTGAG GACGGCGATG GGGAAGATGA CGAAGACGAT GCGGGAGGAG
GCGGAGAGAT CGGATAGTTG GTTTTGGAAG ACGTTAAACA CGGTGCTGGG AAATTTACAA
ATAACGGTGC AAAACGTACA CGTGCGGTAC GAGGATGAAA TCACGACGCC TGGGCACACG
TTTTCGTGCG GAATGACGAT AGGAAAGTTG AGCGCGATCA CGGTGGATGA TTTTGGGGAG
CCGACGTTCG TCGCGGGAGG GTCGCTGGAA CGCATTCACA AGCGCGTGGC GTTGGAAAAC
TTTTCAATGT ATCTCGACTC GGGGGCGGTG TATCGACCGT GGAAAACGCA CGCGGGATGG
ACGCCGCCGA AAGTGGAAGA CACAGAGGCG TGGTGGGCAC TATTTGGCGT AGGGTTGGTC
GGAGAAGCGC CGAGCGATGT GCGAAACTAC ATGTTGTACC CGGTGACGGT GGAACTGTTT
TATCACCGCA AAGGACGAAA AGAACAAACC GAGGCGGGTG AACCGAGGCA AATGTGTGAC
TTGAAGTTTC AAGACGCGCG CATGGCGTTG AGTCGTAATC AATACCGCAG TACGGTCCGC
TTGCTAGAGG CCTTCAATCA GTATCGCTTG CGATTGCCGC ACGCCGAGTT TCGCCCGATG
GTGAGCGTCA AAGCTCAACC GCGCGCGTGG TGGACGTACG CC
 
Protein sequence
MFEQYLADAL NKALGAYCDG IDGEKLRVSA WNGDVELRNV RLKKTALSTL RAPVTVDAGC 
VGSLRLKVPW MNLGREPVVV EIDRVFVLAS RVTMEEAAAT ADETRDEEED AAEKKKRIDE
GERDWLRTAM GKMTKTMREE AERSDSWFWK TLNTVLGNLQ ITVQNVHVRY EDEITTPGHT
FSCGMTIGKL SAITVDDFGE PTFVAGGSLE RIHKRVALEN FSMYLDSGAV YRPWKTHAGW
TPPKVEDTEA WWALFGVGLV GEAPSDVRNY MLYPVTVELF YHRKGRKEQT EAGEPRQMCD
LKFQDARMAL SRNQYRSTVR LLEAFNQYRL RLPHAEFRPM VSVKAQPRAW WTYA
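
The gene and protein sequences above are printed in blocks of ten characters, so the spaces need to be stripped before either sequence is used. The sketch below shows one way to do that in Python, and to translate the gene sequence and compute its GC percentage. The record leaves the translation table field blank, so the standard genetic code is an assumption here. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool.

```python
# Standard genetic code (an assumption: the record's "Translation table"
# field is blank), built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGTTCGAGC AGTACCTCGC CGACGCCCTG AATAAGGCGC TCGGCGCGTA CTGCGACGGC"
print(translate(first_line))  # MFEQYLADALNKALGAYCDG
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the Protein Length and GC content fields.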