Gene OSTLU_42040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42040 
Symbol 
ID5006334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009371 
Strand
Start bp362331 
End bp363689 
Gene Length1359 bp 
Protein Length452 aa 
Translation table 
GC content57% 
IMG OID640421755 
Productpredicted protein 
Protein accessionXP_001422173 
Protein GI145355876 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5044] RAB proteins geranylgeranyltransferase component A (RAB escort protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones75 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCAAG AGTACGACGC CATCATCTTG GGCACCGGCC TGAAGGAGTG CCTCGTCGCC 
GGCTTGCTCG CGAGCGTCGA AGGCTACAAG ATCCTGCACG TGGACCGAAA CGATTACTAC
GGTGGGGAAA GCGCGAGTTT AAATTTGACG CAGCTGCACG AGAAGTTCGC ACCCGAGAAG
GCGCAAGACA AAGCGGCGCT GACGGCCAAG TACGGACGAT GGCAAGACTA CAACATCGAT
TTGGTCCCCA AATTCATGAT GGGAAACGGA TTGCTCGTGC GCGTGCTCGT CCGCACCGGC
GTGCACAACT ACTTGCAATT TCGCGCCGCC GAAGGGTCGT ACGTGCAAGG AAAGGGTGGA
AAGATTCATA AAGTGCCGTC GAACGATAAG GAAGCGCTGC GGAGCTCGCT GATGGGAATG
TTTGAAAAGT TGCGAGCGCG CAGCTTTTTC ATCTTTGTGC AAAATTTCGT CGAGACGGAC
CCGAGCACGC ACGGCGGATA CAACTTACAG CGCATGCCGG CGAGAGATTT GTACGAAAAG
TTTGGTTTAG CCGCGGAAAC GGTGGAGTTC ATCGGTCACG CGTTGGCGTT GAAGACGAAC
GAACGCTATT TGGACGAGCC CGCCGTCGAC CTCGTCAAGG CGGTGAGACT TTACTCCGAT
TCTATGGCGC GATTCGACAC CGGTTCGCCG TACATTTATC CGCTTTACGG CCTAGGAGAA
CTCCCGCAAG GATTTGCACG TCTCAGTGCG GTGCACGGGG GGACGTACAT GCTGGCGAAA
TCCGACGTCG AGGTCGTGTA CGACGAGGAA ACCGGTCGCG CGTGCGGCGC GAAATCCGAG
GGCGAAACCG CCAAAGCAAA GTTCGTCGTC GGTGACGCGA GCTACTTTCC AGGGAAGACT
CAAAAGGTCG GTCAAGTCGT TCGAGCGTTG TGCTTGCTGA GCCATCCGAT TCCGAATGTG
AACGACGCGG AGAGCGTACA AATTATCATT CCAGCGGCGC AGTGCGGCCG CCGCCACGAC
GTCTACGTGC TCGGCACGAG CTCGGCGCAC AACGTTTGCG CAAAGGGACG CTACTTCGCG
TCGGTGAGTA CGACTGTTGA AACGAACGAT CCTCATCGCG AACTCGAGGC TGGTCTTCGC
ATGCTCGGTC CGATCGACGA GCTCTTTTAC AACGTCACCG ACGTGCACGC GCCTCTCGCG
GATGGAACTG CCGACGGCGC GTTCATCTCC ACGGGTTACG ATGCCACGAC GCACTTCGAG
ACGACGGTTC GGGATGTCGT CGATATTTAT CGACGAATCA CGGGCAAAGA ATTAGACTTG
TCCAAGGACG ACGCGGCCGC GGCGAACACT TCGGGCTGA
 
Protein sequence
MDQEYDAIIL GTGLKECLVA GLLASVEGYK ILHVDRNDYY GGESASLNLT QLHEKFAPEK 
AQDKAALTAK YGRWQDYNID LVPKFMMGNG LLVRVLVRTG VHNYLQFRAA EGSYVQGKGG
KIHKVPSNDK EALRSSLMGM FEKLRARSFF IFVQNFVETD PSTHGGYNLQ RMPARDLYEK
FGLAAETVEF IGHALALKTN ERYLDEPAVD LVKAVRLYSD SMARFDTGSP YIYPLYGLGE
LPQGFARLSA VHGGTYMLAK SDVEVVYDEE TGRACGAKSE GETAKAKFVV GDASYFPGKT
QKVGQVVRAL CLLSHPIPNV NDAESVQIII PAAQCGRRHD VYVLGTSSAH NVCAKGRYFA
SVSTTVETND PHRELEAGLR MLGPIDELFY NVTDVHAPLA DGTADGAFIS TGYDATTHFE
TTVRDVVDIY RRITGKELDL SKDDAAAANT SG