Gene OSTLU_89323 details


Gene Information

Locus tag: OSTLU_89323
Symbol:
ID: 5005720
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009369
Strand:
Start bp: 372530
End bp: 374176
Gene Length: 1647 bp
Protein Length: 548 aa
Translation table:
GC content: 63%
IMG OID: 640421141
Product: predicted protein
Protein accession: XP_001421640
Protein GI: 145354750
COG category:
COG ID:
TIGRFAM ID:
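
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the record states neither convention, and the function names are hypothetical.

```python
# Sanity check of the coordinate and length fields in the gene record.
# Assumption: coordinates are 1-based and inclusive (not stated in the record).

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(372530, 374176)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1647 548"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.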


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.883237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0403088
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCGG TCGCGCGGTC GCTCCTGCTC TGGCTGACGC TCGCGCCGAC CCTGGCGCGC 
GCGTCGGCGC ACGCGTACGC GCGCGACTCG CTGTACGCGA CGAACGACGC GGCGATCCTC
GTCGCGGGCG CGGAGGGCGT CTTCGCGTCG CGCGTGCGCC TCGAGCGCGA GACGACGGAC
GTCCGAAAGC GATGGCTCGG CGCGAATCCG CGCGTCGCGG ACGGGCGCGC GTACGTCCGA
CTGGACGCGC TGACGTTCGA GCGGCCGCGG GAGACGGCGC GGGCGAGGGG CGCGACGGGG
GGGGCGAGCG GCTTGGTCGA GGCGGCGACG TTTAAACGCG AGGACGTCGA TCGGATCGGC
GTGTGGGACG ACGAGGCGGG CGCGAGGAAG TTTTGTTGCA CGGGTGACAT GGCGAAGCGA
GGGCTGTGCG AGAAGGGAGA GGTGGGACGA TTGGTGGTGC GAGGGCGGGG CGACGGCGGC
GCGACGGCGC CGTGGAAGAC GGAGATTTGG TTCGAGGGCG ACGACGTCGA GGCGAGGAGC
GATGTGCAGG CGGTGAGCGT GCGGGAGACG GGGATGTATT ACATGTGGTT CGTGGTGTGC
GATCCGGAGC ACGCGGGGGT GACGGTGAGC GGGAGGACGC TTTGGAAGAA TCCGGATGGA
TATCTGCCGG GGGCGAAGAC GGCGCTGTTG CCGTTTTACG GCTTCGCGGC GATGGCGTAC
CTCGGGTTGG GGTTCGCGTG GGCGATGGCG TACGTGGGGA ATTGGCGACA CGTTTTAGAG
CTGCATAATT GCATCACCGT CGTGCTGGCG CTGTCGATGT GCGAGACGGC GGTGTGGTAT
TTCGATTACG CCAACTGGAA CGCCACGGGC TATCGCCCGT ACGTGTTCAC CGTGGTCGCC
GTCTTGCTCG GCAGTCTTCG CACGACGCTC AGTCGCACGC TCGTGCTCAT GATGTCCATG
GGGTACGGCG TCGTTCGCCC CACCCTCGGC GGGTTGAACG CCAAAGTGGT GTCGTTGAGC
GTTTGCTATC TCTTCTCCAC CGCCGTCAAG GACGTCGTCG AGCACGTCGG ATCGGTGGAT
GACTTGAAAC CCGGCGCGAG GTTGTTTTTG GTGCTGCCGG TGTCGGTGTT TGATTCCGTG
TTCTTGATTT GGATCTTCAA CTCGCTGTCG AGGACGCTCA CGCAGCTCGT GTTGAGACAA
CAAAAGCAAA AGCTGTCGCT CTACCGCGCG TTCACCAATC TCTTGGCGGC GAACGTCGTG
CTCTCGGTCG GTTGGCTCGC GTACGAGATG TGGTTCAAGA GCACGGACAT GATTGAAGAG
AAGTGGGAGT CGGTGTGGAT GTTGACTGCG TTTTGGCAAG CGTTATCCTT CGGTTTACTC
GCCGGCATTT GCTTTTTATG GCGCCCCGCG AGCGAGTCGA CGCAGTACGC CTACAGCGAG
CTCGCGAACG ACATCTCCGA AGACGCGTGG TGGGGCGAGC TCATCACCAA CGACGACATC
GAGCAATTCG CGGGGTCGTC GAAAATGTCC AAGTCACCGC GCGTGATGAA TAGTGCGAAG
AAGACGCGAG CGATGAACGA CTTTTCGCTC GACGCCGACG ACGACTCGGC GGCGGAAATC
GAAATGGAAA TGGGAAAGAT TGACTGA
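
The GC content field can in principle be recomputed from the gene sequence. The sketch below shows one way to do it, assuming GC content means the fraction of G and C among all A/C/G/T bases; the helper name `gc_fraction` is hypothetical, and the demonstration uses only the first 10-base block rather than the full sequence.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T bases; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


first_block = "ATGCGCGCGG"  # first 10 bases of the gene sequence
print(gc_fraction(first_block))  # prints 0.8
```

Passing the full 1647 bp sequence, with its spaces and line breaks left in, yields a value to compare against the reported GC content.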
 
Protein sequence
MRAVARSLLL WLTLAPTLAR ASAHAYARDS LYATNDAAIL VAGAEGVFAS RVRLERETTD 
VRKRWLGANP RVADGRAYVR LDALTFERPR ETARARGATG GASGLVEAAT FKREDVDRIG
VWDDEAGARK FCCTGDMAKR GLCEKGEVGR LVVRGRGDGG ATAPWKTEIW FEGDDVEARS
DVQAVSVRET GMYYMWFVVC DPEHAGVTVS GRTLWKNPDG YLPGAKTALL PFYGFAAMAY
LGLGFAWAMA YVGNWRHVLE LHNCITVVLA LSMCETAVWY FDYANWNATG YRPYVFTVVA
VLLGSLRTTL SRTLVLMMSM GYGVVRPTLG GLNAKVVSLS VCYLFSTAVK DVVEHVGSVD
DLKPGARLFL VLPVSVFDSV FLIWIFNSLS RTLTQLVLRQ QKQKLSLYRA FTNLLAANVV
LSVGWLAYEM WFKSTDMIEE KWESVWMLTA FWQALSFGLL AGICFLWRPA SESTQYAYSE
LANDISEDAW WGELITNDDI EQFAGSSKMS KSPRVMNSAK KTRAMNDFSL DADDDSAAEI
EMEMGKID
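
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch that assumes the standard genetic code, since the Translation table field of the record is blank; the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence.
print(translate("ATGCGCGCGG TCGCGCGGTC GCTCCTGCTC"))  # prints MRAVARSLLL
print(translate("GAAATGGAAA TGGGAAAGAT TGACTGA"))     # prints EMEMGKID
```

Both fragments match the corresponding start and end of the protein sequence, which is consistent with the assumed code table and with the sequence being listed in coding orientation.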