Gene OSTLU_37056 details

Gene Information

Locus tag: OSTLU_37056
Symbol:
ID: 5001373
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 107551
End bp: 108861
Gene Length: 1311 bp
Protein Length: 415 aa
Translation table:
GC content: 68%
IMG OID: 640416794
Product: predicted protein
Protein accession: XP_001417428
Protein GI: 145345882
COG category: [S] Function unknown
COG ID: [COG5505] Predicted integral membrane protein
TIGRFAM ID:
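
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon (the record states neither convention). The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def noncoding_bases(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span not covered by the codons plus one stop codon."""
    coding_bp = (protein_length_aa + 1) * 3
    return gene_length_bp - coding_bp


gene_len = span_length(107551, 108861)
print(gene_len)                        # 1311
print(noncoding_bases(gene_len, 415))  # 63
```

Under these assumptions the span matches the listed gene length, and the 63 bases left over would be consistent with the record marking the gene as spliced.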


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.0190175
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCGCG ACGCGCTCGC GTCGGTCGCG CGCGCGCGCG CGCCCGCGCC GGCGCGCGAA 
CGCCGCGCGA AGTCGCGCGT TCGGGACGGC GGTGCGTCCC GAACGCGCCG AACGCGCGAC
GCCGCGACGC GCGCGCTCAT CGCGCCGACC GACGCGCTCG GTGTTTGGAC CGCGGTGCTC
GCGTGCGGTG CGTTCGGGCT TTGGGCGGAG AAGCGCCCGT GGGGAGCGAA CGCGGGCGGC
GCGCCGCTGG TGAGCACGCT CGCGGCGCTC GCGCTCGCGA ACGCGGGAAT CATGCCGACG
GACGCGCCGA CGTACGGGAT GATTAATGGG TTCTTGTTAC CGCTCGCGGT GCCCATGTTG
CTGTTCACGG CGGACGTGCG ACGAGTGCTG CGCGGATCGG CGCGCTTGTT GCCGTGTTTC
GTCGTCGGCG CGCTCGGGAC GACGCTCGGG ACGCTGGGGG CGTTCGCGGC GGTGCCGATG
ACGGCGCTAG GAGCGGAGGG GTGGAAGATG GCGAGCGCGT TGATGGCGCG ACACATCGGG
GGGGCGGTGA ATTTCGTCGC CGTGGCGAAC GCGTTGGAGA TGACACCGAA TATCATGGCG
GCGGGTTTGG CGGCGGATAA TTTGATGAAC GCGTTGTACT TTATGGGATT GTTCGCGCTG
GCGAAGGGCG TGATGCCGAA AACGCGCGAC GGCGGAGACG TCGACGCGCG CGAGAGCCAG
GGCGATGGCG ACGTCGCGCT CGACACGGTC GCGCTCGTTG AGGGCGAGGG CGCGGGTGAG
CCTTTCAGCG CGTTGCGCGC GTCGTACGCG CTCGCCGTGG CGGCCTCCGT CGGATACGCG
GCGAAGCTCA TCTCCGCCGC GCTCAATCTG CGTGGCATGG ACATCCCGAT CATCACGTTG
ATAACGGTCG TTCTCGCGAC GGCGATACCT AGACGTCTCG GAGCGCTCGC GGGCTCGGGC
GAGGCGCTCG CGACGCTCGT CATGCAAGCC TTCTTCGTCG CCGTCGGCGC GTCGGGATCC
ATCACCCACA TGCTCACCAC TGCGCCGTCC TTATTCGTCT TCTCCTGCCT ACAGGTCGCC
ATTCACTTAG CATTCCTCCT CGCCGTCGGC AAAGCGCTCA GATTCGACAA AGCGAACGCT
TTACTAGCCT CCAACGCGTG CGTCGGCGGT CCCACCACAG CCGCCGCGAT GGCTTCGGCG
AAAAATTGGA AATCCCTCGT CGTCCCCGCC ATGCTCGTAG GCGTTCTCGG GTACACCGTC
GCCACCTTCC TCGGAATCGC GTTCGGCAAA ACTGTCTTAG CTCGCATGTA G
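
The gene sequence is displayed in blocks of ten bases, so the spaces and line breaks have to be removed before its length or GC content can be compared with the values listed under Gene Information. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as input for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only, as an illustration.
first_line = "ATGTCGCGCG ACGCGCTCGC GTCGGTCGCG CGCGCGCGCG CGCCCGCGCC GGCGCGCGAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Passing the full sequence text through the same two functions gives a length and GC fraction that can be checked against the listed gene length and GC content.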
 
Protein sequence
MSRDALASVA RARAPAPARE RRAKSRVRDG GASRTRRTRD AATRALIAPT DALGVWTAVL 
ACGAFGLWAE KRPWGANAGG APLVSTLAAL ALANAGIMPT DAPTYGMING FLLPLAVPML
LFTADVRRVL RGSARLLPCF VVGALGTTLG TLGAFAAVPM TALGAEGWKM ASALMARHIG
GAVNFVAVAN ALEMTPNIMA AGLAADNLMN ALYFMGLFAL AKGVMPKTRD GGDGEGAGEP
FSALRASYAL AVAASVGYAA KLISAALNLR GMDIPIITLI TVVLATAIPR RLGALAGSGE
ALATLVMQAF FVAVGASGSI THMLTTAPSL FVFSCLQVAI HLAFLLAVGK ALRFDKANAL
LASNACVGGP TTAAAMASAK NWKSLVVPAM LVGVLGYTVA TFLGIAFGKT VLARM