Gene OSTLU_35149 details


Gene Information

Locus tag: OSTLU_35149
Symbol:
ID: 5003763
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 454572
End bp: 456722
Gene Length: 2151 bp
Protein Length: 704 aa
Translation table:
GC content: 56%
IMG OID: 640419184
Product: predicted protein
Protein accession: XP_001419812
Protein GI: 145350857
COG category:
COG ID:
TIGRFAM ID:
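
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, that the coding region includes one stop codon, and that any remaining bases inside the gene span are non-coding (which would fit a gene flagged as spliced). Variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 454572
end_bp = 456722
reported_gene_length = 2151      # bp
reported_protein_length = 704    # aa

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_nt = (reported_protein_length + 1) * 3

# Assumption: whatever is left inside the span is non-coding
# (for a spliced gene, plausibly intron sequence).
non_coding_nt = span - coding_nt

print(span == reported_gene_length)  # True
print(coding_nt, non_coding_nt)      # 2115 36
```

Under these assumptions the span matches the reported Gene Length, and 36 bp of the span are not accounted for by the 704-residue protein.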


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0721723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0346434
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGTGG GAAGCTCGCA GAGACGCTCG GGCGGATTGT TCGGTAAGCC AGAGCGCACG 
ATAGTGTCGC GACCAACCGA GCTCGCCGAT GGTCCAAAGT TACCCGACTG GAACTATAAG
CGGCCGTTTC TCACGGGCGC GCATCTCGGG AGTGGGGCGG AGCAAGACAG AAACGTCAAG
GCGTTGAGCG ATTACGGCAC CGCCGAGCAA GAGTTGCTGG TTTTAGATGA TTTGTTATAC
GCGATGATGG GCGTCGACGG GCGATACATC AGCGCGTGGA AGGGCACAGA CGACGAGACG
AGCTCGGGTG TCGTGGTCAA AGACGATCGA CTGACGCGAG TAAAGTTTGA GGTTGAGCTG
GGGCTCGAGG CGCCGCTCGC GGCGTTGGTT AAGAATATGC TTCCGTTGTG CTCGGACGCG
GCGACCGTGC GTGCCTTCAT CGAATCTCGG CATGAGTTCA AGCATGGCTA CGTATCCCAC
GCGTTAGCGG CGGAAATGCG CGATTTGCTG AACGATTGGC ACACGTTGAT CGTGCAGCTC
GAGCACCAAC GCAACATCGG CTCACTTTCA CTTCAGGCGG CGTGGTTTTA TTGTCAACCC
GCCGCGCCGG CTTTGCGCCT CATGGCGAGC GTGGCGAGCA GAGCGTACCA TTTGAAAGGT
GCGAGCATAC TCAATCTGTT GCACCGAGAA GGCTGCGAAC ACGCTGGCGA TGGTGCGGTG
TCGGCTTTGG TGCAGCGTTT GAGCAAAGCG ACTTCGGCGC CGTACAGCCG CGCCATCGAG
CTGTGGGTTT ACGATGGACA AGTCGACGAT CCTTACGACG AGTTTTTGAT CGTTGAGCAA
CGTGAGATGA AGAAAACGTC TCTGGCCGAT GATTATAACT CGGCGTATTG GACGAAACGC
TATTCCCTTC GCGAGGAGAT TCCGCAATTC ATAGGGAAAC AACTTGCGCA AAAAATACTC
ACCACGGGGC GCTATTTGAA CGCGGTGCGT GAGACCAAGG TGTCCGCGAT TGCCGAGCTC
CCAGCGAAGC CGAGAGACGG GCTCGGTAAG ATGTACTTCG GGCCGAACAT GATCATCGGC
ACGGGGAAGT ACGCGGATCG CATTGACGAT AGATTTGAGC ACGCTTCGCG CAAATTGTTA
CAAATCATGT GGGAAGACGG CGAGTTGAAA TCACGACTGA TGAGCATGAA GATGTACTTC
TTGCTCGCTC GAGGAGATTA TCTCGTGCAT TTCCTAGACA CCGCGGCGTC TGAGTTGGAA
AAGGACGCAG ATGACATTCG TCTGCCCAAG CTTCAAACGC TTTTGGATAT CGCCGTGAAG
TCGTCCAGCA CGGCGACCGA CCACCACGGC GACGACTTGT TATGCTCTAT CGACGGTCAC
GGTCTATCGA GACAACTTTC CAGCATTGAT GATGACGACT CTGCCGCTGT GACTCCTTCG
AAAGCCACCG GTGACGGGGA CGAGCTTTCT GGATTCGATG CCTTCGTGCT CGATTACGAC
ACGCCTTGGC CGGCGAGCGT CGTGCTCAAC CGTCGTGCCG TGACAAAGTA TCAAATTCTT
TTCAGACATC TGTTCAATTT CAAGTGCGCC GAACGCGAAC TTTGTGCGGG TTGGCAGCGT
CTGCAAGTCA TGCGCGGCGC GCAACTCGGT CGCATGTTCG CCCAGGCCCA CACGCTCACG
CAGCGAATGT TGAACTTTTT GCAAAACTAT TTGTACTACA TCACGAATGA GGTCATCGAG
CCTCATTGGG ATAGGATGAT CGCGCGCGTG GACGACGCGC AGTCCGTGGA TGAACTGATC
GCCGGACACG ATGCGTTTTT GGAGGCCTGC ATGAAGGATG CGATGTTGTT CTGGCCCAAG
ATTTTGAAGC GTTTGGAGCG CGCGCGAGCG GCGTGCTTGC GCTTTGCCCG GGACAGCCAG
CGTTTCGCAG ACACCATCGA ACGGTTGAAG GAGAATAGCA TGGACGCTAT GACGGCGGAT
AGATTGGTCG CATTGGAGGA AGAGATCGAA GCCGTGACGA GCGATACGCG GTCGCAGTTC
CGGCACTTCC TCGGCGATTT ATTAAACGCT CTGAACGACG CCGGCGACGT CGACACCAAC
GTTGCGAGTC TGTTATCGCG ACTCGACTTC AACGGCTACT ATGGAATTTA G
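
The gene sequence above is displayed in blocks of ten bases, six blocks per line. A minimal sketch of how such a display could be turned back into one contiguous string and summarised is shown here; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the GC calculation is a plain (G + C) / length ratio, which may not be exactly how the GC content field in this record was derived.

```python
def join_blocks(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among all bases; 0.0 for an empty string."""
    dna = dna.upper()
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# Final display line of the gene sequence, copied from above.
last_line = "GTTGCGAGTC TGTTATCGCG ACTCGACTTC AACGGCTACT ATGGAATTTA G"
tail = join_blocks(last_line)

print(len(tail))             # 51
print(tail.endswith("TAG"))  # True: the sequence ends in a stop codon
```

Applying `join_blocks` to all of the gene sequence lines and then `gc_fraction` to the result would give a value to compare against the reported GC content.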
 
Protein sequence
MTVGSSQRRS GGLFGKPERT IVSRPTELAD GPKLPDWNYK RPFLTGAHLG SGAEQDRNVK 
ALSDYGTAEQ ELLVLDDLLY AMMGVDGRYI SAWKGTDDET SSGFEVELGL EAPLAALVKN
MLPLCSDAAT VRAFIESRHE FKHGYVSHAL AAEMRDLLND WHTLIVQLEH QRNIGSLSLQ
AAWFYCQPAA PALRLMASVA SRAYHLKGAS ILNLLHREGC EHAGDGAVSA LVQRLSKATS
APYSRAIELW VYDGQVDDPY DEFLIVEQRE MKKTSLADDY NSAYWTKRYS LREEIPQFIG
KQLAQKILTT GRYLNAVRET KVSAIAELPA KPRDGLGKMY FGPNMIIGTG KYADRIDDRF
EHASRKLLQI MWEDGELKSR LMSMKMYFLL ARGDYLVHFL DTAASELEKD ADDIRLPKLQ
TLLDIAVKSS STATDHHGDD LLCSIDGHGL SRQLSSIDDD DSAAVTPSKA TGDGDELSGF
DAFVLDYDTP WPASVVLNRR AVTKYQILFR HLFNFKCAER ELCAGWQRLQ VMRGAQLGRM
FAQAHTLTQR MLNFLQNYLY YITNEVIEPH WDRMIARVDD AQSVDELIAG HDAFLEACMK
DAMLFWPKIL KRLERARAAC LRFARDSQRF ADTIERLKEN SMDAMTADRL VALEEEIEAV
TSDTRSQFRH FLGDLLNALN DAGDVDTNVA SLLSRLDFNG YYGI