Gene OSTLU_18874 details


Gene Information

Locus tag: OSTLU_18874
Symbol:
ID: 5006440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009373
Strand:
Start bp: 76254
End bp: 78515
Gene Length: 2262 bp
Protein Length: 754 aa
Translation table:
GC content: 66%
IMG OID: 640421861
Product: predicted protein
Protein accession: XP_001422431
Protein GI: 145356423
COG category:
COG ID:
TIGRFAM ID:
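
The coordinate and length fields in this record agree with one another, which a little arithmetic confirms. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions (the record does not state its convention); the function name `span_length` is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1

# Start bp and End bp as listed in the record
gene_length = span_length(76254, 78515)

# Number of complete codons and any leftover bases
codons, remainder = divmod(gene_length, 3)

print(gene_length)        # 2262
print(codons, remainder)  # 754 0
```

Since 2262 / 3 equals the listed protein length of 754 aa exactly, the gene sequence shown in this record appears to end at the last sense codon rather than including a stop codon.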


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00000500904
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00000019569
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGCGGG ACGATGACGC GGCGCGGGGG GCGTTCGCGG CGATGCTGGT GGACGGGGAG 
GGAGACGCGG AGGAGGACGC GCGCGCGGTG GCGAGACGGG TGGGCGCGCG CGGGAGACGC
GCGGCGGCGT CGGTGCGCGC GATCGAACGC GCGATCGAAC GCGCGGTGGG CGCGGAGGCG
TCGATGGGGA CGATGGGACG GTCGGGGTCG GGGTCGGGGG GGCGTCGAGG CGCGCAGACG
ACGGCGACGG GGCGACGCGG GACGGACGCT TCGGACGCGC TCGCGGCGGC GATGGAGACG
TTGCGCGCGC TCGCGGTCGA TGAAAGCGGC GGTGCGGTCG TCGACGCGGT CGGCGGTGGC
GACGACGACG ACGACGACGC GCGACGTCCG CGTCGCGCGA ACGAGGATTT GGTCAAAGAC
GAAGACGCGC GCGATTACGC GCGTCGCGGC GAGGATTCCA TCGGTTTACG CGACGACGTC
GACGTCGAGC GTTGGGTGCG CCGAGGATTC GAGTGTTTAC ATCGATTCGC GACGATTCGA
AGCAAACGTC GCGATTCGCG GGCGACGCTC GGTGATCGCG AGTTCGACAC GAGCGTCGAC
GCGCCCGCGC CGCCGCTCGT CGACGCCTCG GGCGGATTGG ATGAAGCTAA ACAAGCGCAC
GATCTCGACG TCGTCAGGCG CATGGACGAC GAAACTTCTG AAAGCGTTCA TAACCTCGTC
AAGTCTTGGC TCGCCGAGCA GCGCGACGCC GAACGAGCGA TGCTTGAGCT AGCCGAGCTC
GTGCGCGACG ACAACGCGGA TTTGCTCGCG ATTCAGCGAC GTGAGCTCTT GACGCGCGCG
CTCGGCGCCT GGTGCGAATT TCGCGACGTC ACCGCGGCGC TCGAGCTGGG TGTGGAAGAA
TTCCGTCTGG CCGAGAAGCG CACGCGGTTT TATCTTCGCG AGAACGAAAT GTGTTACAAG
CACGTGACGC CGCCCGGGGA TTCCAAGGCG CTGGAAATCG TGCTGAATAA GCATTTGGAT
CACGCGAGGA ATTCTTTCGC CGTCGCGGCA CCGAGCGCGA AAGAGATCGA GCGCCACGCC
AAAGAATGCG TGCTGGCGAC GCGAGCGTTG AACTCGTGTG ACGGTGATTC GTCGCGCGTT
GTCGACAAGG CGTACGCCGA TCGTGCGCTC GACGTCAAAG TCATCCTGCG AGACGTCATG
CGGGATTTAA AGCGCTCGTT TCGAGCCATC ACTTCCGCCG CGAGCGGCGG CGCTTCCTCC
ACGAACGCCG GCGCGACGTC GACAAATTAC GATGTCGAAG GTTTGCATCG AAGAATCCAA
CCCGTGATGG ACAGGTACAA ATCCACCGCC CTCGACGGCG TCGTCGACCG CGTCGTTCGA
GCGCTCAATC TTGAGGGCGC GCACTTGGCG TACGGCATCG AACTCGCAAA GTGCGCGACG
TTCGTGAAAG TCGTGAGCGG TGCGGTCAAA GAGCACGTTA AAGCACTGGA CGACGCGCGT
TTGGAAGCTG CGCTCAAGGC GTTCGAAGAT GACGACGATG TTCTCCGCGG CGGCGACGCC
AACTCTAAGA GATCCGCCGG CGGCGCGTCT TCGAAGAAGA AACGGAAATC TTCGTCGTCC
AAGGCGAAGC GAGCGCTGAC GAAAGTCGCG CGCGACGTCG AGCGCGATGA AATCACCGTC
GCTGACGACG CCGAGGACGA ACCCGAAGCG CCGACGACGC CGCGATCGAA GACCCCCGAA
CCGACGCTCG AAGACGCGGC GAATTCGGAT TCGGGCGGCG AGTGGACGGC GGCGCGATCG
CGAAGGAAGC CCAACACGCC GCCGCGACCG ACGTTCGCGA AGGAAATGGA TGCGCCTCGA
AGACCAACCG TCGCGACGAT GCCGAATCCA CCGCCGCTGC CACCGAATCC GCCGCCGTTG
CCGAGTGGAA AGCCGCCGGC GCGCATCCCG GCGCCGAGAT CGACCGCCGT CGCCGCCGCC
GCGCCGCCTC CGCCTCCGCC TCCGCCTCCG CCGCCGACGG GGCTGTCCGC GCTACCGGAT
TTTCCACCCA AGCCGAAAGT CGCATCGCAA TCGACGTCCG AGTCCGCGAC CCCCTCGAGC
GACGCCGAAA GCGCCGTCGA TCGAGAGTCG CGATTGAAGG AATTCCCGGC GTTGAAGACG
AACGATTCGC CGGCGCTCGC GGCGCCGCCG TCGAGCAAGA CGCACGAAGA AAACCAGCCA
TCGTCCAGCG TGCCCGCGGC GCCGCCGAAG AAGTCGACGA TG
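
The gene sequence is displayed in groups of 10 bases, 60 per row, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as sample input, so the result is not the whole-gene GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace used for grouping and line wrapping; upper-case the result."""
    return "".join(text.split()).upper()

def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# First displayed row of the gene sequence (60 bases)
first_row = "ATGCTGCGGG ACGATGACGC GGCGCGGGGG GCGTTCGCGG CGATGCTGGT GGACGGGGAG"

print(len(clean_sequence(first_row)))  # 60
print(gc_fraction(first_row))          # 0.75
```

Passing the full sequence text to `gc_fraction` gives a value that can be compared with the GC content field of the record.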
 
Protein sequence
MLRDDDAARG AFAAMLVDGE GDAEEDARAV ARRVGARGRR AAASVRAIER AIERAVGAEA 
SMGTMGRSGS GSGGRRGAQT TATGRRGTDA SDALAAAMET LRALAVDESG GAVVDAVGGG
DDDDDDARRP RRANEDLVKD EDARDYARRG EDSIGLRDDV DVERWVRRGF ECLHRFATIR
SKRRDSRATL GDREFDTSVD APAPPLVDAS GGLDEAKQAH DLDVVRRMDD ETSESVHNLV
KSWLAEQRDA ERAMLELAEL VRDDNADLLA IQRRELLTRA LGAWCEFRDV TAALELGVEE
FRLAEKRTRF YLRENEMCYK HVTPPGDSKA LEIVLNKHLD HARNSFAVAA PSAKEIERHA
KECVLATRAL NSCDGDSSRV VDKAYADRAL DVKVILRDVM RDLKRSFRAI TSAASGGASS
TNAGATSTNY DVEGLHRRIQ PVMDRYKSTA LDGVVDRVVR ALNLEGAHLA YGIELAKCAT
FVKVVSGAVK EHVKALDDAR LEAALKAFED DDDVLRGGDA NSKRSAGGAS SKKKRKSSSS
KAKRALTKVA RDVERDEITV ADDAEDEPEA PTTPRSKTPE PTLEDAANSD SGGEWTAARS
RRKPNTPPRP TFAKEMDAPR RPTVATMPNP PPLPPNPPPL PSGKPPARIP APRSTAVAAA
APPPPPPPPP PPTGLSALPD FPPKPKVASQ STSESATPSS DAESAVDRES RLKEFPALKT
NDSPALAAPP SSKTHEENQP SSSVPAAPPK KSTM
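
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The Translation table field of this record is empty, so the sketch below assumes the standard genetic code (NCBI table 1); the `translate` helper is illustrative, and unrecognized codons are reported as `X`.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), in TCAG order with the third base varying fastest
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3))

# First displayed row of the gene sequence
first_row = "ATGCTGCGGG ACGATGACGC GGCGCGGGGG GCGTTCGCGG CGATGCTGGT GGACGGGGAG"

print(translate(first_row))  # MLRDDDAARGAFAAMLVDGE
```

The output matches the first 20 residues of the protein sequence listed above (MLRDDDAARG AFAAMLVDGE).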