Gene OSTLU_15211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_15211 
Symbol 
ID5001559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009358 
Strand
Start bp837819 
End bp839849 
Gene Length2031 bp 
Protein Length676 aa 
Translation table 
GC content61% 
IMG OID640416980 
Productpredicted protein 
Protein accessionXP_001417618 
Protein GI145346276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0000157617 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAATG AGGTTAGCGA CGCGCACGCC GCGTGTCGAT GGATTTCAGA ACTCTTCGCC 
GAATGCGACG GTCACGAGTC GAGCGTGAAC TGCAGCTGGC GAACTCTAGA TTTAGTTTTG
CGCGTGTTGC TACGCGATAG GTATTGGGCG TGCGAAGCCA CGCGATTCGC GCTGTGCGAA
ACGTTGCTAT TGCGAAAGAG CGCGCCGCGC GCCGCGCTGC CGGCGCTCTT ACGGTTGACG
ATTCTGCGAC CCATTCGTCG AGGCGCCGAC GCCACGCGCG TCAAAGCGGC GAGCGATGCG
AATATATTCG CACTCTTGGA TACTTGGTCC GGGGAAGGTT TCGTGCGTGA GGCGTCCGTC
GAGCTACAAC GGACGCTCAC GTCGAGCGTT CGAGCAATTC TTGCGGCGTT GCCGAGCGAA
GAGTGGGACG CTATGCGAGG AAAGCCGACG CAAATGTTAT TGAAAGGCGT GAGCGCGCGT
CTCGATTCGC CATCATTGCG TCCGCGCAGA CACGCGAGCA AAGTCGCCTT GGAACTTTCG
CTGAAGATGG ACGCGAGCAA GCCTTTGCGA CTCGTCGATG ACGGCGACGA CGTCGACGCG
TCATCGGACG AGGAATCGGA ATGGGAACAG ACCATAGAGT CGATCGTTCA AGACGACGTC
TTCGTCGTCT GCAATGATGA CGACGGCGGC GGCGGCGGCG ACGATGAAAA AGAATCCATC
GCCGCCAACG TCAATCTAAC CGGTTCGTCT GGGTTTCAGA TTAAAGTCGA AGACCCGGAC
GAAGTTGTCG ATATGTGGAG CCTTCGCCGC GACGAGAGCG ACTCGGATGC CTCCGATAGC
GATGACTATT CCGATGACGA GCTCGTACCG TATGATATGG ATAGTGATGA CGACGCGACG
CTTCGTTCTC GTGATCCAGC GAGTTTGACT GAAGCGCGAA TCGCGTCGCT GCCGAAGCCG
CAAACTTTAC GAGAGTGCAT CGGCGCCTTG AGGCAGGCGC GTTCGGGCGA TGCGTCGACT
CGGCAGACCG ATATCGATAT TGCGGACGCA GCTGAAGGTG CGGTGCACGC AGTTTCCGAT
ATCGTCATGC GCCAGCCGCA CGAATTGGCG TCGTGCGCCG CGGATCTCGC TGTCACCATT
TTGCACGCGC AACCGCCCAC GCCGGACTCC GACCCACTCG ACCGCGCGCG GCGCCAAGGC
TTAGCTTCGG TGCTCACGGT GACCCCTGGT CTCGCAGGCC CAACAATTAT CGACCACGCG
TTGTCGGAGA AATGCGACGC GTCCCAAGTT ATGGACACGC TGAGCGCGTT GGACATGGCG
ATGACGGAGC TCGCGTCGCC GTCAAAAGCC GAGCACCTGC TCGATGCTTC GGCGTCGACT
CGTTCGACTG CTCGCGTGGG TATCGAGCGA AGGTTTGCGC CGAAATCCAT GGAGATGCAA
GCGAAAGGTG CGACATCGGC GCCGACGCGA AGTCATCTCA TAGGCGAGTG CTTCGTCGCG
CCGCTCTTGC GCGCGGCGGC GCAAAGGCTC GAGCGTCAGT CCGCGCCAGA ATACAACCCC
GATGGCCTGG ACGCCATGGT GAACGGACAT ATTCTCTACA CACTCGGTCA GTGCGCCAAG
CACGCCAAGA ACGCGACCGA TGGACCGTTG ATCGGCCGCG CGGTGCTTGA ATTCGCCGTC
ACTCCCGCGT TGGCAGACTC CGACCAACCC CATCTCCGCC GTTCCGCCCT CGTCGCCGGT
GCCCTCGTCG CCACTTCTCT CCGAGACACC CCCGTCGCCA TCGCATACGC CGAAAACTCC
CCTCTGTCCA CCGCTTTAGA GCATTTCACC GCCATCGCCG CGCGGCGCCA TCGCGCCGAC
TGCGACGTCG ACGTTCGCGC CGCGGCTTCC TTCGCCGTCG CCGCCGCCGC TGATTGCAAA
GCTCGCGCCT TAACAGCGCT CGAGCGCATC GCGGACGACC CAATCGCGCT CGACGACGCG
CGCTCGTCGG CAATCACCAC CCGAATCCCT CGATTAGACG TAAAATTGTA G
 
Protein sequence
MMNEVSDAHA ACRWISELFA ECDGHESSVN CSWRTLDLVL RVLLRDRYWA CEATRFALCE 
TLLLRKSAPR AALPALLRLT ILRPIRRGAD ATRVKAASDA NIFALLDTWS GEGFVREASV
ELQRTLTSSV RAILAALPSE EWDAMRGKPT QMLLKGVSAR LDSPSLRPRR HASKVALELS
LKMDASKPLR LVDDGDDVDA SSDEESEWEQ TIESIVQDDV FVVCNDDDGG GGGDDEKESI
AANVNLTGSS GFQIKVEDPD EVVDMWSLRR DESDSDASDS DDYSDDELVP YDMDSDDDAT
LRSRDPASLT EARIASLPKP QTLRECIGAL RQARSGDAST RQTDIDIADA AEGAVHAVSD
IVMRQPHELA SCAADLAVTI LHAQPPTPDS DPLDRARRQG LASVLTVTPG LAGPTIIDHA
LSEKCDASQV MDTLSALDMA MTELASPSKA EHLLDASAST RSTARVGIER RFAPKSMEMQ
AKGATSAPTR SHLIGECFVA PLLRAAAQRL ERQSAPEYNP DGLDAMVNGH ILYTLGQCAK
HAKNATDGPL IGRAVLEFAV TPALADSDQP HLRRSALVAG ALVATSLRDT PVAIAYAENS
PLSTALEHFT AIAARRHRAD CDVDVRAAAS FAVAAAADCK ARALTALERI ADDPIALDDA
RSSAITTRIP RLDVKL