Gene OSTLU_32727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32727 
Symbol 
ID5002944 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp537403 
End bp539686 
Gene Length2284 bp 
Protein Length735 aa 
Translation table 
GC content61% 
IMG OID640418365 
Productpredicted protein 
Protein accessionXP_001418964 
Protein GI145349072 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00387308 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.976838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGC GAGGGAGCGG CGCGAGCGCG TCCGTGGGGA CGACCGCGCG GGCGTGGATC 
GCGGCGTCGG CGACGCGCGC GGCGCGCGCG GACGCGCTCG GACGCGGATA CGCGACGAAA
ACGACGAAAA CGACGACGAA GTCGACGTCG GGGGATGGCG CGGCGACGGC GACGGCGAGG
GGCGGTGGAA AGGGGCGGGC GAAGGCGAAG ACGGGGACGG GGACGGCGCG CGCGGCGCCG
GCGAAGGCGA AGACGTCGAG GAAGAAGAAA CCGCCGATCG GGAAGATTCG CGCGGTGCCG
CACGCGGCGG CGAAGAAATT TCCGCTGCAT GAATCGGTGT TGGGGGATCC GAGGACGTAT
AAGGCGACGA TGGGGGACGC GACGGCGGCG GCGCGGGCGA CGGCGCGGGC GCGGGCGACG
GCGGCGCCGC CGCCGACGAG CGGCAATAGG GTGATGCCGC ACGCGAAGGC GGCGACGTTT
CCGCTGCATG AATCGGTGTT GGCGGAGGCG AAGTCGATGG CGACGAAAGG GACGGTGTTG
AGGGAGCCGT TATCGAAGCC GGCGGGCGCG GCGGCGACGG CGCCGCCCGC GGCGGCGAAC
GCGACGCCGA CGGAGGAAAG CGGGTCGAAT CGCACGGCGC TTCTAGCCGC CGCCGGTGCT
GGGATCGCGG CGCTTTTAGC GTATTCCATG AGCGTCGTGG ACGAGGAAGA TAGGCCTCCG
CCGCCGCGAC GGGCGAAACC AAAGGCGGCG GCGGCGCGGA AAGACGTCGA GGTGCAAATC
GAAATCGTCA AGCAGCCCGA CGCGAAAGAG TTGACGAACT CGAAGAAAGC CACAACCGAG
TCGAAGAAGG ATGGCAAAGA AACGGCGAAA GATGAACTGT CGGACGCGTT CAGCGCGACC
GCGGACTTCC TCGCCAACGT AGAGGTACGC AACATTTCGT ATTTTTGTGA CGCTTATTCT
CGGGGCCTTA TACTCATCTG TCGCGGCTTC TCGGTGCCGC GCCGGACCTA TCAAGGGCCA
ACATTAAATG CTAGCTTGGC GCGCACGCAT GCAAACCGCG CGTCAAGCGC CGCTGGAGTT
CTAAGGCTCG TGGCTTTACC TCGCCTCGTT CCTGTGAGCC AGCCGACATC TTTCATTCGC
AATTAACTCG AAGATTTACT CGTACTGACG GCGCGTCTTT CGCTCGCAGG TGCCGCAAGA
CGAACCGGAT CAAAAGGAAC CTGAAGTGCT CGCGAAAGTG AACGTACCGA AACCGCCCGC
GCGAGAAAAT GTCAGCATTA TGAACGCTTC GAAATCTCTG AGCGCGGCGC GGCTGCTCGA
AGAAGCCGTC AAAACACTCG GAGAAACTCT GAGCGACGAC GAAAAGGCGG CGTTAAGGGA
CGGCATGAAG GTTCAATCCG AATCCGACAT GAAAATTTTC AGCGAGTTAT CCTCGTCGAT
GGTTCAAAAC TTCGAAACTG TCGTCAGTGC CGAGCGATCG GTTACGGACG AGCTCATCGC
TGCGATCAAC GTCTTGGAGC AGCGCGCCGA GGACGCCTCT CGTCAACTCG AAAGAGAAAA
GGAGCGCGCC GTCGTCGATA AAGAACGCGC TCTGAAGACG CAAGAGAAAA AATTGAAAGC
AGAGCACGCA GACTTTCTGG TCGCTGAACG CATCGAGCGA ATCAAGGCGC TAGATGAAGA
GCGCATCCGC ATGGGAGCTT TGAGACAAGT GCTGACGAAG CGCCGCGAAG CGCTCGAGCG
AGCGCACGCA GTGCAAAGCT TTGAGCTCGC CGTCATGGAT TTCGGATCTC GCGTGGAGAA
CGGCGAAGCC TTCGAAGACG CACTAGCGCT GCTCAACACG TGCGCAAAAA AGGATCCGTT
CATCGCCACC ATCATTCAAG GCTTGGATCA AGATATGGCA AAGCGAGGCG TTCCCACGCG
CTTGCAATTG GCGGAACAGC TGGAACGCGT TCGGGACACG GCGAGAAAGT TGTCGTTGGT
TCCGCAAGAC GGTGGCGGTA TGTTAGCGCA CGGTTTAGCG TACGCCGCTT CACTGCTTCG
CGTGAAGGAT ACATCCGACG AAGGCGCGCA GGGTATTGAA GGCGCCATCG CCAAGGCGGA
AACTCACTTA GCAAACGGAG AATTGATGCA CGCGGCGAAG TCGCTTGCGA GCGCCGCGGA
GGGTACGAAA GCGGCGACTT CCGTCACCGA ATGGGCACAT AGCGTTCGTT CACGAGCCGA
GGTTGAGCAG GCTCAAACTG CATTAAACGC GCACGCGCAG TGCCGTGCAT CGGCGCTTGT
TTAA
 
Protein sequence
MAARGSGASA SVGTTARAWI AASATRAARA DALGRGYATK TTKTTTKSTS GDGAATATAR 
GGGKGRAKAK TGTGTARAAP AKAKTSRKKK PPIGKIRAVP HAAAKKFPLH ESVLGDPRTY
KATMGDATAA ARATARARAT AAPPPTSGNR VMPHAKAATF PLHESVLAEA KSMATKGTVL
REPLSKPAGA AATAPPAAAN ATPTEESGSN RTALLAAAGA GIAALLAYSM SVVDEEDRPP
PPRRAKPKAA AARKDVEVQI EIVKQPDAKE LTNSKKATTE SKKDGKETAK DELSDAFSAT
ADFLANVEVR NISYFCDAYS RGLILICRGF SVPRRTYQGP TLNASLARTH ANRASSAAGV
LRLVALPRLV PVPQDEPDQK EPEVLAKVNV PKPPARENVS IMNASKSLSA ARLLEEAVKT
LGETLSDDEK AALRDGMKVQ SESDMKIFSE LSSSMVQNFE TVVSAERSVT DELIAAINVL
EQRAEDASRQ LEREKERAVV DKERALKTQE KKLKAEHADF LVAERIERIK ALDEERIRMG
ALRQVLTKRR EALERAHAVQ SFELAVMDFG SRVENGEAFE DALALLNTCA KKDPFIATII
QGLDQDMAKR GVPTRLQLAE QLERVRDTAR KLSLVPQDGG GMLAHGLAYA ASLLRVKDTS
DEGAQGIEGA IAKAETHLAN GELMHAAKSL ASAAEGTKAA TSVTEWAHSV RSRAEVEQAQ
TALNAHAQCR ASALV