Gene OSTLU_32787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_32787 
Symbol 
ID5002785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp638825 
End bp641350 
Gene Length2526 bp 
Protein Length741 aa 
Translation table 
GC content57% 
IMG OID640418206 
Productpredicted protein 
Protein accessionXP_001418999 
Protein GI145349142 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.390866 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.043937 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCCGCGGTCG CGGCGACCGC GCGCACGACC GCGGCGAGCG ACGGGAGGCG TCGAGAAGAG 
CGCGAACGCG CGTGACCCGA CGTCGCGCGA GACGCGAGCG CGACGCGCGG GGACGGGAAA
CGCGCGAACT CGCGCGAGGC GTTCGAATCG AGGACTCAAC ACCCTTCGGC GAGCGCGGCG
TCGATCGCGC GTGGCGACGG AGGGTGCGCG AAGGATAGGG CGCGCGGCGC GAGGATAGGC
GCGGGAAAGG ATGCTCGGGA TGATACCGAG CGCGCGAGCG AGCGCGACGG AGCGCGTGGC
GTTGCTGGGG ACGTCGACGA GCGCGGCGGC GAGCGAGGAT CGCGCGCTCG AGGACGAGCC
GACGACGAGT CCGGGGCGTC GAACGCGGAG AGATGGGTTG GTGAGCGTGC GTGGGATCGT
GAGTTTAGTT TCATTCGTCG CTGTTGTCGT AGCGGTGGCG TTTGGGTACG TGCGAGAGGG
TGAGAGACAC GCGCGTGCGA GGCGATTCGC CTCGCTCGGG GCGACCGGGT ATCCCGGCGA
CGGGTATCCC GGCGATGGGT ATCCCACGAA TGAGTACCCG ACGGATGGCA AGGTGCATCA
CTTGAAGGTG AAACCCGTGG GGCACGCGAT TTGGCTCGGC TCAGAGCTGC CGAACGTGAA
GCGCGCGTTC TTGAACCAGA ACAAGAAGAT TCTTCACAAG GTGGGATGGA AGATCAAGCT
TTGGCGGAGC GCGGACATTA CGCCTGAAAA CTTTCCGTAC ACCTATCACA CCATCAAGCG
TGCGTTAAAG TACAACTTAA AGCATCATGA AAACGTGTTT TCGATGATTG GGGATTTGAT
GAAGTTTGAG ATCATGTATC ACCACGGTGG GCTGTACATG GATACAAACA TCGAGCTCTT
GAGAGACCCG ACGGATTTGT TCTCCGTCAC GGCGACGCTC GGTAAGGAGG TTTTCTTAGT
GGCGGATCCC GGGGGAGACA AACGATTCGC ATCAGCTGGT TTCATGGGGG CGCTGGAGAA
GCATTCCGCA CTGTTTGCGT CGCTGATCAC CAATAAGGAA TATCTCGACG CCATTCCGTT
CGACGAGCAC TGCATCGCGA ACGCCTGGAC GGGGCCGATG TACTTGACGC TCAAGGTGTT
ACATTTCGAT AACGCTGTCA TTTTAGATCG CGATACTGCG TATCCATTGC TGTGCGGCCA
AGATGGGCGA GACATGTGCT TAGAGCAGGT TGATCACGAA GCTTTAGTGG GCGAACGCGT
GTCAAAGAAG GGCTCCAAGC TCGTCGTAGA TAAAAACGAC CCGGAGAAGA TTTGGCGCAT
CAAAGTTCCT TGCAAGGAGA TTCGTAAGCT CTACCCCGAT GCGACTGCGC TGGATCATTT
CAGTCTCGGC AAGTCGTGGA CGGGATGCGA TCCCGAGGTG AAAACGCAGC GTTTAGTAGA
CTGGACGCTC AACTACGTGC ACGATCCTTC GTCTTTCATT TTCGATTTCA TTCTCGATCA
AGTTCGTTGG TCCGAGGCTG ATAACGTAGA TGTCATGACT GAAAAGCTCG TGAATCGCCG
CATGGGCTTG CTCGGTACGG CCGACGTCTC GGCGCAGCTC GTGGTGTTAG CGAACACAGC
GCGTTCCGGC GCACAGCTGA TGTCTGAACT CTTGCGACAA GCATTCAACG CCGACGAAGA
CACGGTGGTT TGGGAAGATG ACACCGGTTT CCACTCTGAT ATTCCTTTGT ACAACAGATT
CTTGGACATT GGAGATTTGT GGTCAGAACG TGCGTTAGAG ATGGACTGGG TGACGCGGGA
CATGAAGCCG AAGATTTTGC AGCTCGGCGT CGACGAGGAC GTCTTCAACA ACTTGGTGAA
CCACCCGCGT ACGTTCCTCG ATCAGCTTTT GTTGGGCGCG GCGAGACACG GTTATCGGTA
CGTGATGACG AGAATTGGTT GGAACAACGC CTTGGGTGGT GCTCGAGCGA CCAGACGCAT
CGCACGTACG ATCTTGGATG AATTCCCCAA GGCGAACGTC TTCTTCCTTG AGCGCGCGAA
CATTCTCGCC CAATACGCTT CGTTGGCGGC GGCTGAGGAG ACGGGTGTTT GGATGAAGTT
CTCGGATCCG TTGAAACCGG AATCGAAGGA CGAAACCATA CCGACACAAG TTGTGTTTGA
CTTTGAAAAG TTCAAGCAAA CTTTCCAAGA AAGACACGAC TGGATTGAGT TCTTCATGAA
TGAAATGCGA GCGCGCAAGC GGCAATACGT GCACTTGGTG TACGAAGAGC ATTTGGCAAC
CGCGGCTCGC CAAGTACAAA CGTTGAAGAG AATCAAAGAA GCATACGGCT GGGACATCGA
CGTTAACCCG ACGTACTTGC GCAACTCGGT GCACTTAGTG CCCACGCAAG AGACGGAGCT
GCGCCAGCGT TTCGACAACC CAGAGGCGAT TCCAGAACTG CTCACGCAAG CGTTCCCGCT
CGATGAGCTC GTATAGGGAG TGACACGATT GACTCCACAC TATAGAATAA CTTAACGCAT
TCGATG
 
Protein sequence
MLGMIPSARA SATERVALLG TSTSAAASED RALEDEPTTS PGRRTRRDGL VSVRGIVSLV 
SFVAVVVAVA FGYVREGERH ARARRFASLG ATGYPGDGYP GDGYPTNEYP TDGKVHHLKV
KPVGHAIWLG SELPNVKRAF LNQNKKILHK VGWKIKLWRS ADITPENFPY TYHTIKRALK
YNLKHHENVF SMIGDLMKFE IMYHHGGLYM DTNIELLRDP TDLFSVTATL GKEVFLVADP
GGDKRFASAG FMGALEKHSA LFASLITNKE YLDAIPFDEH CIANAWTGPM YLTLKVLHFD
NAVILDRDTA YPLLCGQDGR DMCLEQVDHE ALVGERVSKK GSKLVVDKND PEKIWRIKVP
CKEIRKLYPD ATALDHFSLG KSWTGCDPEV KTQRLVDWTL NYVHDPSSFI FDFILDQVRW
SEADNVDVMT EKLVNRRMGL LGTADVSAQL VVLANTARSG AQLMSELLRQ AFNADEDTVV
WEDDTGFHSD IPLYNRFLDI GDLWSERALE MDWVTRDMKP KILQLGVDED VFNNLVNHPR
TFLDQLLLGA ARHGYRYVMT RIGWNNALGG ARATRRIART ILDEFPKANV FFLERANILA
QYASLAAAEE TGVWMKFSDP LKPESKDETI PTQVVFDFEK FKQTFQERHD WIEFFMNEMR
ARKRQYVHLV YEEHLATAAR QVQTLKRIKE AYGWDIDVNP TYLRNSVHLV PTQETELRQR
FDNPEAIPEL LTQAFPLDEL V