Gene OSTLU_31624 details


Gene Information

Locus tag: OSTLU_31624
Symbol:
ID: 5001701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 243571
End bp: 244730
Gene Length: 1160 bp
Protein Length: 381 aa
Translation table:
GC content: 55%
IMG OID: 640417122
Product: predicted protein
Protein accession: XP_001417962
Protein GI: 145346988
COG category: [R] General function prediction only
COG ID: [COG3491] Isopenicillin N synthase and related dioxygenases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.492816
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00491035
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
CGGCTTTCGA CGGCATGCCC GCCGCGCGCG AAGACGCAGC GTCCGAGTAC CCCGTCATCG 
ATGTCGGCCC TTTACTCAGC GCCGGCGATC CGAACGAGGA CGTTCGCTTC ATTCTTGCCC
GACGACGCGA GGTCGGTCGA GCGCTTCTCG CCGCGTGCGA ACGGTTCGGC TTCTTTTCCA
TCGTATGCAC CATTCAAAAG GAATCAATCC CGTGGGCCGA CGTCGTTGAC CTGTTAACGC
TGAACACCCG TGATGATTTG CCCGACGACT TTCTCTGGGA GTACGCGCGG CATGACCGGT
CAGAAGGGTG TCTGATCGAG GGGAAAGATT TGACCTTTGC ACACGTCGAC GCGTGGTTTT
CACAGAAACA GGCTGACAAG GACGCGTACG CGATGACGAA CGGAAGAGGA TATCAACGAA
TAGGGGAGAA TGTGACGAAT GGGAGACGAG ATCAGCACGA AGCCATTGAT TTTTATCGAC
CGTGTGCCGT TAGTGATGGT GGTTTGCGCG CGCCGCATCC GTACATTGGT CCGAACGATT
TGAACGAGCG CGTCGACAGA TACGCGAAAG GGATGACGCG CATTGGTAAA TATATTTTGC
GCGCGCTATT GGGCGCGATT CGGAAAGAAT ACCTCCACTA TCGCATAGCG GACGACTTCA
CAGAGGACAT ACTCGAGGAT GACATCGCGG GATGTCCGTT TTGGATCTTA CGGCTCATAA
ACTCGCCCGG TTGCGATTCC GATGGCGAGA AAACTTCGTG CGGTTGGCAC ACCGACTACG
GTTTGCTGAC TTTCATTCAC GCCACGCATC CAGGCTTACA AATCGAAGTA TCAGGCAAGA
TTATCGATGT TCCTCACCAT CCAGAACACA TGGTCTGTAA CGTTGGCGAG ATGCTTCAGC
TCTTCACGGA CGACAGTCTC AAAGCCACCC GGCATCGCGT CGTCCGAAAG CCGAGCGACG
AAAACTGCGC GCGTCCTCGT ATTTCGATTG CGTTCTTTTA TGAACCCAAC TACGACGCTG
TGATATCGAA CAGGCATCTC ACGGATCAAA GTGCTTTGGA ATCGGGTTAC TCATCTCCGC
GCGAGGTGCG ATACGCCGAC TTCTTGCGAC AAAAGGTCGC CACGAACTTC GCAAAGGTTG
ACGAGGACCC TCGGGCGTAG
 
Protein sequence
MPAAREDAAS EYPVIDVGPL LSAGDPNEDV RFILARRREV GRALLAACER FGFFSIVCTI 
QKESIPWADV VDLLTLNTRD DLPDDFLWEY ARHDRSEGCL IEGKDLTFAH VDAWFSQKQA
DKDAYAMTNG RGYQRIGENV TNGRRDQHEA IDFYRPCAVS DGGLRAPHPY IGPNDLNERV
DRYAKGMTRI GKYILRALLG AIRKEYLHYR IADDFTEDIL EDDIAGCPFW ILRLINSPGC
DSDGEKTSCG WHTDYGLLTF IHATHPGLQI EVSGKIIDVP HHPEHMVCNV GEMLQLFTDD
SLKATRHRVV RKPSDENCAR PRISIAFFYE PNYDAVISNR HLTDQSALES GYSSPREVRY
ADFLRQKVAT NFAKVDEDPR A
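
The lengths in the Gene Information fields can be cross-checked against the coordinates and the printed sequences. The sketch below is illustrative only: the helper names (clean_sequence, span_length, gc_percent) are hypothetical and not part of any database tool, and it assumes the Start bp and End bp values are 1-based and inclusive, which agrees with the listed Gene Length of 1160 bp. It also shows that a 381 aa protein plus a stop codon accounts for 1146 bp, leaving 14 bases, and the printed gene sequence does begin 14 bases before its first ATG.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in space-separated 10-letter groups."""
    return "".join(block.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence, copied from the record.
first_line = clean_sequence(
    "CGGCTTTCGA CGGCATGCCC GCCGCGCGCG AAGACGCAGC GTCCGAGTAC CCCGTCATCG"
)

gene_length = span_length(243571, 244730)  # Start bp and End bp from the record
cds_length = 3 * 381 + 3                   # 381 codons plus one stop codon
leading_bases = gene_length - cds_length   # bases printed before the start codon

print(gene_length, cds_length, leading_bases, first_line.find("ATG"))
```

Passing the full cleaned gene sequence to gc_percent gives a value to compare with the GC content field, which the record reports rounded to a whole percent.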