Gene OSTLU_30597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30597 
Symbol 
ID5001067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp433308 
End bp434882 
Gene Length1575 bp 
Protein Length524 aa 
Translation table 
GC content67% 
IMG OID640416488 
Productpredicted protein 
Protein accessionXP_001416947 
Protein GI145344870 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.355586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0445175 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCGCC CGACGCGCGC CGAGCTCGCG CGTCACGCCC CGGCGCTGGC GTGCTACTTT 
TTCCTGACCC TCGCCGTGGA GTCGTCGACG ACGCCGCTCG CGCTGGTGCG GAATCGCGCG
CTCGCGTGGG ACGAAACGCC CGAGGGCGCG AATCGCTTTT ACGCCTTTGT GTTCGCGGTG
GCGACGTTAA AGCCGCTGTA CGCGTCGGTG AGCGACCGAT CGCGCGCGAG GGGGGGCACG
CGAGCGGCGC ACGTCGCGCT CGGATGCGCG GTCGCGCTGG CGGGCGCGCT CGGGTCGAGC
GCGGCGCGGA CGACGGCGCA GACGTACGGG TTCGGGACGC TGGCGAGCGC GGGCGCGGCG
CACGCGTACG CGTCGTTGGA TGGATACGTC GTCGAGAGGT TCGGCGGGGA AGCGGGAAGG
TCGAGGGATG AGGTGGTGAT GGCGCAGGCG TGCGCGATGG CGGCGAGGAC GGCGGGGAGC
GTGGTGGGGG ATTTAGCGAG CGCGGGAGGA CTGGCGGCGG CGAGCGCGCG GACGGCGGCG
GCGGCGAGCG GGATTTGGAT GCTCGTCGCG ATCGCTGTGG CGTTAGTGAG CGTGGATGAG
AGTGATATAT CGCGAGATGT CGATTCGGGA AGAGAAGATG AGCGTGAAAT GGAGTCGCGG
TCGTGCGCGT CGTGGACGGC GCGAGCGAAG GAGGCGTACG CGCCGCTCGC CGAGGTTGAT
TTTTTACGGT GCGCCGCGTT GGTGTTTTTA TACCGCATCG CACCGACGGC GTTGGATACA
TTCGCGTCGT ATACGTACGC CGTGTTCAGC GATAGGATGA AGGATTATGA GTTTGGTTTG
GTGGCGTTCT TTACGTCGCT CGGCGCGCTC GCCGCGCCGG CGGCGTTCGG TTGGGCGTTC
GGAGACGCAA GCGCGTCTGG TAGTTCTGTG GGTGAAAACG ACGCCGGAAC GTTGACGAAG
ATTCGTGCGC TTCTCGTCTC GTCGCCCACG TGGATGATGT TCGTCTTCGG CGCCGTCGTA
GACGCGGCGC TGGGTCTCTG TCGACTCTTC ATCGTGTGGC GGCCGCCCGC AACCGGCGCC
GTGGCGGCGT TATCTATCGT CAACGCGCTC GCAATTTTTG GCTTGCGCGT GGGTTACATG
CCAATCGTCA CATTAGGCGC GATCATGGCT CCGCAAAACC TCGAAGCCGT CGGTTTCGCG
GCGCTGATTT TCGCCAGCGA CGTCGGCGCG CTCGTCTCCG CCTACGTCTC CGCCGGCGTC
GTCCGCGCCC TACACATCGG TGCGCCCACG CGCACGGACA CCACCGGCGC CGTCATTCCA
ACCGATCGTT CGTGGTCACC TCTCACCGCC TTCCTCGTGC TCGTCGCCGC GTGCAAGATC
ATCATCCCGT GCGTCTCCGC GCCGCCGCTC CTTTCGTCGG CGTCTCGGCG TTCGCGCGCC
GCCGACTTCT CCCTCCTCCC CGCCGACGCC GATCGATCCC ACGCCACCGT CGACGACGCC
GCGCGCGATT CCAACGCGTC AACGCGGCCG TCTTCTCCCC CTTTCGACCT CGCATCGCCG
TCCGCGGAGC TGTAA
 
Protein sequence
MTRPTRAELA RHAPALACYF FLTLAVESST TPLALVRNRA LAWDETPEGA NRFYAFVFAV 
ATLKPLYASV SDRSRARGGT RAAHVALGCA VALAGALGSS AARTTAQTYG FGTLASAGAA
HAYASLDGYV VERFGGEAGR SRDEVVMAQA CAMAARTAGS VVGDLASAGG LAAASARTAA
AASGIWMLVA IAVALVSVDE SDISRDVDSG REDEREMESR SCASWTARAK EAYAPLAEVD
FLRCAALVFL YRIAPTALDT FASYTYAVFS DRMKDYEFGL VAFFTSLGAL AAPAAFGWAF
GDASASGSSV GENDAGTLTK IRALLVSSPT WMMFVFGAVV DAALGLCRLF IVWRPPATGA
VAALSIVNAL AIFGLRVGYM PIVTLGAIMA PQNLEAVGFA ALIFASDVGA LVSAYVSAGV
VRALHIGAPT RTDTTGAVIP TDRSWSPLTA FLVLVAACKI IIPCVSAPPL LSSASRRSRA
ADFSLLPADA DRSHATVDDA ARDSNASTRP SSPPFDLASP SAEL