Gene OSTLU_30814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30814 
Symbol 
ID5000570 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp793834 
End bp795068 
Gene Length1235 bp 
Protein Length389 aa 
Translation table 
GC content59% 
IMG OID640415991 
Productpredicted protein 
Protein accessionXP_001416764 
Protein GI145344489 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0754602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCGCGGCACA TCGCGCGGCA TGGCGACGCC GACGGTGATC GAGTGCCGCG TGCGCGGCGC 
GTACTGGCAC GTGAAAAACC ACCCGATGTC CGGCGTCACG CTCGTCGGGT GGGCGAGGAC
GCTCTGGGCG CACGGACGGT CGATCGATGC GGTGGCGTTC GCGCCGAGGC TGATGTTTCT
GACGTGCATG GCGCTCGCGA ACACGCTGGC GGCGATCGCG GACGGCGCGC TGCGCCCGAG
GTGGGGTCGG ACGAAAGTGC GAGACGACCC GGTGTTCGTG CTGGGACATC CGAGGACGGG
GACGACGCAC TTGCATAATA TATTGGCGAA AGACGAGACG CGCTTCGCCG CGGCGACGAC
GTTCGACGTC GGGTTTCCGA GCGGGTTTCT CTCGAGCGGG TTCGTGAAGC CGTACCTGGC
GAAAATGATG GATTCGACGA GACCGATGGA TAACATGGCG CTGACGATGG ACACGCCGCA
GGAGGACGAG CTGGCGACGA ATCAATTGAG CGGGTGCGCG TCGCCGTACG CGCCGCTGAT
GTTTATGCGA GACGAGGCGA AATTTCGCAA GTATTACGAG CTTCGAGAGG ATCACGACGA
GTATCCCATC GAGCGCGCAG AGCTGGAGGC GTGGAAATCG GCGTTCATGA CGTTCATGAC
AAAGTTGCAG TACAAGCACG GGGAGCACAA GCGGTTGGTG TTGAAGTCGC CCGTGCACGC
GGCGCGCGTC GAGGTGCTTC GCAAACTCTT TCCGCGAGCG CAATTCGTGT TCATTTCTCG
TCACCCGTAC GATGTTTTCA GATCTGCGGT AAACATGGCG GACAAGTACT ACTGGCAGTG
CTTTTTGCAA CGCCCCACCG TGGCGGACGT GCAGGAATTC ATCCTCAAGC AGGGAGAAAT
TTTACACGAC GCGTACGTGC GAGACTCGAA GTCGCTCCCG CGCGAAGCCT TGTTTGAGAC
GCGATTCGAC GATCTCGACG CCGATCCCGT GGGCACGTTG TCGAAAATTT ATAAACATTT
CGGATGGGAT GGATTCGACG AAACGGTCGC GCCGGTGTTG AAGGAATACG CGACGTCGCT
CGCCGACTTT AAAAAGAATA GCTTTGCCGA GCTCTCCGAC GACGCCAAGG AGGTGATCAA
CAGTCGCTGG GCGCGCTGGT TCACCGACTT GAACTACGAG AAACGATAGC GCTGTAGCGT
AGAAATAACA GTAGAAAGAA GATTGCTCAT TAAAG
 
Protein sequence
MATPTVIECR VRGAYWHVKN HPMSGVTLVG WARTLWAHGR SIDAVAFAPR LMFLTCMALA 
NTLAAIADGA LRPRWGRTKV RDDPVFVLGH PRTGTTHLHN ILAKDETRFA AATTFDVGFP
SGFLSSGFVK PYLAKMMDST RPMDNMALTM DTPQEDELAT NQLSGCASPY APLMFMRDEA
KFRKYYELRE DHDEYPIERA ELEAWKSAFM TFMTKLQYKH GEHKRLVLKS PVHAARVEVL
RKLFPRAQFV FISRHPYDVF RSAVNMADKY YWQCFLQRPT VADVQEFILK QGEILHDAYV
RDSKSLPREA LFETRFDDLD ADPVGTLSKI YKHFGWDGFD ETVAPVLKEY ATSLADFKKN
SFAELSDDAK EVINSRWARW FTDLNYEKR