Gene OSTLU_15756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_15756 
Symbol 
ID5002311 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp323060 
End bp324445 
Gene Length1386 bp 
Protein Length461 aa 
Translation table 
GC content60% 
IMG OID640417732 
Productpredicted protein 
Protein accessionXP_001418216 
Protein GI145347529 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.417195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.162276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACGG TGGCGAGAAA AACGTCGGTG TGGTCGAACG CGCTCGCGGC GAGCGCGACG 
GTCGCGGACG CGACGACGGC GAGGGACGAC GCGCGCGCGA AGGGTGCGAG TTTTGAGGCC
CGAGGGGAGG ACGAGGAGGC GCGCGTGCGA CGGGCGGCGA GCGAATTGCG GGTGAAGTTG
CTCGATGAGA TCGTCAGAGG GGCGGCGAAG CGCGAGCCGA GCGATGAAGA CGGGGTGAGG
GGGTCGGAGG ATGCGGACGC GCGACGAGCG AGACGCGCGG GGCGAGACGC GTTTCTCGCG
GCGACGGCGA AGGATGGAGA CGAAGACGAG GACGCGATCG CGCGGATGCG CGCGGCGGTG
AATTTCGTCA TGGGATTTTG CGCGGAGATT TTGCAACCGA GCGATGACGG CGGTTCGCCG
GCGGTGATGC CGCCGCTCGG GACGCACGTC ATTCTACGCG CCGACGTCGA GGCTGAAGGC
GAGCGAAGCG TGTACTTTCA AGAGCAATGT TCTTACGATT TCGACGAAAT ACGCACCGCG
GATCACTTCA TCGACTATGA ACGACAGTTT GTTCACGCGA GCGCCAAGGT TCAAACCGCG
CCGCGGGTGT CAACATCCGG GTCTGTGATC GACGTCCACT TCGATTTGCC GAAGCCGAGC
GACAGTTCGA CCACGAACAA TATTCGAGAA GGGAACTTTG TCGAAGTGCG CGGATGCTCT
GGCCACCCGG ATTACTTGGG ACGGCGAGGC GAATGCAACA ACAGTGGGGA AGTGTTTGAG
TTCAATTTAC CGTCGCTTCG CATCTCTGGT CTCGGTCGAG CGTACCCCGT CGCCGAAATC
GTAGGCGAAG TGACTGTTCG AGGTTGTACG TCGGGATTGA AAACGTCGCT AAAGTTTCGA
CCGTTCGACG TGGAAACGTC GCCGTCGTTT CGAAACATCG TCAGCGGAAC CATGGTTCGA
AATAACACCG AGGTCAAACG ATTGATTCTC GGCACTTGGG ACACGCGGGT GTTGCTCGGC
TGCATTTGCG AGGATCGTGA CTCGCTCGAC GACGTCTTGC ACGCCGCGAG AGATTTCAAA
GTCCCTCCGT TGACGACGGC GACGGGAGTC CGAAGTTTAC TCCAAGCTCT CGAATCACCC
GGAAGTATGT CCAACAAAAG ACTATGGCAA ACCATCGTGG AGGCGCTGCG CGTGGCGAAT
TTACACGAGC CTGAACGCTC CGTCGTGCAA GAAGTGCTCG GCGAACCGGC GACGGCGACG
CGCGAGAAAA TTGAAGAGGG CTCGACGATG GCTTATGAGC TCGCGGTCAT AGTGGCGAAC
TCGCCTCCGC CGGATGCACC GCCGCCTTTA CCCAGATACT GGTCTCCGCG AACGAAAAAG
AATTAA
 
Protein sequence
MKTVARKTSV WSNALAASAT VADATTARDD ARAKGASFEA RGEDEEARVR RAASELRVKL 
LDEIVRGAAK REPSDEDGVR GSEDADARRA RRAGRDAFLA ATAKDGDEDE DAIARMRAAV
NFVMGFCAEI LQPSDDGGSP AVMPPLGTHV ILRADVEAEG ERSVYFQEQC SYDFDEIRTA
DHFIDYERQF VHASAKVQTA PRVSTSGSVI DVHFDLPKPS DSSTTNNIRE GNFVEVRGCS
GHPDYLGRRG ECNNSGEVFE FNLPSLRISG LGRAYPVAEI VGEVTVRGCT SGLKTSLKFR
PFDVETSPSF RNIVSGTMVR NNTEVKRLIL GTWDTRVLLG CICEDRDSLD DVLHAARDFK
VPPLTTATGV RSLLQALESP GSMSNKRLWQ TIVEALRVAN LHEPERSVVQ EVLGEPATAT
REKIEEGSTM AYELAVIVAN SPPPDAPPPL PRYWSPRTKK N