Gene OSTLU_28159 details

Gene Information

Locus tag: OSTLU_28159
Symbol:
ID: 5006083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009370
Strand:
Start bp: 344243
End bp: 345656
Gene Length: 1414 bp
Protein Length: 355 aa
Translation table:
GC content: 66%
IMG OID: 640421504
Product: predicted protein
Protein accession: XP_001422043
Protein GI: 145355593
COG category:
COG ID:
TIGRFAM ID:
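
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which the record does not state explicitly, and the helper names `gene_length` and `coding_bases` are hypothetical. Under that assumption the coordinates reproduce the reported gene length, and a 355 aa protein plus one stop codon accounts for 1068 of the 1414 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Number of bases needed to encode a protein (3 per residue, plus a stop)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
length = gene_length(344243, 345656)   # 1414, matches "Gene Length"
cds = coding_bases(355)                # 1068 bases for 355 aa + stop codon
non_coding = length - cds              # 346 bases not accounted for by the protein

print(length, cds, non_coding)
```

The 346 bp remainder is an inference from these numbers; the record itself does not label any part of the gene sequence as untranslated.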


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0300614
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGCGCGCG CGCGAGCGTC GACGCGCGGA GATGTCGAGC GAGGACCCCG CGACGGGGCT 
GATGGCGCGC GGCGCGTCGA CGGCGTACGG GACGGAGTCG CGCGCGCGGG AAGCGAATGG
TGTTGAGGAC GAACGCGTCG CCGAGGTGCG CCGCGCGAGG CGAATCGGCG CGCGAACGCG
CGGCTCGGGG ATCGCGGTCG CGGTGGCGGT CGCGCTGGCG TGCGCGCTCG CGGTCGCGGC
GGTGCGAGAG GGCGCGGCGG GGCGATCGAG GGCGACGGCG AGGCTGGGCG GGACGGCGAA
CGAGCGAGGG AACTTTTTCG GAGCGCCGTT GAGCGCGGAA GCGCGGATGC GGGCGCCGGC
GAAGGCGGAT GTGATAAATA GGATATCGCT GCGCACGCGC GTGCCGGGAC TGCGGTCGCC
GGGACGCGGC GTGCCGGGGC TGCGGACGCA CGTGCGAGAG ACGCAGCGGG CGAGCGACGA
CGCGCGGATA TTAATCCTCA CCAAGCCGTT CGAGTGGGGG ATGACGTATT TGCAGATTCA
GAGCGTTAAA CGATGGGCAC CGGCGTTGTT GCCGCACGTC ACGGTGCTGA CGTACGACAC
GGAGACGCAG CGATCGTGCG AGGCGCATCG CGGCGTGGAT TGCTTTTACG ACGCCGACTT
CGCGCGGGCG TACGGGCAAG ATCCGAACGA TGGCGTCGCG CGCGACGCCA TGTCGTGGCG
CAAGGTACAC GCGGCCTTAG AATTACTGAA AGCGCGCATC CCAGTGGTGA TGCTCGATTC
GGACACGGTG TTTTTGTCCG ATCCAACCGA AGCGTGGACG AGCGCGCTCG AAAAGTACGA
CGTCGTCGTG AGCTCAGACG TCGGTAACGA GTTCGAGGCG CAAGGAAACA TGAACACGAA
GCTCGTCATT TTCCCAGCCA CGACGCGGAG CGTGAGTTTG TGCGAGCGAT GGCTCGAGGG
GGAGAGTCGA TTAGTATCCA AGGTGAACTA CGGCGAATTT CCGGAGCAGA GTTACTTCAA
CTACGTTCTC GTGCCGACGA CGGCGGGAGA GTTCCACATC CACGCCATGA GCACCGCAGA
ATCGGGAAAT TTCATCACCG CCAACGCGGG AGACGATGGT ACGTTCCCAG GCGCGCACAC
CGTCACGGCG TCGTATTGCG GAGACGCGCG CGACAAGGAA CAATTCCTCC AGCACGTCCT
CGAAACCAAG CACAACGCCG AAGCCGCGCT CGGAATCGGG CCCGACGCCA CGACCGCCAT
TCCATTACGA TCCGCCACCG ATTTCGACCA CGACGGCGTG GCCGACCTCG CGCGGTTCCC
GCACCCCGAC CTTCGATGCG ACCACGTAAA GCGTCGCCTC GTGGACCAGC GCCGATACGA
GGTCACCTCC GACCGGCACA TCGTGTGGAC GTAG
 
Protein sequence
MRAPAKADVI NRISLRTRVP GLRSPGRGVP GLRTHVRETQ RASDDARILI LTKPFEWGMT 
YLQIQSVKRW APALLPHVTV LTYDTETQRS CEAHRGVDCF YDADFARAYG QDPNDGVARD
AMSWRKVHAA LELLKARIPV VMLDSDTVFL SDPTEAWTSA LEKYDVVVSS DVGNEFEAQG
NMNTKLVIFP ATTRSVSLCE RWLEGESRLV SKVNYGEFPE QSYFNYVLVP TTAGEFHIHA
MSTAESGNFI TANAGDDGTF PGAHTVTASY CGDARDKEQF LQHVLETKHN AEAALGIGPD
ATTAIPLRSA TDFDHDGVAD LARFPHPDLR CDHVKRRLVD QRRYEVTSDR HIVWT
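
The relation between the two sequences above can be illustrated with a minimal translation sketch. Because the Translation table field is empty in this record, the standard genetic code is assumed here. The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tooling. The fragment used is the stretch of the gene sequence that begins at the first ATG matching the start of the protein sequence.

```python
# Standard genetic code (an assumption: the record leaves "Translation table" empty).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 36 coding bases of the gene sequence, starting at the ATG.
fragment = "ATGCGGGCGC CGGCGAAGGC GGATGTGATA AATAGG"
peptide = translate(fragment)
print(peptide)  # MRAPAKADVINR, the first 12 residues of the protein sequence
```

Passing the full gene sequence from the ATG onward to `translate` should reproduce the listed protein, and `gc_percent` applied to the whole gene sequence can be compared against the reported GC content.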