Gene OSTLU_19051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_19051 
Symbol 
ID5006774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp333632 
End bp334918 
Gene Length1287 bp 
Protein Length428 aa 
Translation table 
GC content64% 
IMG OID640422195 
Productpredicted protein 
Protein accessionXP_001422555 
Protein GI145356680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.570106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00721554 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGCCGCG CGCTGGACGA CGACGATGAT CTCGGAGGCG CGCGTCGACG ACGCGACGGC 
GCGCGAGCGC GCGCGGGGGC GCGAACGAGC GCGGCGAAGG CGGATGGGGA CGGTTGCGCC
GCGCGCGGTG CGTCGGCGGC GGCGCGCGAG GAGAACGGCG GCGCGGCGGC GCCGGGCGAG
GGCGAGGGCG CGGGAGGGGC GATGCGCGTC GCGCGCGAGT GGGTCGGGGG TGATGGCGCG
AAGGCGCGCG CGAGGGGCGT GGGGAGCGGG GGCGCGGCGA GGCGGGCGGC GCGCGGGCTA
AATATAGCGA CGACGACCGC GGTCGGGATG TACGTGTATT GGATACTCAT GGGGACGGTG
GCTTGGATCT CGCACTTGGG CGAACGCGCG CCGTGGTTGA GGTTCGCGCT ACCGGCGGAA
GGATATCCGG GAAACGCGCC GTCGAGCGCG GGGGCGCTCG GATTCGTCGC GAGAAACGTG
CCGCTGAGTC TCATCTTTGT CGTGCCGCAT TCAGTGTTTT TGCCGAGTCG ATTGCGGAAG
ATTTTCGGAA AACATGGGCG TTTGATGTAT AATTTCGTGT CTGCGGCGAC GTTGCACTTC
TTCTTGTTGA ATTTTACGCC GTTGAAGACG CCGGTGGTGA TGACGATTCC ATTCAACACG
AATTTTCACA ACGCGCTCTC GATTGGATGT CTCGCTTACG CGTCGTATGC GTTCCTCAGC
TCACCCGCAA CGTTAGGTTT ATTAGGTGTG AGCTCGGCGC TCGAGCTTCG CGACAGTAAA
TACTCCAATC CAGCGGCTGG CATGGACGCC ATCACGTGGA TGGGCGTGAC GACGTGGCGG
CTCGGCGGCG CCTCCGCCTT TGTATTGTTC ACCGGTCTAT CCATCATCCC GCGCGAGCTC
ACGTTGGGTG ACTGCATCAC GAGATGTGTT GCTGCGGTGT ATTTACGTCA GCGCTCGCGA
TCGTTCCGCG AGTGGGTGGA AAAGATCGAG GGCGTCCACC TCTTGACGTG GATTTTACGA
GGCACGCTCT TGTCCTTCGC GTGTCACGGC GCCTTGCAAG GCGGTGGAAA CGTTCGCACA
GTGGGATGGA TTTTATTCGG CGCCGCGAGT TTAGCGGGAA TTCTGCGTCT CGCCGAGTCA
GAGAGTCCGA ACGCTAAGAA GAAACCCATT TCTCGCGCCG CGTCCTTTGA CGTCGCCGAA
GCCGCGCCCG CGGCGCTCGA CGACGTCGCC CGAGTCAATC GCCCGGAGCG CTGGCACGCG
TGGAACCCGC ACGCTCGCAT GACGTGA
 
Protein sequence
MRRALDDDDD LGGARRRRDG ARARAGARTS AAKADGDGCA ARGASAAARE ENGGAAAPGE 
GEGAGGAMRV AREWVGGDGA KARARGVGSG GAARRAARGL NIATTTAVGM YVYWILMGTV
AWISHLGERA PWLRFALPAE GYPGNAPSSA GALGFVARNV PLSLIFVVPH SVFLPSRLRK
IFGKHGRLMY NFVSAATLHF FLLNFTPLKT PVVMTIPFNT NFHNALSIGC LAYASYAFLS
SPATLGLLGV SSALELRDSK YSNPAAGMDA ITWMGVTTWR LGGASAFVLF TGLSIIPREL
TLGDCITRCV AAVYLRQRSR SFREWVEKIE GVHLLTWILR GTLLSFACHG ALQGGGNVRT
VGWILFGAAS LAGILRLAES ESPNAKKKPI SRAASFDVAE AAPAALDDVA RVNRPERWHA
WNPHARMT