Gene OSTLU_41366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_41366 
Symbol 
ID5002465 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009360 
Strand
Start bp173195 
End bp174820 
Gene Length1626 bp 
Protein Length461 aa 
Translation table 
GC content63% 
IMG OID640417886 
Productpredicted protein 
Protein accessionXP_001418399 
Protein GI145347903 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.151662 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0721349 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGCG CGCTCGCGCG CGCGGCGCGC GCGATCGACG GCGAACGACG CGCGCGCGGC 
GTCGCGATCG ACGCGCGCGC GCGCGCGACG ACGACGACGC GAACGCGAAC GCGGGACGAC
GGGGACGGCG GACGCGCGCG CGCGCGCGCG CTCGCGACGA CGCCGTCGCG CGAGGACGAC
GGGCGGCGGG GAGAGATCGC GATCGATCGA TCGGGCCTGC GACGCGCGCT CGAGGGACGG
GCGCGCGCGG GCGACGCGCG GGAGGAGGGG ACGCGCGGCG GGTTGTTGGG ACACCTGGAG
CGCGCGATGA AGTTCGCGGG AGGGTCGATT CCCGTGAGTG AGTACGTGCG AGAGTGCTTG
ACGCATCCAG AGTATGGGTA TTACATGCGC GACGCGGACG TGTTCGGCAA GAAGGGGGAT
TTCGTGACGA GCCCGGAGAT TTCGCAGGTG TTCGGGGAGT TGATTGGCGT GTGGGCGGCG
CTGCAATACG AGGCGCTGGG TTCGCCGGAT ACGCTTCGAA TCGTGGAGTT CGGGCCGGGG
AGGGGGACGT TGATGGCGGA TTTACTGCGA GGGACGAGGA AATTTGCGAA ATTTCGCGAT
GCGGTGAGCG TGCACTTGAT CGAGGTGTCG CCGGCGCTTC GAAAGACGCA GGCGAAGACG
TTGCGGTGCG GGGAGTTAGA AACGACGGCG GCGGAGGGGA ACGCGAGATT TGTCGTCCCT
AAGAACGTGT TAGAGGACGA AACCGACGCT ACCGGTGACG CTGGTGGATC GTCGGGCGAC
GCTCCGGTGG GTGAGGCGCA CACGCGCGGG AAGTCTGAAA TCAACGACGC AGAAGTGTTT
TGGCACGACG GTTTGGAGAG CGTGCCGAGA GGGCCGACTT TGGTGATTTG TCACGAGTTC
TTCGACGCAT TACCGGTGCG ACAGTTCCAA CGCACCGAAA GAGGATGGTG CGAGAAGCTC
ATCACGATAG ATAGCGGTTT AGTCGCGGAG GATGGCGAAG ACGTTAGCGG GGAATCGAGC
GGCGCCTCGA GACGCGATTT AGAGATGGTT CTCTCTCCTG GGCCGACACC GGCGAGTCAC
ATGCTCGTGT CTCGCCGTCT CAAGGGAATA CCGAAAGAAA AAGCAGATAG TTTGCGACTG
ATCGAGCTTA GCCCACCGAG CATGACGCTC TGGGACGCCT TGGTGGATCG CATCGAGAAG
AATTCCGGCG CCGTGCTCGC GATCGATTAC GGCGAAGAGG GGCCGCTCGG GAACACGCTC
GAAGCGATTA AAGACCACAA GTTTGTGCAC GTGCTCGACA GCCCCGGAGA AGCCGATCTC
TCGGCGTACG TTGACTTCGG AGCTTTGCGT CAAATCGTCG AGGAAAAACC GCAGTCCGGA
GTCAAGTGTT ACGGCCCGGT GACGCAACAG CAGTTGCTGT TGAGTTTAGG CCTGGTGCCT
CGCCTTGAAA AGCTCGTGGA AAACGCGTCG AGCGAGGCAC AAGCGGATGA ATTGGTGCAA
GGATGCGAAC GTCTCGTGGG TGACGGCGCT GGAGATCCGG AATCGGGCGT CGCGCCGGGA
ATGGGGTCTA GATACAAAGC CATCGCCATG GTCTCTCGCG GATTGCCAAA GCCAGTGGGA
TTCTGA
 
Protein sequence
MRRALARAAR AIDGERRARG VAIDARARAT TTTRTRTRDD GDGGRARARA LATTPSREDD 
GRRGEIAIDR SGLRRALEGR ARAGDAREEG TRGGLLGHLE RAMKFAGGSI PVSEYVRECL
THPEYGYYMR DADVFGKKGD FVTSPEISQV FGELIGVWAA LQYEALGSPD TLRIVEFGPG
RGTLMADLLR GTRKFAKFRD AVSVHLIEVS PALRKTQAKT LRCGELETTA AEGNARFVSE
INDAEVFWHD GLESVPRGPT LVICHEFFDA LPVRQFQRTE RGWCEKLITI DSEKADSLRL
IELSPPSMTL WDALVDRIEK NSGAVLAIDY GEEGPLGNTL EAIKDHKFVH VLDSPGEADL
SAYVDFGALR QIVEEKPQSG VKCYGPVTQQ QLLLSLGLVP RLEKLVENAS SEAQADELVQ
GCERLVGDGA GDPESGVAPG MGSRYKAIAM VSRGLPKPVG F