Gene OSTLU_16823 details


Gene Information

Locus tag: OSTLU_16823
Symbol:
ID: 5003806
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 425003
End bp: 426430
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table:
GC content: 61%
IMG OID: 640419227
Product: predicted protein
Protein accession: XP_001419804
Protein GI: 145350840
COG category:
COG ID:
TIGRFAM ID:
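
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 425003
END_BP = 426430
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print("gene length matches record:", computed_gene_length == GENE_LENGTH_BP)
print("protein length matches record:", computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 426430 - 425003 + 1 = 1428 bp, and 1428 / 3 - 1 = 475 aa.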


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0199026
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTTGA GGGCGTTGGC GCGGGCGTTT CACGATTCCG CTCGCGCGCG ATTCGACGCG 
CCGGCGACTA CGTTGCTGCG CGCGGCGGCG GCGCGGATCG AACGCGAGCG GGAAGCTTCG
ACGGCGGCGA GGGACGTGAC GAGCGCGATT TGGGGCGTGG AGAACTATTG GCGGCGGCGG
GACGTGCGCG AGGACTCGGA GGATATGCGG GCGCTTCGAG AGTTTGCGCG CGTCGCGAGC
GCCGCGGCGG CGAGCGTCTT CGTCGAGTGG GATTCGGCCA AGGACAAGCG AGGGGCGGCG
ACGCTGTGTC GGGCGCTGAA GCACGTGATC GCGATTCATA AAATCACGGG CGAGCCCGTG
CCGATGGATT TCATCGCCTC GCTCGTGAAC GTGGCGTCGC ATAACGCGGT GGCATTTTCG
CCGATGCAAG TGTCGTTCAT CTTGCACGAC GTTGTCTCGG CGAAGGCGAC GGATATTTTA
ACGGAGCGAG TCATCGCATC ATTCTCACAC ATCATGCAAG TCGACGAATC GACGCCCTTT
GCGGCTGTTT CAGCGCTGTT GTGGAGCTAC GCGAAGATGG ACGCGTTGAA GAGCGGTTTG
GTGTCGACGA AGCACACGGA TGAGTTGCAC TTTGTCACTC GCGCAAAACT CGATCGTGGC
GAAAAAGTGG CTTCGCGCGA TATCGCGATG AATATGTACG CCGTCGCCAG GCTCGGTGCC
GAACACGTCG GTTTTGCCGA TGGCGATTAC CACGACGTGG CGTGCAAGAC GCTGGCGAAA
GAAATCGACA CCCTCAACCA ACGCGCTCTT CTAATGATTG CGTGGTCGTT GAACACCATG
CGACCGAGCG AGGATAACGT ATACATTCAT TCAACGTTTC TTGACGCGCT GGGCGGTGCG
GTTCAGCGGT CTGTGCACGC GTTCGCGCCT TTCGAGTTGG CGCCGACGAT GCACGCGCTC
ACGTCGCTTC GCGTGACGAA TCCAAAGCTC TTGGAACTCG CGCGAGACAG GTTTCGCGCC
GACGTTTCGG GTTACGCGGA GAAGCCTCAA AACCTCACGT TAATGCTTTG GTCGTTCGCG
GCGGCGGAGT ACGATATCGG TGAAGACACC TCGCGCATGG CCGCGTACGC GTACCTCGAC
GTCGCGCCGA TCGCTTCCGC GCTCGAGACG AAAACCATCC TTCAATCTCT GGCGCGTCTG
CACTTCGTCT TCGACGAAGA CGACGCGCGC GTGAAGAACA TATTAGATGA CGTCATCGAA
CGATATTTGG ACGAGTACTC CGAGGCAGAC TGCGAGGTGC TCGCGTGGAG TCTACTCGCC
CTTCGCGTTC CGGCGAGCGA GCGGTTGCTC GAGCGCGTCG GAGTCGAGTC CGTCGCCAAC
GACGCCGGTG ACGTCGAGTA CGTCGTGCAC AAACCAATCG ACGTGTGA
 
Protein sequence
MSLRALARAF HDSARARFDA PATTLLRAAA ARIEREREAS TAARDVTSAI WGVENYWRRR 
DVREDSEDMR ALREFARVAS AAAASVFVEW DSAKDKRGAA TLCRALKHVI AIHKITGEPV
PMDFIASLVN VASHNAVAFS PMQVSFILHD VVSAKATDIL TERVIASFSH IMQVDESTPF
AAVSALLWSY AKMDALKSGL VSTKHTDELH FVTRAKLDRG EKVASRDIAM NMYAVARLGA
EHVGFADGDY HDVACKTLAK EIDTLNQRAL LMIAWSLNTM RPSEDNVYIH STFLDALGGA
VQRSVHAFAP FELAPTMHAL TSLRVTNPKL LELARDRFRA DVSGYAEKPQ NLTLMLWSFA
AAEYDIGEDT SRMAAYAYLD VAPIASALET KTILQSLARL HFVFDEDDAR VKNILDDVIE
RYLDEYSEAD CEVLAWSLLA LRVPASERLL ERVGVESVAN DAGDVEYVVH KPIDV
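
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the standard genetic code, because the Translation table field of this record is empty, and it ignores the blanks that split the sequence into groups of ten. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tool.

```python
# Standard genetic code, listed in TCAG order (an assumption: the record
# leaves its "Translation table" field empty).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from this record.
first_line = "ATGAGTTTGA GGGCGTTGGC GCGGGCGTTT CACGATTCCG CTCGCGCGCG ATTCGACGCG"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and GC content.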