Gene OSTLU_3784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_3784 
Symbol 
ID5000808 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp734446 
End bp735618 
Gene Length1173 bp 
Protein Length342 aa 
Translation table 
GC content59% 
IMG OID640416229 
Productpredicted protein 
Protein accessionXP_001417038 
Protein GI145345053 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.461536 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.270485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CGAAGATACG TCGCCGTGTA CGCGCTCGGA ACGTTCGGAG ATTGGATCCA AGGGGCGTAC 
CTGTACGCCG CCTACCGCCG ACACGGACTG GTGAAACGAG AGATAGGATA CATATACGTG
CTCGGGTACG TCGTGAGCGC GACGATCGGG ACGACGTGCG CCGCGCTCGG GGACACGCGC
GGGCACCGAG CGCTCGCGGT GGCGTACGGG ACGCTGTACG CGGCGAGCTG TTTGTTGCTG
CGGTCGAGCG CGATGACGAC GCTGATCGCG AGCCGGATAC TCGGGGGGAT TGCGTATTCG
TTGTTGTTTA CAAACTTTGA GTCGTGGGTG ATCACGGAGG CGGACGCGAT GGGGATCGAT
AGGAAAAAGT TGGCGGGAGT GTTTAGCGTG GCGACGTTGT TCAACGGGGC GAGCGCGGTG
CTGGCGGGAT TGGTGGGGAA TTTTGTCGTT GAATTCGCTG AATCGAGCCA GTTCTCGTGG
ATTGGAATGG ACGAGGTTAG GCTCGAGATG GGGGCGGAGG CGGATACGTC AGGAAGCGTG
GTGATGATGT CGAAGAACGT GTACGCGCCG GCGTTTGACG TCGGCGCCTT GTCGCTGTTG
CTGTGCGCCG CGGGGGCGAA GTTTTTATGG AGTAATCGGA CGAGCGCGGC AAATTTGGGG
GCGCCAGGGT CATCCGCGGT CGATGATACG CAAGTCGGCG CGGTGAGCAT CTCTAGTGCG
GTGAGAATGA TTATGAGCTC GGTGGAGTTG TTTCGCCTCG GTGCGGCGAA TTCTCTCTAC
GAAGGCGCGT TGCATCTCTT TGTCTTCGTA TGGACGCCAG TTCTAGAGAA AAGGTCGGCG
ATAGACGCCA CGGTGCCATA CGGATCCGTG TTCAGCGCGT TCATGGTGTG TAAAATGTTC
GGAAGCCAGG CGTTCAAGGT GCTGGAGGCT AGGATTCCCG CCGAGAATCT TCTGCGGATG
GTTCTCGTGG GCTCCGCCGT CAGCTTTTCC ATCGCCGTGT TGTTCACGGG GTATTGGGTC
ACACTCGCCG CGTTTTGCGC GTTTGAGTTC GGTTTGGGGA TTTATTGGCC CGTGATGTCC
ATACTACGGG CAAAGTACGT GCCGAACAAG ATGCGAGCGA CCATGACGAG CGCTTTCCGG
ATCCCGCTCA ACATCTTGGT CGTCGCGTTG TTG
 
Protein sequence
RRYVAVYALG TFGDWIQGAY LYAAYRRHGL VKREIGYIYV LGYVVSATIG TTCAALGDTR 
GHRALAVAYG TLYAASCLLL RSSAMTTLIA SRILGGIAYS LLFTNFESWV ITEADAMGID
RKKLAGVFSV ATLFNGASAV LAGLVGNFVV EFAESSQFSW IGMDEVRLEM GAEADTSGSV
VMMSKNVISS AVRMIMSSVE LFRLGAANSL YEGALHLFVF VWTPVLEKRS AIDATVPYGS
VFSAFMVCKM FGSQAFKVLE ARIPAENLLR MVLVGSAVSF SIAVLFTGYW VTLAAFCAFE
FGLGIYWPVM SILRAKYVPN KMRATMTSAF RIPLNILVVA LL