Gene OSTLU_4215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_4215 
Symbol 
ID5004300 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp16477 
End bp17610 
Gene Length1134 bp 
Protein Length357 aa 
Translation table 
GC content58% 
IMG OID640419721 
Productpredicted protein 
Protein accessionXP_001420235 
Protein GI145351767 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0564] Pseudouridylate synthases, 23S RNA-specific 
TIGRFAM ID[TIGR00005] pseudouridine synthase, RluA family 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGGTACTGCG CGCCGTACGA CTTTGACTTT GTGTGCGGCG TCAAGGCGCG GTGGGCGGGA 
CGGAGGCTGT GCGATTTATT CGCGAGTGAG TTTCCGATGC GACCGAAAGA GTATTACGTG
AAAGCACACG CGATGGGACG GTTGTGCGTG GAGGCGAACG GGTGCGCGCG GACGAACGGG
GAGGAGACGA CGACGAAGAC GACGAACGAG GCGCGAGAAC ACGAAGACGG GCCGATGCTC
GCCGGAGGAA ACCGCGTGCG ACACTTCATA CACAGGCATG AACCTCCCGT GATGGCGGAT
GAAGTGCGCG TGTTGACTGT CAATGATGAC GTCGTATCGG TGTGGAAACC GGCGACGGTG
CCCGTACACC CCACGGGGCA ATACAGGCGA AACACGGTGC TCGCTTTGCT GGCGGCGTCG
CGTAGGGATT TGGGACGGTT ATTTCCCATT CACAGGCTCG ATAAGAACGT GTCGGGGTTA
CTCTTGCTTG CACGTTCGTC AGAAGCCGCG AATGAGATGC GAGTGAAGAT GGAGGCGCGA
GAGATGCGGA AGGAGTACGT CGCGCGCGTG CGAGGAGCGT TCAACGACGG TGACGCGACG
CCGGTGTCAA ACGTAGAGAG TTTAGGATTC GATAGCAAAG GCCGTGTGGC GATTTGGCGC
GGAAAAAAAG GTGTCACCGA TCTCGACGAG CGGACTTTGA AGTCATTCAA GGATGCGTCT
ACGAAATTTA CGTGTATTAA GACGCTCGTT GACGGGACTT CGCTCGTGCG ATGCGAACCC
TTCACCGGAC GCTCGCATCA AATACGAGCG CACTTAGCCA TGCTCGGGTA TCCAATCGCA
AATGATGTCG CGTATGGGGG TGCATTGGTC GAAGTTGAGC GCGCTCGAGC GATTCAGCAC
GCGTGCGACG TCACCGTGCT GAACGAAGCG GGCGAATTAG TCAAAGACGA GTCGCTCGCC
ATCGATTACT CCGCTCCCCG CAAGAGCCGG GAGCAGAGCT CCGATTTGTG TCCACACTGC
CCGAGAATCG TGCACGCTGG AGACCAGGCA ATCGACCTAG AGGCGATATG GTTACACTGC
GTGCGATATA GCGGCGAAGG GTGGGAGTTT GAGTGTCCGA TGCCAGACTG GGCT
 
Protein sequence
RYCAPYDFDF VCGVKARWAG RRLCDLFASE FPMRPKEYYV KAHAMGRLCA REHEDGPMLA 
GGNRVRHFIH RHEPPVMADE VRVLTVNDDV VSVWKPATVP VHPTGQYRRN TVLALLAASR
RDLGRLFPIH RLDKNVSGLL LLARSSEAAN EMRVKMEARE MRKEYVARVR GAFNDGDATP
VSNVESLGFD SKGRVAIWRG KKGVTDLDER TLKSFKDAST KFTCIKTLVD GTSLVRCEPF
TGRSHQIRAH LAMLGYPIAN DVAYGGALVE VERARAIQHA CDVTVLNEAG ELVKDESLAI
DYSAPRKSRE QSSDLCPHCP RIVHAGDQAI DLEAIWLHCV RYSGEGWEFE CPMPDWA