Gene OSTLU_33550 details


Gene Information

Locus tag: OSTLU_33550
Symbol:
ID: 5003792
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009363
Strand:
Start bp: 395519
End bp: 397764
Gene Length: 2246 bp
Protein Length: 718 aa
Translation table:
GC content: 60%
IMG OID: 640419213
Product: predicted protein
Protein accession: XP_001419797
Protein GI: 145350824
COG category:
COG ID:
TIGRFAM ID:
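
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with one stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 395519
END_BP = 397764
PROTEIN_LENGTH_AA = 718


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally plus a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(START_BP, END_BP)        # genomic span of the gene
cds = coding_length(PROTEIN_LENGTH_AA)      # coding bases, stop codon included
non_coding = span - cds                     # bases not accounted for by the protein

print(span, cds, non_coding)
```

Under these assumptions the span agrees with the reported Gene Length of 2246 bp, and the bases left over after subtracting the coding length are consistent with the record describing the gene as spliced.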


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.542187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000796544
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATGTCG CCGCCCTCGA CCGCGCGCTC GACGATGTCT CCACGTCCCT CGAAGCCTTC 
GCCGAGGTGC GTTTCGCGTC GTCTTTCGAA CCAACTCGAA CCAACGAACT TTGTCGAGAC
GACGCGCGGT GACTGACTCG CATCGACGCG CGCAGCACGG CGCGAGGCGC GCGCGCGACG
TCGCGGTGTC ACTCTGCGCC GAAGCCGCGC GCGGAGGGGT GGAGGACGCG ATCGCGGGCG
CGCAGAAGGT GGTTGACGCG CTGGAGCGAC GAAGGGCGGA GAGCGCGGCG CGCGCCGAGC
TTCGGCGAGC GAGCGCGGAG GGCGCGGCGC GCACACTGCG GGAGACGGTG AAGAAACACG
CGGTAGAACG CGAAAGTGTG TGCGAGGACA TGGCGAGGGC GTCGTTAATG ACGGAACGCG
TGTTGGAACG GTTGGAGTCG CCGGCGAACG CGGAAGCTGC GAGCGCGAGT CGACGCGCCG
ACGCGTTGGG GACGTTGGCT GATGTTTTGG AGTGCGTGTG TGCGTTTTCT AAGGGTGGCG
AAGCGGTCGC GAAGGCGTGC GATGGACTGT TTGCGGACGC GGCGCGACGA GGCGAAGCGG
GACGCTTGGC GAAAAGATCA CTCTCCGTCG TCGAACGCGC GCAGTCGTGT CGCATCGCGG
ACGAGGACGC GGGCGCGGAG CGATTAGCGC TCAAGGACGC CTCGACGGCG TTGCAAAGGT
ACTGTGAAGA CGTTGAAAAC GCACTTTTGG ACAAATTTGC CGCGGCGTGC GCGAATAAGT
CGTACGCAGC AGCGCGAGCC GACGCTGAGG CGTTATTTGA GTTCAACGGC GGACACAGCG
TCGTCTCGCG GTACGTCGCG AGCCGTGAAA TGTTCATCTC TGCGAACGCG ATGGAAGAGA
TTCAACGCCT TCGTAACGTC GTCGAGCGAT CGGTGATTCT CGGCGAGGAT GAATCTTTGT
GCACCGAGTG CGTTCGCACA TTTTTTGCGC AAATTCAAAG CGCGGTGAAA AGGGAATTTC
AATCGATTAC CGCCGCGTTC GATTCGCGCG CTATCGTCGT TCTTAACACT CTCGTGCATC
GGATCGCGGA GCAGCGAATC GGGGCGTACG TTGAGACTTT TCTCACCACC GAATTGCGCA
CCGCAGCGTC GCTTCGGCAG CGCCTGGCAC TCACGGCGCT GTCGTTGCGC GAGATTGAGA
TTTTGAATCG CGCGATACGT GAAATGACGA ACGACGACGC AGACGTTGAG GCGATCGATC
CGGACGTGTT TTTTGGCATC GGGTGTGAGA CGCTCGTGGA CGATGAGTGC GCTTGTTTGG
ACTCGGTGCC CAGCGATGAC GACGTCTTGC CGCAGCGCGA CGCGACTGAG TTCTCTCTGG
ATGATTTATG TGCTCGTTAC AAAGAAGCGC TTCTTCGAGT CGAATCGTGC GTGCCTGGGA
CTTCGATTGA AAGCGTGCGA GCACGGCTCG CCGAGGCATT CCTGGGCCGC GTCACCCGTC
TCGCGCAGAG GCATCTTCGC GAATCGATTG CGGCGACAAA GTCGGCGAGC GCGCGTCTCA
ACGCCTGGAC GACGCGTGAA GAAGCACGCG CCGATGTGTT CGAGCCCGTG TTGCTCGCTT
CCAGGAACAT GAACGCGTGG CTCGCACGTG CGCGCGATGC GATCGATACG ATGAACTCGG
AGCTCTCTCC TGGCACAGTG CGGCAAATGT TTGCCGTCGC GCAAAGCCGA CTCGCCGAGG
AAATGTCGGA GGTCTTAGAG GCCATGACGA CGCGCGGCAT GACGCTAGTG GACGCTAAAT
TCCGTGGTAC GCAGCGAGCG GTAGACTTCA ATGACGAATC ATCGTTTGCC GAGCGTGAGA
CAGAGGCGTG CGTAGCCGTG GTTGATGCTC TCGACGGTTT GGCGAAGACG ACGCTCGAAT
GCTTGGATTC GGCAAACGCG GCGACGCTCT TGAATGAAAT CGGCGCTCAA TTTTATTCCT
TGCTCTTCAG ACACGTGTGC AGATATACGT ATACCATAGT TGGCGCTATG CAACTCAAGC
TCGACGTCAA CGCGTACGTT TCATGGGTAC GAAAGACGAT GACAGCTCGC GAGAGCATCG
AGCGATTCGA AGCTCTGTCG ACTCGCCTGA ATTTGCTCGT CGTACCGGAA GATGCGCTTG
ATGATTTCGT GTATGAACTC GAAGCTCACG CTCACGAGAA TATCGCGGAA ATTAGAATGC
TACTCAGGCT TAGAAAGACA ACCTAG
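
The gene sequence above is printed in space-separated groups of ten bases, so it needs to be cleaned before any length or GC calculation. The following is a minimal sketch, not the method used to produce the GC content field; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the sequence are used as a short demonstration.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a sequence printed in groups of ten."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence above, used only as a short demo.
first_line = "ATGGATGTCG CCGCCCTCGA CCGCGCGCTC GACGATGTCT CCACGTCCCT CGAAGCCTTC "
last_line = "TACTCAGGCT TAGAAAGACA ACCTAG"

demo = clean_sequence(first_line + "\n" + last_line)
print(len(demo), demo[:3], demo[-3:])
print(round(gc_percent(demo), 1))
```

Passing the complete sequence block to `clean_sequence` would be expected to give a string whose length matches the reported Gene Length, and `gc_percent` on that string can be compared against the reported GC content.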
 
Protein sequence
MDVAALDRAL DDVSTSLEAF AEHGARRARD VAVSLCAEAA RGGVEDAIAG AQKVVDALER 
RRAESAARAE LRRASAEGAA RTLRETVKKH AVERESVCED MARASLMTER VLERLESPAN
AEAASASRRA DALGTLADVL ECVCAFSKGG EAVAKACDGL FADAARRGEA GRLAKRSLSV
VERAQSCRIA DEDAGAERLA LKDASTALQR YCEDVENALL DKFAAACANK SYAAARADAE
ALFEFNGGHS VVSRYVASRE MFISANAMEE IQRLRNVVER SVILGEDESL CTECVRTFFA
QIQSAVKREF QSITAAFDSR AIVVLNTLVH RIAEQRIGAY VETFLTTELR TAASLRQRLA
LTALSLREIE ILNRAIREMT NDDADVEAID PDVFFGIGCE TLVDDECACL DSVPSDDDVL
PQRDATEFSL DDLCARYKEA LLRVESCVPG TSIESVRARL AEAFLGRVTR LAQRHLRESI
AATKSASARL NAWTTREEAR ADVFEPVLLA SRNMNAWLAR ARDAIDTMNS ELSPGTVRQM
FAVAQSRLAE EMSEVLEAMT TRGMTLVDAK FRGTQRAVDF NDESSFAERE TEACVAVVDA
LDGLAKTTLE CLDSANAATL LNEIGAQFYS LLFRHVCRYT YTIVGAMQLK LDVNAYVSWV
RKTMTARESI ERFEALSTRL NLLVVPEDAL DDFVYELEAH AHENIAEIRM LLRLRKTT
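
As a consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code, since the Translation table field in this record is empty, and it translates only the first 60 bases because the gene is marked as spliced and a straight translation of the full genomic sequence would not be expected to reproduce the protein. `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, ignoring a trailing partial codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and first two groups of the protein sequence.
gene_start = "ATGGATGTCG CCGCCCTCGA CCGCGCGCTC GACGATGTCT CCACGTCCCT CGAAGCCTTC"
protein_start = "MDVAALDRAL DDVSTSLEAF".replace(" ", "")

print(translate(gene_start) == protein_start)
```

A mismatch at this point would suggest a different reading frame, strand, or genetic code than the one assumed here.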