Gene OSTLU_88696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_88696 
Symbol 
ID5004294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009365 
Strand
Start bp585015 
End bp586592 
Gene Length1578 bp 
Protein Length525 aa 
Translation table 
GC content59% 
IMG OID640419715 
Productpredicted protein 
Protein accessionXP_001420556 
Protein GI145352440 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACCG AAGGCGACGA ACCCGCTGTC GCCCCGGACG CGGCGGCGAG CGGTGAAAAG 
AGCGAGAAAC TTCTCAAGCG TGAAGCCGAA AAAGCCGCCA AGAAGGCGGC GAAGGACGCC
GCGAAAGCGT CCAAAGCGGC CGCCGCCGAA CAGCGCTCGC GGGGGCAAAA CGCCGTCGTG
AGCGTGCAGT GCTCGACGAC ACCGCCGGTC GAGCTCGAGG CGTGCAGCGG CACGCGCGAT
TGGTATCCCG AAGAGTTTCG TTTGCAGCGA TGGCTTTACG AAAAGTTTCG AGCCACTGCG
CGAGCGACCG GGTTCGAGGA ATACGACGCG CCCGTGCTCG AGAGGCAGGA GCTTTATAAA
CGCAAAGCCG GCGAGGAGAT TACGCAGCAA ATGTACGCGT TCGTGGACCA AGATGGGGTG
GAGGTGACGT TGCGACCCGA GATGACGCCG ACGCTCGCGC GCATGGTACT CGGTCGCGCG
CAGTCGATGA TGTTGCCTTT GAAGTGGTTT TCTATTCCGC AATGCTGGCG TTTCGAGACG
ACGCAGCGCG GTCGTAAGCG CGAGCACTAC CAGTGGAACA TGGATATCAT CGGGTGCAAG
TCTGTGAGCG CAGAAACCGA GCTGTTGTTC GCGGTGTGCG AGTTCTTCAA GTCGATCGGG
ATCACGTCCG CCGACGTCGG CATCAAGGTG AACTCGCGCA AGGTCATGGC GAGCGTGTTG
GATTCATACG GAATCACCGC GGAAAAGTTT GCGCCTGTGT GCATCGTGAT GGATAAGTTG
GACAAAATCG GCGCCGATGC CGTCAAGGCT GAGCTCGTGG ACACGCAAGG ATTACCCGCG
GAGACGGCTG CGAAAATCGT AGAGTGTTTG GCGTGCAAGA CGGTGAGCGA CCTCGAGGCG
CTCTGCGGCG AGGGTGCCGA TCAAACCGGC ATCGATGAGT TGAAGAGGCT TTTCGAGCTC
GCCGAAGATT ACGGCTACGG TGATTGGCTG ATTTTCGACG CATCCGTCGT GCGAGGTTTA
GCTTATTACA CCGGCATCGT CTTCGAGGGC TTCGACCGCG CCGGGGAGTT GCGCGCCATT
TGCGGTGGCG GTCGCTACGA TAGGTTGCTC TCTTTGTACG GTGCCGTGAC CGAGGTGCCG
GCGTGTGGTT TCGGCTTCGG CGATTGCGTC ATCGTGGAGT TGCTCAAAGA TAAAGGATTG
CTCCCTGAGC TTCCCAAGTC GATCGAGTTC GTCGTCGCCG CGTTCAACGA AGGCATGCAG
GGCAAGGCGA TGAAGGCGGC GTCGATGATT CGCGCCGGAG GTTCGGATGT GGATATGCTT
CTCGAGCCGA AGAAGAAAGT AGCGAGCACT TTTGATTACG CCAATCGTAT CGGCGCTCGA
TACATCGTCT TCGTCGCGCC GCAAGAGTGG GAAAACGACA TGGTGCGAAT CAAGGATTTG
CGCGCCGATT ACACGGACAA AGACGAAGAA AAGCAACTCG ACGTCAAACT TAGCGATCTC
GGTAGGGTGT CGGAAGTATT AGCGGCGCAC GCGGCGGCGA TCGGCGCTGC GAACAAAATG
GGCGGAATGG CCGTTTAG
 
Protein sequence
MTTEGDEPAV APDAAASGEK SEKLLKREAE KAAKKAAKDA AKASKAAAAE QRSRGQNAVV 
SVQCSTTPPV ELEACSGTRD WYPEEFRLQR WLYEKFRATA RATGFEEYDA PVLERQELYK
RKAGEEITQQ MYAFVDQDGV EVTLRPEMTP TLARMVLGRA QSMMLPLKWF SIPQCWRFET
TQRGRKREHY QWNMDIIGCK SVSAETELLF AVCEFFKSIG ITSADVGIKV NSRKVMASVL
DSYGITAEKF APVCIVMDKL DKIGADAVKA ELVDTQGLPA ETAAKIVECL ACKTVSDLEA
LCGEGADQTG IDELKRLFEL AEDYGYGDWL IFDASVVRGL AYYTGIVFEG FDRAGELRAI
CGGGRYDRLL SLYGAVTEVP ACGFGFGDCV IVELLKDKGL LPELPKSIEF VVAAFNEGMQ
GKAMKAASMI RAGGSDVDML LEPKKKVAST FDYANRIGAR YIVFVAPQEW ENDMVRIKDL
RADYTDKDEE KQLDVKLSDL GRVSEVLAAH AAAIGAANKM GGMAV