Gene OSTLU_29417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_29417 
Symbol 
ID5006740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009374 
Strand
Start bp199254 
End bp200790 
Gene Length1537 bp 
Protein Length405 aa 
Translation table 
GC content64% 
IMG OID640422161 
Productpredicted protein 
Protein accessionXP_001422517 
Protein GI145356603 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.024328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00343587 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
CGCGACGCGC GCGCGCGTCG TCGACGATGC GCGCGCGACT CGTCGCGTCG AGCGCGCGGT 
TGTGCGCGCG ACGCGCGGTG AGGGCGCGAC GGGAGTGCGC GCGGGCGTGC GCGACGACGC
CGGGGGCGAG AGTCGCGCCG CGACGGGCGT GGGGGACGCG AACGCGCGCG ACGTCGTCGC
GAGGGGGCGA CGGCGGACGG ACGACGACGA CGGTGGATCC GACGCGCGCG AGCGAGCGCG
GGGCGAAGCG GGCGACGATC GATCTGCAGC CGCCGAAGGG GACGCGAGAT TTCCCGCCGG
AGGAGATGCG ACAGCGGTCG TGGCTGTTTG GACACTTTCG AGAGTGCGCG AAGGTTTTCG
GGTTCGACGA GTTCGACGCG CCGGTGCTGG AGAGCGAGGA ACTGTTCACG AGGAAGGCTG
GGGAAGAGAT CACGACGCAG TTGTATAACT TTTCGGATAA GGGCGATCGC AGGGTGGCGC
TGAGGCCGGA GTTGACGCCG TCGTTCGCGC GGTTGATTTT GCAGCAAGGC AAGTCGTTGG
CGTTGCCGGC GAAGTGGTTC GCGATCGGGC AGTGCTGGAG ATACGAGCGC ATGACGCGAG
GAAGACGTCG GGAGCATTAT CAGTGGAATA TGGACATCGT CGGCGTGAGC GGGGTGGAGG
CGGAGGCGGA GTTGTTGGCG GCCATTACGA CGTTTTTCAA GAGGGTGGGG GTGACGAGCG
CCGACGTAGG CATCAAGGTG AGCTCGCGAA AGCTGTTGCA GGAGGTGTTG ACGCGGTTCG
GGATCGACAG CGAATCTTTC GCGCCCGTGT GTGTGGTGGT GGATAAGATT GAAAAGCTCC
CGCGCGAAAA GATTGAGGAA GAGCTCAGAG AGCTCGGCGT GAGCGACGAG GCGGTGGAGG
GCATCTTGGC GGCGACGTCG ATGCGCACGG TAGAAGAGCT CGAGGCCCTC ATCGGCCCGG
ACGCGGAGGC GGTGAAGGAC TTAAAGAAGC TTTTTGAGTA CGCCGATGCG TACGGCTACC
GAGATTGGCT CGTGTTCGAC GCGTGCGTCG TTCGCGGTTT GGCGTACTAC ACGGGCATCG
TCTTCGAAGG TTTCGATCGC GCGGGCGAAC TTCGCGCCAT CTGCGGTGGC GGGCGATACG
ACATGCTTCT TGGGGCGTTA GGCGGCGAGA ATCAACCCAT GGTCGGGTTC GGGTTCGGCG
ACGCCGTCAT CGTGGAGCTG CTCAAGGATA AGGGTTTGAT GCCCGACTTT TCCAAGGGCG
ACGTCCAAGA CTTGGTGTTC CCGCTCGGCG AGTCGCTGCG CCCGGCGGCG ATGCGCGTCG
CCGCCCAGCT TCGCGACGCC GGTCGCACCG TTGATCTCAT CCTCGAAGAC AAGAAAGCGA
AATGGGCGTT CAAGCAAGCC GAACGCGTCG GCGCCCAACG CGTCATTTTG CTCGGCGAAA
AGGAATGGGA AGCGGGGAAC GTTCGCGTCA AAGACTTAGC CAGCCGCGAA GAGGTCGACG
TCAAATTGGA AGATCTCAAA TAATTAATAA ATGATGC
 
Protein sequence
MRQRSWLFGH FRECAKVFGF DEFDAPVLES EELFTRKAGE EITTQLYNFS DKGDRRVALR 
PELTPSFARL ILQQGKSLAL PAKWFAIGQC WRYERMTRGR RREHYQWNMD IVGVSGVEAE
AELLAAITTF FKRVGVTSAD VGIKVSSRKL LQEVLTRFGI DSESFAPVCV VVDKIEKLPR
EKIEEELREL GVSDEAVEGI LAATSMRTVE ELEALIGPDA EAVKDLKKLF EYADAYGYRD
WLVFDACVVR GLAYYTGIVF EGFDRAGELR AICGGGRYDM LLGALGGENQ PMVGFGFGDA
VIVELLKDKG LMPDFSKGDV QDLVFPLGES LRPAAMRVAA QLRDAGRTVD LILEDKKAKW
AFKQAERVGA QRVILLGEKE WEAGNVRVKD LASREEVDVK LEDLK