Gene OSTLU_42734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42734 
Symbol 
ID5003081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp295141 
End bp296454 
Gene Length1314 bp 
Protein Length437 aa 
Translation table 
GC content66% 
IMG OID640418502 
Productpredicted protein 
Protein accessionXP_001419335 
Protein GI145349841 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0343] Queuine/archaeosine tRNA-ribosyltransferase 
TIGRFAM ID[TIGR00430] tRNA-guanine transglycosylase, queuosine-34-forming
[TIGR00449] tRNA-guanine transglycosylases, various specificities 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0479124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCG CGCGCGTCGC GCGCGCGCGC GGCGTCCTCG ACCGCGTCGC GCGCGCGTCG 
CGATGCGCGC GCGCGCGCGA CGCGTCGGCG TCGGCGCGTT CGAGCGCGTC GACCGACGCG
CGTCTCGAGC GGGCGCCGGC GGCGTCGCCG GCGTCGCCGC GCTTCCGGTT CGAGGTCCTG
CGCGCGTCGA CGCGCTCGAA CGCGCGGGCG GGGATGATTC ACACGCCGCG CGGGACGATC
GCGACGCCGG GGTACGTCGC GGTCGGGACG AACGCGGCGA TGAAGGCGGT GCGGGGCGAC
GCGCTGCGGC GCGCCGGGAT CGATTTGATG TTCGCGAACA CGTATCACCT GATGCTGCAG
CCGGGCGCGG CGACGGTGGA GGCGATGGGA GGGATACACG CGTTCATGGG ACGCCGAGGG
CCGGTGATCA CGGACAGCGG TGGGTTTCAG GTGTTTTCGC TCGGGACGCC GGACCGCTCG
GCGAACGCGA AGGGGAAGAC GAAGGAGTTG AAGAGCAGAA AGGCGACGAA GCATCGACGG
GAGAATTTAT TGATCGACGT GAGCGAAGAG GGGGCGACGT TTCGGTCGTA CAGGGACGGG
ACGAAGATGA CGCTCACGCC GGAGTCGAGC GTGTTGAGTC AGAAACAAAT CGGGGCGGAC
ATCATCATTC CGCTCGACGA GTTGCCGCCG TACGACATCG ATCGCGAGAC GTTGGAGGAG
AGCGTGCATC GGTCGCATCG ATGGATGACG AGGAGTTTGG AGACGCACTT GAAGGACGTG
CGTCAACAGG CGATGTACGG CGTCGTGCAC GGAGGCGTGG ATCGCGAGCT ACGACGGATG
TCGGTCGAGT ACCTGAGCGC CCTGCCGTTC GACGGGCTCG CGATTGGCGG TTCTCTCGGC
CGCGACGCCG CCGAACTCGG CGCTTTGCTC GAGTTTTTGA TGCCTCTCCT TCCGAAACAC
TTGCCGAATC ACCTCCTGGG GATCGCTGAC ATGGAAAACA TCGAGCACGC GGTGGCGAAC
GGCGTCGACA CGTTCGACAG TTGCTACCCG ACTCAAGTGG CGAGACACGG CACGCTATTC
ACGCGCTCTC GCGGACGCAT AAACTTCCGT CGCGCCGAAT TCCGAACCAG CACCGAACCC
GCGTGCGAAG AGTGCGAGTG CACGCTCTGC ACCAAGCACA CCTTGGGCTA TCTTCATCAC
CTCGATCGCG CCAACGAACC GCTCGCTTGG TCCTTGGCGA GCGAGCACAA CCTGTATCAC
ATGGGCGATA AGATGCGTCG CGTTCGCGAG GGCATCCTGA ACGGCGAGAT TTGA
 
Protein sequence
MRVARVARAR GVLDRVARAS RCARARDASA SARSSASTDA RLERAPAASP ASPRFRFEVL 
RASTRSNARA GMIHTPRGTI ATPGYVAVGT NAAMKAVRGD ALRRAGIDLM FANTYHLMLQ
PGAATVEAMG GIHAFMGRRG PVITDSGGFQ VFSLGTPDRS ANAKGKTKEL KSRKATKHRR
ENLLIDVSEE GATFRSYRDG TKMTLTPESS VLSQKQIGAD IIIPLDELPP YDIDRETLEE
SVHRSHRWMT RSLETHLKDV RQQAMYGVVH GGVDRELRRM SVEYLSALPF DGLAIGGSLG
RDAAELGALL EFLMPLLPKH LPNHLLGIAD MENIEHAVAN GVDTFDSCYP TQVARHGTLF
TRSRGRINFR RAEFRTSTEP ACEECECTLC TKHTLGYLHH LDRANEPLAW SLASEHNLYH
MGDKMRRVRE GILNGEI