Gene OSTLU_46103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_46103 
Symbol 
ID5002751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp46726 
End bp48713 
Gene Length1988 bp 
Protein Length564 aa 
Translation table 
GC content55% 
IMG OID640418172 
Productpredicted protein 
Protein accessionXP_001418593 
Protein GI145348306 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.036188 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ACGGGGGCGG GCGCGAGCGC GGCGGCGAGC GATAAGGCGA ACGCGACGGC GAGCGACGCG 
GAGCCGGTGA AACTGCTGAC GAGCGATGAG AGCGAGAATT TGTTGAAAAT ACGACACACG
GTGCGTTGGA AATGCGAGGC GTGGGGAGAC TTGCGTCGGT GGTGGATCGG ACGACGACGC
GCGAGGGGCG AGCGGAGGGG GTTTGTGCGC GACCTGAACC ACGCAGGGAC GAAAGGAGAC
TGACGATGGG TTCGCTGATT TTACGCTTCG CAGAGCGCGC ACGTGATGGC GATGGCGGTG
CAAAAGGTGT TTCCGAGCGC GCAGTGCACG ATCGGGCCTT GGATCGACCG GGGATTTTAT
TACGACTTTT ACTACCCAGA GGGTTTCACC GATCAAGACA TGAAGAAGAT TCAAAAGGAA
ATGTATAAGA TTATTCGCAA GGATTACCCG CTTCGCAGGG AGGAAGTGTC GCGAGAAGAA
GCCGAACGCC GAATCCGGGA GATTAACGAG CCGTACAAGC TCGAAATCTT AGAAGCCATC
AAGACGGAGC CGATTACGAT TTATCACATC GGCGACGAGT GGTGGGATTT GTGCGCGGGA
CCTCACGTGG AGTCCACGGG CAAGCTTGAT CAGAAAGCGT TCGCGCTCGA AAGTTTGGCT
GGGGCGTACT GGCGTGGTGA CGAAACGAAG CCGATGTTGC AGCGCATTTA CGGCACGGCG
TGGGAGAATG AAGCACAGCT TCAGGCGTAC AACGATTTCA AGGCGGAGGC GAAGCGTCGC
GATCACCGAA CGATCGGTAA GGATTTGGGT TTGTTCTCCC TTCAACAAGA TAACGCTGGC
GGCGGCTTGG TGTTTTGGCA TCCGAAAGGC GCGCACATGC GACACATGAT CGAGACGTAC
TGGAAGGATC TCCATCTGGC GCGCGGGTAC GAGCTCTTGT ACTCACCGCA CGTCGCCAGG
CAAGAGTTGT GGAAAACCTC TGGTCACAGC GATTTTTATT CCGAGAACAT GTACCAGCCC
ATCAAGGTTG AAGATGAAAT GTATCAGCTC AAACCGATGA ACTGCCCGTT CCACATCGTC
GTTTACCAAG ACGGATACTA CTCGTACAAG GATTTACCCA TTCGTTGGGC TGAGCTTGGC
ACGGTGTATA GATATGAACG TAGCGGCACC ATGCACGGCT TGTTCCGAGT GAGAGGTTTC
ACGCAGGATG ACGCGCATAT ATTCTGCCTC CCGGATCAAA TCACAGACGA GATCAAGAGC
GTTCTCGATT TGACTGAAGA AATTCTGAGC ACGTTTGGTT TTAAGGAGTT TGAAGTCAAC
CTATCCACCA GGCCGGAAAA GTCTGTCGGC GACGACAAGA TTTGGGACAC CGCGGAAGGT
GCGCTTAAGG ACGCATTGCA AATGAAGGGA TGGGATTACA TCGTCGACGA CGGCGGTGGA
GCGTTCTACG GGCCGAAGAT TGACATTAAG ATTTTGGACG CAATCGGGCG TAAGTGGCAA
TGCTCCACGG TGCAGCTCGA CTTCAACCTG CCGGAGCGAT TCGACCTATC CTACGTCGAT
CGCGAGAACG CAAAGCAGCG ACCAATCATG ATTCACCGCG CCATTTTCGG TTCCCTTGAG
AGATTCTTCG GTATTCTCAC CGAGAACTAC GCCGGGGAGT TCCCGTTGTG GCTCGCCCCG
ATTCAAGTGC GCTTGCTTCC TGTGACGGAC GAAGTCAGCG ATTATACCGA AGGCGTCGCG
AAAAAGCTCC GCGATGCGGG CGTGCGCGTT GAAATTTGCA CCGGACAACG TCTCGCTAAG
CTCGTGCGCA CGGCTGAGAA GGCAAAGATC CCGGTCATGG CGGTCGTCGG TAGAGAAGAA
GCGGAGAACA ACACGTTGGC TGTGCGTACG TTCAAGGATG GCGACGTCGG TACATTGTCT
GTCGACGAAG TGTTGTCACG CGTCACCACC GCGAACGCGA CAAAGGGTCA AAGCTTCTAG
GAAGAGTC
 
Protein sequence
MAMAVQKVFP SAQCTIGPWI DRGFYYDFYY PEGFTDQDMK KIQKEMYKII RKDYPLRREE 
VSREEAERRI REINEPYKLE ILEAIKTEPI TIYHIGDEWW DLCAGPHVES TGKLDQKAFA
LESLAGAYWR GDETKPMLQR IYGTAWENEA QLQAYNDFKA EAKRRDHRTI GKDLGLFSLQ
QDNAGGGLVF WHPKGAHMRH MIETYWKDLH LARGYELLYS PHVARQELWK TSGHSDFYSE
NMYQPIKVED EMYQLKPMNC PFHIVVYQDG YYSYKDLPIR WAELGTVYRY ERSGTMHGLF
RVRGFTQDDA HIFCLPDQIT DEIKSVLDLT EEILSTFGFK EFEVNLSTRP EKSVGDDKIW
DTAEGALKDA LQMKGWDYIV DDGGGAFYGP KIDIKILDAI GRKWQCSTVQ LDFNLPERFD
LSYVDRENAK QRPIMIHRAI FGSLERFFGI LTENYAGEFP LWLAPIQVRL LPVTDEVSDY
TEGVAKKLRD AGVRVEICTG QRLAKLVRTA EKAKIPVMAV VGREEAENNT LAVRTFKDGD
VGTLSVDEVL SRVTTANATK GQSF