Gene OSTLU_37018 details

Gene Information

Locus tag: OSTLU_37018
Symbol:
ID: 5001624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 263686
End bp: 265179
Gene Length: 1494 bp
Protein Length: 497 aa
Translation table:
GC content: 57%
IMG OID: 640417045
Product: predicted protein
Protein accession: XP_001417461
Protein GI: 145345949
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0498] Threonine synthase
TIGRFAM ID: [TIGR00260] threonine synthase
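
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 263686
end_bp = 265179
gene_length_bp = 1494
protein_length_aa = 497

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(gene_length_bp % 3 == 0)                   # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the span (1494 bp), the codon count (498), and the protein length (497 aa) agree with one another.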


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.00212861
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTCGC AACGAAATGC GGTGACGCAG ACGAAGGTTG TGCAGCGCAC ACGCGCTGCG 
GAAGGGCGCG CCGGGCGCTC TGCGCGCGCT GGGGTGCGCG CGAACGCGGA CGCCGACCTG
GGGAAAGCCG CGGCGCGGGA AAACAAACCT TTGGAAAAGG CCAACTTGTT CAGCGCGAAG
TTTACGCCGT TCGCGGGCGA TGCGAGTGAA GAGTATTCCT TGGATGAGGT CATTTACAGA
AGTAAATCTG GTGGCTTGCT CGACGTGACG CACGACATGG AGGCGCTCGC GATCTATCCT
CCGGAATATT GGAAGGCGTT GTTTGACGAG CGCGTTGGCA AGACTACGTG GCCGTACGGT
TCTGGGGTGT GGAGTAAGAA GGAGTGGGTT CTCCCAGGGA TCGCCGATGA AGATATCGTG
TCCATGTTCG AAGGTAACTC CAACTTGTTC TGGGCCGAAC GGTACGGACG AGAGTACCTG
GGCATGAGCG ACCTCTGGGT GAAGCAGTGC GGTAACTCCC ACACTGGGTC TTTCAAGGAT
CTTGGTATGA CCGCGTTAGT GTCCCAAGTC AACCGCATGC GTAAGATGGG TAAGCCTCTT
TCCGCCGTCG GCTGTGCGTC CACTGGTGAT ACTTCCGCCG CGTTGAGCGC GTACGCGGCC
GCTGCGGGTA TCCCATCCAT AGTCTTCCTT CCGGCAGACA AGATCTCTGT TGCACAACTT
GTCCAGCCGA TCGCCAACGG CGCGCTTGTG CTTTCTATTG ACACCGATTT TGACGGATGC
ATGCGTCTCA TTCGCGAAGT CACCGCGGAA CTTCCGATCT ACTTGGCGAA CTCTCTGAAC
TCTTTGCGCC TCGAAGGTCA AAAGACGGCG GCGATTGAAA TCTGCCAACA GTTCAACTGG
GAAGTTCCTG ATTACGTCAT CATCCCGGGT GGCAACCTCG GCAATGTCTA CGCTTTCTTC
AAGGGGTTCA AGATGTGCAA GGACCTTGGT CTCGTCGACA GACTGCCGCG CATGGTCGTT
GCGCAAGCGG CGAACGCCAA CCCGTTGTAC CGCGCCTACA AGAAGGGTTG GGATAAGTTC
GAAGCCGTCA AGGCTGAGCC GACTTTCGCG TCCGCGATCC AAATCGGTGA TCCTGTTTCC
ATCGATCGCG CGATTTATGC GCTCACGGAA ACGAACGGTA TCGTTGAGGA GGCGACGGAA
GAGGAAATGA TGGATGCTGC CGCGGAAGCC GATTTGACGG GTATGTTTAA CTGCCCGCAC
ACTGGCGTCG CGCTCGCTGC GCTCAAAAAG CTTCGTGAGC AACAAGTGAT CGCGCCGAGT
GATCGCACTG TTGTCATCAG TACCGCTCAT GGCTTGAAGT TTACTCACAG CAAGGTTGCG
TACCATGAGA AGAGGCTCGA GGGGCTCGAA TCGAGATACG CCAACCCGCC GGTCGTCGTC
AAAGACGACT TCAGTGCTGT CATGGATGTT CTCAGTACTC GCTTGAACAA GTAA
 
Protein sequence
MLSQRNAVTQ TKVVQRTRAA EGRAGRSARA GVRANADADL GKAAARENKP LEKANLFSAK 
FTPFAGDASE EYSLDEVIYR SKSGGLLDVT HDMEALAIYP PEYWKALFDE RVGKTTWPYG
SGVWSKKEWV LPGIADEDIV SMFEGNSNLF WAERYGREYL GMSDLWVKQC GNSHTGSFKD
LGMTALVSQV NRMRKMGKPL SAVGCASTGD TSAALSAYAA AAGIPSIVFL PADKISVAQL
VQPIANGALV LSIDTDFDGC MRLIREVTAE LPIYLANSLN SLRLEGQKTA AIEICQQFNW
EVPDYVIIPG GNLGNVYAFF KGFKMCKDLG LVDRLPRMVV AQAANANPLY RAYKKGWDKF
EAVKAEPTFA SAIQIGDPVS IDRAIYALTE TNGIVEEATE EEMMDAAAEA DLTGMFNCPH
TGVALAALKK LREQQVIAPS DRTVVISTAH GLKFTHSKVA YHEKRLEGLE SRYANPPVVV
KDDFSAVMDV LSTRLNK
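
Both sequences are laid out in space-separated blocks of ten letters, so they need cleaning before any analysis. The sketch below is a minimal example with hypothetical helper names (`clean_sequence`, `gc_content`, `translate`). It assumes the standard genetic code (NCBI translation table 1), because the record leaves the translation table field blank.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for the ten-letter block layout."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGCTGTCGC AACGAAATGC GGTGACGCAG")
print(translate(first_blocks))  # MLSQRNAVTQ
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence through `clean_sequence` and then `gc_content` or `translate` would allow the same comparison against the listed GC content and protein sequence.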