Gene OSTLU_31740 details

Gene Information

Locus tag: OSTLU_31740
Symbol:
ID: 5001867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 455813
End bp: 457475
Gene Length: 1663 bp
Protein Length: 468 aa
Translation table:
GC content: 63%
IMG OID: 640417288
Product: predicted protein
Protein accession: XP_001417777
Protein GI: 145346606
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
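
The reported gene length agrees with the start and end positions if both are read as inclusive coordinates, so that length = End bp - Start bp + 1. A minimal sketch of that check follows. The function name is illustrative and not part of the database, and the inclusive-coordinate reading is an assumption that happens to match the figures above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both inclusive.

    Assumption: coordinates are inclusive on both ends, which is what
    makes the record's own numbers (455813, 457475, 1663 bp) agree.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(455813, 457475)
print(length)  # 1663
```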


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.414163
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CGCGACGCGC GACGCGGACC GCACTCGCGA CGCTCGGCGC ACCGAAGTCC GATGATGACG 
CACGCGATGC CGACGCGCGC GGCGATGCCG ACGCGCGCGG CGACGACGCG ACGCGAAAGA
TGCGATGACG GCGTCGCGAT GACGACGACG ACGACGACGC GCCGACGCGC GGTGACGCGA
CGCGCGGCGA CGCGAACGAC GACGCGGACG CGAACGGTGA TGCGCGCGGT GATACCGCGA
CCGGGCGCGA GCGTGGCGGA GGGACTGGAG ATACCGTCGT ACGAACACGT GCGAGAGGCG
TACGAAAAGG TGAGCGGAGG GGTGGTGCGC TCGCGGTGCA CGCACTCGAA GACGCTGAGC
GCGCTGACGG GGATGGAGAT TTACTTGAAA CACGAGTGGG AACAAGCGAC GGGGTCGTTT
AAGGAGCGAG GGGCGCGGAA CGCGCTGATG GCGCTCGACG CGGAGCAAAG AAAGCGAGGC
GTCATCGCGG CGAGCGCGGG CAACCACGCG CTGGCGCTCG CGTATCACGG AAGAGAACTC
GGGATTCCGG TGACGGTGAT CATGCCGTCG ATCGCGCCGC TGACGAAGAT TACGAAGTGC
CAAAAGTTGG ACGCGCGCGT GATTTTGGAA GGGGACACGA TCGCGGACGC GGCGGCGTAT
GCGAAAGAGA ACTACGTCGT GAAGGGCGAA ATGCTCAAGT ACATCAACGG TTTCGACGAC
TTTGAAATCA TCTCCGGCGC CGGTTCGGTC GGTATCGAGA TGCTCGAAGA CGTTCGCGAC
GCGGACGCCG TGGTGATTCC CGTCGGCGGT GGTGGCTTGA TTGCCGGTAT CGCGCTCGCC
GTGAAGCACC TCAAGCCGGG GTGCCAAGTC ATCGGCGTCG AGCCCGAGCG CTGCGCCTCC
ATGACGCTCG CGCTCGAAGC GGGTGAACCC GTCAAGGCGC CGACGACGGC GACGCTCGCT
GATGGTCTCG CCGTTCCTAC CGTGGGTCCG CGCTCGTTCC AAACCGTTAA AAACTTGATT
GACGATATCG TACTCGTCAG CGAGTCAGAC ATCGCCGTCG CCATGTTGAG ACTTCTCGAA
AACGAAAAGC TCGTGCAAGA GGGCGCTGGG ATTTCCGGTT TGGCCGCACT CTTGACCGAT
AAGCTCCCCG GTCTCAAGGG CAAGAAAGTA GTGGTCGCCA TGTGCGGTGG TAACATCGAT
ACGAGCACGC TCGGTAAAGT GCTCGAGCGC GGCTTGGTCT CGGACGGGCG ACTCGTGCGC
TTCGCGTGCG TCGTTCCCGA CAGACCGGGT GGTATCGCTG GCGTGTGCAA CAAGATAGCC
GACGTGGGTG CTTCGATCAA GCACATCGTT CACGAGCGCG CTTGGTTAGA CCAAGACTCT
CACTGCGTCC TCGTCGATGT CGAATGCGAA GTGACCGACA GTAGCATGGG CGAGGAGCTC
TACGACGCCA TCAGCTCGGC GTACGTCCTG CGTACCGCTA ACTTTGGACC GGCGAACCGC
CCGAAGGTCC CCGCGCGAAC GCCCGCACGT CGAGACGACG TCGACGCAAA TGATCTGAAG
AATCCTGTGA CGTCGTGCGA TTTGTTCGAC GAAGACTGCA CGTCCATTTA CCCCGAAGAC
GAGTAACTCC GGTTACGCCG TCGCAGCATC GCATGCATCG CGC
 
Protein sequence
MRAVIPRPGA SVAEGLEIPS YEHVREAYEK VSGGVVRSRC THSKTLSALT GMEIYLKHEW 
EQATGSFKER GARNALMALD AEQRKRGVIA ASAGNHALAL AYHGRELGIP VTVIMPSIAP
LTKITKCQKL DARVILEGDT IADAAAYAKE NYVVKGEMLK YINGFDDFEI ISGAGSVGIE
MLEDVRDADA VVIPVGGGGL IAGIALAVKH LKPGCQVIGV EPERCASMTL ALEAGEPVKA
PTTATLADGL AVPTVGPRSF QTVKNLIDDI VLVSESDIAV AMLRLLENEK LVQEGAGISG
LAALLTDKLP GLKGKKVVVA MCGGNIDTST LGKVLERGLV SDGRLVRFAC VVPDRPGGIA
GVCNKIADVG ASIKHIVHER AWLDQDSHCV LVDVECEVTD SSMGEELYDA ISSAYVLRTA
NFGPANRPKV PARTPARRDD VDANDLKNPV TSCDLFDEDC TSIYPEDE
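
The sequences above are displayed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage. Both helper names are hypothetical, and the sample is only the final line of the gene sequence, not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence shown above, used as a short sample.
sample = "GAGTAACTCC GGTTACGCCG TCGCAGCATC GCATGCATCG CGC"
seq = join_blocks(sample)
print(len(seq))                   # number of bases in the sample line
print(round(gc_percent(seq), 1))  # GC percentage of the sample line only
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field.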