Gene OSTLU_16744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_16744 
Symbol 
ID5003688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009363 
Strand
Start bp249959 
End bp251419 
Gene Length1461 bp 
Protein Length486 aa 
Translation table 
GC content63% 
IMG OID640419109 
Productpredicted protein 
Protein accessionXP_001419756 
Protein GI145350738 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.544898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACG ACGACCGCAC GTACGACGCG GCGGTGCGCG AGCTCGCGAA GACGATCACG 
GGTAAAAAAC GCGCGGACCC GGGCGCGGTG ACGTGGGAGG CGCAGTTCGC GCAGCTGGAG
ACGTACGTCT CGAGGCTGGA CCTGCGAGGG CGCGTCGACG CGCTCGACGT GGTGCACGTG
GCGGGGACGA AGGGGAAGGG ATCGACGTGC GCGATGGTGG AGGGGATGCT CAGGGCGAGC
GGGACGCGGG TGGGGACGTT CACGAGCCCG CACTTGATGG ACGTGCGAGA ACGATTTCGG
ATCGACGGGG CGATGGTGGA CGAGGAGACG TTCGCGAGGG AGTTTTGGTG GACGCGGGAC
GCGATCGCGG CGCGGTGCGG AGATCTGGGG ATGCCGGCGT ACTTTAGGTT TTTGACGCTG
TTGGGGCTGC GAATTTTCTC GAGCGCGGGG GTGGAGGCGT GCGTGCTGGA GGTGGGGTTG
GGGGGGCGCC TGGACGCGAC GAACGTGGTG CGCGCGCCGG CGGCGTGCGG GATCACGTCG
CTCGGGATGG ATCACGTGGA TATTTTAGGG GATACTTTGG GGAAAATCGC GACGGAGAAG
GCGGGGATCA TGAAGCCGGG CGTGCGAACG TTTACGGCGC CGCAAAAACC CGAGGCGATG
GTCGCGCTCG AGCGTCGCGC CGCCGAGGTT GGTTCGCCGT TAGTCGTCGC GAGAGATTTA
GACAGCTATG AGGGCGGGAG TGAGATCGAA GTCGGGTTGG CGGGACCGCA CCAACGCGTC
AACGCCGCCG TCGCCGTGGA GTTGGTGCGG GAGTGGGCGA ACGCCACGAA CCAGCCGTGG
GCGGAGGAGA TGGAGGCTTC GTTCGCGCGA AACGAGTTAC CGGAGAGCTT TCGCGTCGGG
TTAGAGAAGA CGACGTGGCC GGGTCGATCG CAGGTGATGC ACGATCCCGA TGTGATGAAC
TTGACGTTTT ATCTCGACGG CGCGCACACC ATCGAATCCA TGCGCCACAG CGCCGAATGG
TTTACGAGCA CCGCGCGAGC GGTGACGGCG GAGCACAACA TCATGCTTTT CAACTGCATG
GATGATCGCA AACCGGAGGA TTTACTCGAA CCGGTCGCGG ATGTTTTCCA CACGACTTCA
AACGTACAGT TAGAGCGCGC TATTTTTAGC CCGCCGGATA GCACCACGAG CGGGTTAGAC
AAGTGCGGGG ACGCAAAGGC GACCTCTTGG CAAGACAGGT GTGCGCGTAC GTGGGACGAT
ATCATCGTCA AACGTGCGAA CGTGCTCGCA GAAAAAGCTC AAGACACTCG AGGTGTCGTG
GTTTCGAGCA TTCATCAAGC GCTCAGTCTC ATTCGACATC GAGCGCGAGA GGTGGCGCCG
GCGCGCGTCA ACGTCCTCGT CACTGGCAGT CTGTATTTGG TCGGCGACGT CCTGAGGCAT
CTAAAGAAGT GCGTGAAATA G
 
Protein sequence
MRDDDRTYDA AVRELAKTIT GKKRADPGAV TWEAQFAQLE TYVSRLDLRG RVDALDVVHV 
AGTKGKGSTC AMVEGMLRAS GTRVGTFTSP HLMDVRERFR IDGAMVDEET FAREFWWTRD
AIAARCGDLG MPAYFRFLTL LGLRIFSSAG VEACVLEVGL GGRLDATNVV RAPAACGITS
LGMDHVDILG DTLGKIATEK AGIMKPGVRT FTAPQKPEAM VALERRAAEV GSPLVVARDL
DSYEGGSEIE VGLAGPHQRV NAAVAVELVR EWANATNQPW AEEMEASFAR NELPESFRVG
LEKTTWPGRS QVMHDPDVMN LTFYLDGAHT IESMRHSAEW FTSTARAVTA EHNIMLFNCM
DDRKPEDLLE PVADVFHTTS NVQLERAIFS PPDSTTSGLD KCGDAKATSW QDRCARTWDD
IIVKRANVLA EKAQDTRGVV VSSIHQALSL IRHRAREVAP ARVNVLVTGS LYLVGDVLRH
LKKCVK