Gene OSTLU_28303 details

Gene Information

Locus tag: OSTLU_28303
Symbol:
ID: 5006178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 183372
End bp: 185412
Gene Length: 2041 bp
Protein Length: 662 aa
Translation table:
GC content: 61%
IMG OID: 640421599
Product: predicted protein
Protein accession: XP_001422225
Protein GI: 145355988
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
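
The coordinate fields above agree with one another if the coordinates are read as inclusive and 1-based (an assumption; the record does not state its convention): End bp minus Start bp plus one gives the reported Gene Length. A minimal Python sketch of that check follows; the helper name gene_length is hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature from inclusive, 1-based coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
print(gene_length(183372, 185412))  # 2041
```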


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.00317513
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000285759
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACGATCG AGCGCGCGAG CGAGGCGAGG TACGATTTCG GGGCGAGGCG GTTGGAGGAG 
GTGACGACGC CGACGCCGAG CGACATCGAA ATCGCGAGCG CGGCGAAGGC GCTGCCGATC
AGGGAGATCG CGGAGAAGAT GTGCGGATTG ACGGAGGATG ATTACGAGAT GTACGGACGG
TATAAGGCGA AACTGTCGGA AAAGCTGAGC TCGACGCTGG GAGGGCGCGA GGGCGAGGGC
GACGGGCCGA CGTGCGACGC GACGGCGGGG CACGGGTATT ACGTCGTGAC GGCGGGGATC
ACGCCGACGC CGTTGGGAGA GGGGAAGTCC ACGACGACGA TCGGGTTGGT GCAGGCGATG
CAGGCGCACA CGAACGCGCG GGCGGTGGCG TGCATTCGAC AACCGTCCAT GGGACCGACG
TTCGGCATCA AGGGGGGGGC CGCGGGCGGG GGGTACTCGC AGGTGATTCC TATGGAGGAG
ATGAACTTGC ACTTGACCGG GGACATTCAC GCCATCGGCG TCGCGAATAA CTTGCTCGCC
GCGGCGGTGG ACGCGCGCGT GTTCCACGAA TCCACGCAAA GCGATGAGGC GCTGTGGAAG
CGCTTGATGC CGCAAGCCAA GGATGGTTCT CGCAAGTTCT CAGCCATCAT GTTCAAGCGC
TTGAAGAAGC TTGGAATCGA CAAAACCAAT CCCGATGACT TGACGGAAGA AGAGCGAAAC
AAATTCGTGC GTTTGGACAT CGACCCCGAG CGCATCACGT GGAGACGCGT CGTCGACATG
AACGACCGCT TTTTGCGCGA AATCACCGTC GGCGAATCGC CGACGGAGAA GGGCAAGACT
CGCAAGACTG GTTTCGACAT CACCGTCGCC TCGGAAATCA TGGCGGTTTT GGCGATGACG
ACTTCGCTCG CGGACATGGA AAACCGTCTC GGAAACATGG TCGTCGGCCC CGCGCGCGAC
GGCACCCCGG TGACGTGCGA CGATTTGGGC GTCACCGGCG CGTTGATGGC TCTCATGCGC
GACGCCATCA AGCCGACGTT GATGCAGACT CTCGAAGCGA CGCCCGTGCT CGTGCACGCC
GGCCCATTCG CTAACATCGC GAGCGGTAAC TCGTCCATCA TTGCCGATCA AATCGGTCTC
TCCATGGTCG GTAAGGGTGG CTTCGTCGTC ACCGAAGCCG GCTTCGGCGC CGACATCGGC
CTCGAAAAGT TTGTCAACTT GAAGTGCCGC AAGTCTGGTC TGAAGCCGAA CTGCGCCGTC
ATCGTCGCCA CCGTGCGCGC GTTGAAGTGC CACGGCGGTG GTCCGCCCGT GACCGCGGGT
AAGCCCCTCG ATCACTCGTA CACGACTGAA AACGTCGACA TGGTGCGCGA GGGCATGTGC
AACTTGGTCC GCCACATCGA AAACACCAAG TCTTTCGGTA TCCCGGTCGT CGTCGCCATC
AACGCGTTCC CGACCGACAC CGAAGCGGAA CACGCGGTGA TTCGTGAAAT CTCTCTCGCC
GCGGGCGCGG AAGACGCCGT TCTCTGCACC CATCACGCGC ACGGGGGCAA GGGCGCCGTC
GCGCTCGCCA ACGCCGTCAA GGCGGCGTGC GAGGCGAACG AAGACTGCAA GGACTTCAAG
TTTGGATACG AATCTTCTCT CGACATCAAG AGTAAGATTG AAACCGTGGC GAAGAACATC
TACAAGGCTG ACGGTGTTGA GTTCTCTGAA CGAGCCGAAG AAAAGATCAA GCTCTTCACC
GAGCAAGGCT TCGGCGATTT GCCGATTTGC ATGGCCAAGA CGCAGTACTC CTTCTCTCAC
GACCAATCGT TGAAGGGTGC GCCGAGCGGG TTCACGTTGC CCATCGGCGA CGTCCGCTTG
AGCGCGGGTG CTGGTTTCCT CGTCCCGCTC GTCGGCGCGT TCCCGACGAT TCCGGGCTTG
CCCACGCGAC CGGCGTACTA CGAAATCTCT GTGGACACCG AGAACGACCA AATTCTCGGC
TTGAGTTAGA CGGCGATGTG TGCGATAAAA TAACATTAAT TTCTTGATTC ATATTCCGAA
T
 
Protein sequence
MTIERASEAR YDFGARRLEE VTTPTPSDIE IASAAKALPI REIAEKMCGL TEDDYEMYGR 
YKAKLSEKLS STLGGREGEG DGPTCDATAG HGYYVVTAGI TPTPLGEGKS TTTIGLVQAM
QAHTNARAVA CIRQPSMGPT FGIKGGAAGG GYSQVIPMEE MNLHLTGDIH AIGVANNLLA
AAVDARVFHE STQSDEALWK RLMPQAKDGS RKFSAIMFKR LKKLGIDKTN PDDLTEEERN
KFVRLDIDPE RITWRRVVDM NDRFLREITV GESPTEKGKT RKTGFDITVA SEIMAVLAMT
TSLADMENRL GNMVVGPARD GTPVTCDDLG VTGALMALMR DAIKPTLMQT LEATPVLVHA
GPFANIASGN SSIIADQIGL SMVGKGGFVV TEAGFGADIG LEKFVNLKCR KSGLKPNCAV
IVATVRALKC HGGGPPVTAG KPLDHSYTTE NVDMVREGMC NLVRHIENTK SFGIPVVVAI
NAFPTDTEAE HAVIREISLA AGAEDAVLCT HHAHGGKGAV ALANAVKAAC EANEDCKDFK
FGYESSLDIK SKIETVAKNI YKADGVEFSE RAEEKIKLFT EQGFGDLPIC MAKTQYSFSH
DQSLKGAPSG FTLPIGDVRL SAGAGFLVPL VGAFPTIPGL PTRPAYYEIS VDTENDQILG
LS
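
Both sequences above are printed in blocks of ten characters, sixty per line. The sketch below shows one way to join such blocks into a single string and to compute a GC percentage and the coding length implied by a protein length. The function names join_blocks, gc_percent and cds_length are hypothetical helpers, and cds_length assumes an unspliced coding region that ends in one stop codon.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already joined nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def cds_length(protein_len: int) -> int:
    """Bases needed to encode protein_len residues plus one stop codon (assumption)."""
    return 3 * (protein_len + 1)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGACGATCG AGCGCGCGAG CGAGGCGAGG TACGATTTCG GGGCGAGGCG GTTGGAGGAG"
seq = join_blocks(first_line)
print(len(seq))          # 60
print(cds_length(662))   # 1989
```

With the Protein Length of 662 aa, the coding region including its stop codon accounts for 1989 of the 2041 listed bases. The final two codons for L and S are followed by TAG at positions 1987 to 1989 of the gene sequence, so the remaining 52 bases lie after the stop codon.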