Gene OSTLU_10204 details

Gene Information

Locus tag: OSTLU_10204
Symbol:
ID: 5001842
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 363030
End bp: 364088
Gene Length: 1059 bp
Protein Length: 337 aa
Translation table:
GC content: 61%
IMG OID: 640417263
Product: predicted protein
Protein accession: XP_001417747
Protein GI: 145346545
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2021] Homoserine acetyltransferase
TIGRFAM ID: [TIGR01392] homoserine O-acetyltransferase


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.110736
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTCGCGGCGC CGCTCGCGCT CGAGAGCGGT GACGCGCTCG ACGCGTGCGA CGTCGCGTAC 
ACGGTGTACG GCGAGCTGAA CGCGAATAAA GACAACGTCG TGCTCGTCGG ACACTCGCTG
ACGTCGAACA GTAACGTCGG CGAGTGGTGG GGGGAGGTGC TGGGCGAGGG CGACGCGTAC
GCGCTGAACT CGAGGGAGGA TTGCGTGATT TGCGTCAACT ATCTGGGCTC GCCGTACGGA
AGCGCGAGCC CGGTGAGCGC GGATCCGAGA AAGGAGGATC GCGGGGCGTA CGGGGTCGAT
TTTCCGACGC CGGTCACGGT GAGGGATAAC GCGGTGATGT GCATGATGCT GCTGAGGGAG
CTCGGGGTGA ACGGGGTGCG GTGCGCGATG GGCGGGTCGA TGGGTTCGAT GCTGGCGCTG
GAGTTCGCGG CGACGTATCC GGATTTCGTA AAGGAGATAA TCATCATCGC CGGGTGCGGA
CGACACACGG ATTGGGCGAT CGGCATAGGG GAGGCGCAAC GGTACGCGAT CATGAGCGAT
GGGAAGTATA AGGGTGGGGC GTACGAGCGC GATCAAGGGC CGAACGCGGG GTTGGCGACG
TCGCGAATGA TGGCGATGCT GAGCTATCGC GCGCCGGCGA GCGTCGATGG GAGATTTTCG
CGTTCGAACA TGGGAGACGT CGCGAGACCG GCGGAAGAAC CCGAGCTAGG TGTGCGCGCG
CACGAAAAGG AGACTAAGTT GCCGTATTTT GCGGTGGAGT CGTATCTGCA GTATCAAGGG
AAAAAGTTTA TTCGCAGATT CGACGCGAAC TGCTACATTC AGTTGACGTA CACACTGGAC
TCGCACGACG TCTCGCGTGG GCGAGGGGAT TATTTCGATG TGCTGGCAAA TATTAAACAG
CGCGCTCTCG TCGTGGGTAT TCTCAGCGAC GTACTGTATC CGTATGCGCT TCAGCGCGAG
CTCGCCGACG CGTTGCCGAA TTCGCAGCTG TACACCATAG ACTCCCCGCA CGGCCACGAC
TCGTTCTTGA TCGAGATCGA GCAACTCAAC GCCGTCATG
 
Protein sequence
LAAPLALESG DALDACDVAY TVYGELNANK DNVVLVGHSL TSNSNVGEWW GEVLGEGDAY 
ALNSREDCVI CVNYLGSPYG SASPVSADPR KEDRGAYGVD FPTPVTVRDN AVMCMMLLRE
LGVNGVRCAM GGSMGSMLAL EFAATYPDFV KEIIIIAGCG RHTDWAIGIG EAQRYAIMSD
GKYKGGAYER DQGPNAGLAT SRMMAMLSYR APASVDGRFS RVRAHEKETK LPYFAVESYL
QYQGKKFIRR FDANCYIQLT YTLDSHDVSR GRGDYFDVLA NIKQRALVVG ILSDVLYPYA
LQRELADALP NSQLYTIDSP HGHDSFLIEI EQLNAVM
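
Consistency check (illustrative sketch)

The Gene Length and GC content fields can be recomputed from the coordinates and the gene sequence in this record. The Python sketch below is one way to do that, not part of the record itself. The helper names span_length, clean_sequence and gc_percent are hypothetical. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the listed length (364088 - 363030 + 1 = 1059), and that the sequence is pasted as shown, in space-separated blocks of ten bases.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start/end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Coordinates from this record reproduce the Gene Length field.
print(span_length(363030, 364088))  # 1059

# Only the first two blocks of the gene sequence, to show the call shape.
print(gc_percent("CTCGCGGCGC CGCTCGCGCT"))  # 85.0
```

Passing the full gene sequence text to gc_percent gives the value to compare against the GC content field, and len(clean_sequence(...)) can be compared against Gene Length in the same way.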