Gene OSTLU_18704 details


Gene Information

Locus tag: OSTLU_18704
Symbol:
ID: 5006289
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009371
Strand:
Start bp: 209215
End bp: 210294
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table:
GC content: 60%
IMG OID: 640421710
Product: predicted protein
Protein accession: XP_001422127
Protein GI: 145355778
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0075] Serine-pyruvate aminotransferase/archaeal aspartate aminotransferase
TIGRFAM ID:
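
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop
    codon, if present, does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length_bp = gene_length(209215, 210294)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values, the computed lengths agree with the listed Gene Length (1080 bp) and Protein Length (359 aa).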


Plasmid Coverage information

Num covering plasmid clones: 58
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.018829
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCCGG TGTTCTGCGA GACGATGAAG AACATCGACG GGATGATGAA GCGGGCGTAC 
AACGCGAGCG GCGTGGTGAT CATGCCGGGA AGCGGGACGT ACGGGATGGA GGCGGTCGCG
CGACAGTGGT GCAGCGGGAA AAAGGCGCTC GTGATTCGTA ATGGGTACTT TAGCTATCGA
TGGACGGATA TTTTTGAGCA AACGCAAATC CCGAGCGAGA CGATCGTGAT GCGAGGACGC
GCGGTGGACG ACGAAAAGAC GCCGGCGTTC GCGCCACCGC CCTTGGCGGA GGTCGTGGAG
ATGATCAACA AGGAAAAACC CGCGGTGGTG TTCGCGCCGC ACGTGGAGAC GTCGACTGGG
ATCATTCTGC CGGATTCGTA CATCAAGGCT GTGGCCGACG CCGTGCACGC GCACGGGGGG
CTGTTCGTGC TCGATTCCAT CGCGAGCGGC ACGATTTGGG TGGACATGAA GGCGACCGGC
GTGGACGCCA TCTTGAGCGC GCCGCAAAAG GGCTGGACTG GTCCGGCGTG CGCGAGCGTG
ATCATGCTCG GCGAACGCGG CGTGCACGCG ACGCGCAACT CGCAATCCAC CTCCATGGTC
ATCAACATGC GCAAGTGGCT CGAAGTCATG GATGCCTACT TGGCGGGCGG GTTCGCGTAC
TACACCACCA TGCCCACCGA CGCGTTGACT TTGTTCGAAC GCGCGGCGAT GGCGACCGAA
AAGGTTGGTT TCGACAAGGT CAAGCAAATG GCATGGGATC TCGGCACTGA GTGCCGCAAG
ATGATGGCGA GCAAGGGATT GAAATCCGTC GCCGCCAAGG GGTTCGAGGC GCCGGGCGTC
GTCGTGTCGT ACACGGATGA CGCCACCATG TTCGCCAAGT TCAAGTCTAA GGGTTTCCAA
ATCGCCGCGG GCGTTCCGTT CATGATCAAC GAACCCGCCG GCAACAACAC TTTCCGCATC
GGTTTGTTCG GCTTGGACAA AATCATGAAC AAGGACAACT GCATCAACAC CCTCGAGCCG
ACGTTGGATG AAATTTTACG CGAAAACGCC GAAGCGGCCG GCGCCGAAGC CGCCTCTTAA
 
Protein sequence
MSPVFCETMK NIDGMMKRAY NASGVVIMPG SGTYGMEAVA RQWCSGKKAL VIRNGYFSYR 
WTDIFEQTQI PSETIVMRGR AVDDEKTPAF APPPLAEVVE MINKEKPAVV FAPHVETSTG
IILPDSYIKA VADAVHAHGG LFVLDSIASG TIWVDMKATG VDAILSAPQK GWTGPACASV
IMLGERGVHA TRNSQSTSMV INMRKWLEVM DAYLAGGFAY YTTMPTDALT LFERAAMATE
KVGFDKVKQM AWDLGTECRK MMASKGLKSV AAKGFEAPGV VVSYTDDATM FAKFKSKGFQ
IAAGVPFMIN EPAGNNTFRI GLFGLDKIMN KDNCINTLEP TLDEILRENA EAAGAEAAS
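
The sequences above are printed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, assuming plain-text input copied from this page. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from the record above.
first_line = "ATGTCGCCGG TGTTCTGCGA GACGATGAAG AACATCGACG GGATGATGAA GCGGGCGTAC"
seq = join_blocks(first_line)
print(len(seq), seq[:3], round(gc_fraction(seq), 2))
```

Applying the same helpers to the full gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.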