Gene OSTLU_94892 details


Gene Information

Locus tag: OSTLU_94892
Symbol:
ID: 5004093
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 436859
End bp: 437857
Gene Length: 999 bp
Protein Length: 314 aa
Translation table:
GC content: 60%
IMG OID: 640419514
Product: predicted protein
Protein accession: XP_001420175
Protein GI: 145351636
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
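
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes one stop codon; the function names are hypothetical. Under those assumptions, the difference between the gene length and the coding length is the number of non-coding bases inside the span, which is consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information fields above.
START_BP = 436859
END_BP = 437857
PROTEIN_LENGTH_AA = 314


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)      # 999, matches "Gene Length"
cds_len = coding_length(PROTEIN_LENGTH_AA)    # 945
non_coding = gene_len - cds_len               # 54 bases not accounted for by codons
print(gene_len, cds_len, non_coding)
```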


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00994972
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.638195
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCCAA ACGAGCTGTC GCCGGACGCG TTTGTGGACG AGGCGAAGGT CGTCGCGGGC 
GAGCTCGGCG CGAGCGTTAA GGTTGTCGAG ATTAAACGTT ACGCGCAACT CGTCGAGGAG
GGCTACGGAT GTCTGGCGGG CGTCGGAAGC GCTTCGAGTC GCGACGGTCG TGATCCCGCG
CTCGTGCACT TGCGATTTAC GCCGAAAGGT TGCGTCGACC CGGACGCGCC GTCCATCGCG
TTCGTGGGCA AGGGTATCAC GTTCGACACC GGTGGGCTTT CGCTGAAATC TAAAGACGGC
ATGTGCGGTA TGAAGACGGA CATGGGCGGT GCCGCGGGTA TGCTGTGCGC GTTTGAGTCC
ATCGCTCGCG AGGACGCCGA ATCGAATTTC AAGACGCCAC TCGACCTCGT GCTGTGCATC
GCCGAAAACG CAATTGGCTC GGGCGCGATT CGCCCGGATG ACATTCTCGT CGGCAAGAGC
GGAAAGACAG TGGAAATCAA CAACACCGAC GCCGAAGGGC GCCTCGTGCT CGCGGACGGC
GTCGCGTATT GCTCCGATCC CGCGAACGCC GCGTGCAAGC CGCGCATCAT CGTCGACATG
GCCACGCTCA CCGGAGCGCA AATGATCGCC ACCGGGCGTA AACACGCCGG ACTTGTGACC
GATAGCGAAG ACATGGAGCA CACCATCGTG CGATTGGGCA GAATCACCGG AGATCTGGCG
CACGCCCTGC CCTACGCTCC GGAAATGTTC AAGAGTGAGT TCAGTTCCAA AGTCGCCGAC
ATGAAAAACT CCGTCGCCGA CCGCGCCAAC GCGCAATCGA GCTGCGCCGG TCAATTCATC
GCCAATCACT TACACCCAGA CTGGGTCGCT CGCGATGACA CGGCTTGGAT TCACCTCGAC
ATGGCTGGTC CTGGTAACTT TAAAGACGGT TTAGGTTCTG GCTATGGAGT GGCTCTGCTC
AACGCGCTCT ACAAAGAAAT CGACAGTCGA CCGCAATGA
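
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. It is an illustration only: the helper names are hypothetical, and the record does not say how its own GC content figure was calculated. The excerpt is just the first line of the sequence, so its GC fraction need not equal the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above, used only as an excerpt.
excerpt = "ATGGCGCCAA ACGAGCTGTC GCCGGACGCG TTTGTGGACG AGGCGAAGGT CGTCGCGGGC"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_fraction(seq):.0%}")
```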
 
Protein sequence
MAPNELSPDA FVDEAKVVAG ELGASVKVVE IKRYAQLVEE GYGCLAGVGS ASSRDGRDPA 
LVHLRFTPKG CVDPDAPSIA FVGKGITFDT GGLSLKSKDG MCGMKTDMGG AAGMLCAFES
IAREDAESNF KTPLDLVLCI AENAIGSGAI RPDDILVGKS GKTVEINNTD AEGRLPRIIV
DMATLTGAQM IATGRKHAGL VTDSEDMEHT IVRLGRITGD LAHALPYAPE MFKSEFSSKV
ADMKNSVADR ANAQSSCAGQ FIANHLHPDW VARDDTAWIH LDMAGPGNFK DGLGSGYGVA
LLNALYKEID SRPQ
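
The start of the protein sequence can be reproduced from the start of the gene sequence by codon translation. The sketch below assumes the standard genetic code, since the Translation table field in the record is empty; the `translate` helper is hypothetical. Because the gene is marked as spliced, translating the full gene sequence straight through is not expected to reproduce the listed protein. Only the first 30 bases are translated here.

```python
from itertools import product

# Standard genetic code (an assumption; the record leaves "Translation table" blank).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGGCGCCAA ACGAGCTGTC GCCGGACGCG"
print(translate(first_blocks))  # MAPNELSPDA, the first block of the protein sequence
```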