Gene OSTLU_2836 details


Gene Information

Locus tag: OSTLU_2836
Symbol:
ID: 5001913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009359
Strand:
Start bp: 295697
End bp: 297031
Gene Length: 1335 bp
Protein Length: 445 aa
Translation table:
GC content: 63%
IMG OID: 640417334
Product: predicted protein
Protein accession: XP_001417730
Protein GI: 145346511
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
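
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the page does not state this); the function name `span_length` is hypothetical and used only for illustration.

```python
# Values copied from the Gene Information fields.
start_bp = 295697
end_bp = 297031
gene_length_bp = 1335
protein_length_aa = 445


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


computed_length = span_length(start_bp, end_bp)
codons = computed_length // 3  # one amino acid per complete codon

print(computed_length == gene_length_bp)  # coordinates agree with Gene Length
print(computed_length % 3 == 0)           # length is a whole number of codons
print(codons == protein_length_aa)        # codon count agrees with Protein Length
```

Under that assumption, 297031 - 295697 + 1 = 1335 bp, and 1335 / 3 = 445 codons, which matches the listed protein length.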


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0375368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.105392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTGCGGGCGT TCGACGCGGC GTCGGGGGGG GCGGTGCGAG CGATGATCGA GGAGGGAGAG 
TTTAAGGGAA AGGTTGGGAC GTCCGCGTTC TCGAGGGCGA TCGCGGGGAC GGCGAAACAC
GTCGGCGTGG TCGGCGTGGG GAAGAGGGGG GAGGCGACGA CGAGCGCGAT GTTGACGCTG
GGATCCGCGG CGGCGGCGCA AGCGCAAAAG AGCAAGTGCA AGACGCTCGG GATTAAGGGT
GCGGACGGAT TCGAAAACGC GGTCGCGGCT GGGGCGTACT TGGGCGCGAC GGAGGATCAG
AGGTTTAAGT CCAAGCCGCA GGCGAAGACG CTCGAATCCG TGTCGTTGTT GGACGTCTCC
GCGGATGTCT CCGAGGGTAA GGCGCGCGCG GTGGGCGTGC AGTTGACCAA AGATCTCGTC
GCCGCACCGC CGAACGTGGT GACGCCGACC GCGATGGCGG AAGTCGCCAA GTCCATCGCC
GCCGCGCATC CGGATTGCAT GAGCGTGAAA ATTTTGGAAA AGGCTGATTG CGAAAAGCTG
GGCATGGGAT CCTTCTTGGG CGTCTCCGAA GCCTCGGACG AACCGCCGAA GTTTATCCAC
CTCACGTACA AGGGTGCTGG GAGCGACTTG AAGAAAGTGG CTGTCGTCGG TAAGGGGCTG
ACGTTTGACT CCGGCGGCTA CAACTTGAAG GCGGGCGCGG GGTCGATGAT TGAGATGATG
AAGTTTGACA TGGGCGGTTC CGGAGCCACG CTCGGCGCCG CGAAAATCAT CGCGCAGACG
AAGCCAGCGG GCGTCGAATG TCACTTCATC ATCGCGTCGT GCGAAAACAT GATCGGTTCT
CGCGGCTTGC GCCCTGGCGA CATCTTAACC GCGTCCAACG GCAAGACCAT CGAGGTGAAC
AACACCGACG CTGAAGGTCG TCTGACGCTC GCCGACGCCT TGGTGTACGC CGACAAGACG
TGCGGTGCGA CCGCCATCGT TGACTGCGCC ACTCTCACCG GCGCCATCAT CGTCGCGCTC
GGCAACGACA TCGCCGGTCT CTTTTCCCCG AAGGACGCCG CCGCCAAGCG CGTCGAAGAC
GCCGCCAAGG CGGCGGGAGA AGATCTCTGG CGTATGCCCA TGCCCGATTC CATGTGGTCT
ATCATGAAAT CTGAAATCGC CGACATGAAG AACACCGGCT CTCGCGGCGG CGGTTCCATC
ACCGCCGCGC TCTTCTTGAA GCAATTCGTC GACGAGAGCG TCGAGTGGAG CCACGTGGAT
TTGGCCGGAC CGGTGTGGGA CGACAAGAAG GGCGGTGCCA CCGGTTACGG CGCAGCGCTT
TTCGCCGCGT GGGTG
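
The record lists a GC content of 63%. A minimal sketch of how such a figure could be computed from a sequence printed in blocks of ten, as above, is shown here. The function name `gc_percent` is hypothetical, and the page does not say how its own value was derived or rounded, so the result for the full sequence may differ slightly from the listed figure.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record; paste the full
# sequence here to compare against the listed GC content.
first_gene_line = "CTGCGGGCGT TCGACGCGGC GTCGGGGGGG GCGGTGCGAG CGATGATCGA GGAGGGAGAG"
print(round(gc_percent(first_gene_line), 1))
```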
 
Protein sequence
LRAFDAASGG AVRAMIEEGE FKGKVGTSAF SRAIAGTAKH VGVVGVGKRG EATTSAMLTL 
GSAAAAQAQK SKCKTLGIKG ADGFENAVAA GAYLGATEDQ RFKSKPQAKT LESVSLLDVS
ADVSEGKARA VGVQLTKDLV AAPPNVVTPT AMAEVAKSIA AAHPDCMSVK ILEKADCEKL
GMGSFLGVSE ASDEPPKFIH LTYKGAGSDL KKVAVVGKGL TFDSGGYNLK AGAGSMIEMM
KFDMGGSGAT LGAAKIIAQT KPAGVECHFI IASCENMIGS RGLRPGDILT ASNGKTIEVN
NTDAEGRLTL ADALVYADKT CGATAIVDCA TLTGAIIVAL GNDIAGLFSP KDAAAKRVED
AAKAAGEDLW RMPMPDSMWS IMKSEIADMK NTGSRGGGSI TAALFLKQFV DESVEWSHVD
LAGPVWDDKK GGATGYGAAL FAAWV
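
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below translates the first line of the gene sequence and compares it with the first two blocks of the protein sequence. It assumes the standard genetic code (NCBI translation table 1), since the record leaves the "Translation table" field empty; `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1).
# Codons are enumerated in T, C, A, G order with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and first two protein blocks, from the record.
first_gene_line = "CTGCGGGCGT TCGACGCGGC GTCGGGGGGG GCGGTGCGAG CGATGATCGA GGAGGGAGAG"
first_protein_blocks = "LRAFDAASGG AVRAMIEEGE"

translated = translate(first_gene_line)
print(translated)  # LRAFDAASGGAVRAMIEEGE
print(translated == "".join(first_protein_blocks.split()))  # True
```

The same function can be applied to the full gene sequence; the 1335 bases give 445 codons, in line with the listed protein length.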