Gene OSTLU_31131 details

Gene Information

Locus tag: OSTLU_31131
Symbol:
ID: 5001178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009358
Strand:
Start bp: 358841
End bp: 360247
Gene Length: 1407 bp
Protein Length: 443 aa
Translation table:
GC content: 60%
IMG OID: 640416599
Product: predicted protein
Protein accession: XP_001417485
Protein GI: 145346000
COG category: [R] General function prediction only
COG ID: [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase
TIGRFAM ID: [TIGR01891] amidohydrolase
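
The reported gene length is consistent with the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, with `feature_length` as a hypothetical helper name:

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(feature_length(358841, 360247))  # 1407
```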


Plasmid Coverage information

Num covering plasmid clones: 321
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCGCGACGCT CGAGCCATGC TCAGAGCGCT GTTCTTCCTC GCCCACGTCC CCGCGGCGCT 
CGCCCTGGAC GCGACGACGC TGCGCGATAT CGCCGCGTCG TCATCAAACG TCGATGCGCG
CGAAATCCTG TCGCAATCGC GCGCGACGCA CGATTACGTC GTCGACCTGC GACGAGAAAT
CCACAAAAAT CCCGAACTCA TGTGGACCGA ACGCGCCACC GCGGACGTCA TCGCGCGCGA
GCTCGACGCG CACGGGATCG AATACGACCG AGTGACGTCC ACGGGGATCG TCGCGCGCGT
CGGACGAGGC GAACGAAGCG TGGGGTTGAG AGCGGATATG GACGCGCTGC CGCTGCGCGA
GGACACCGGG TTGGCGTACG CGAGCGAGAA CGATGGAAAA ATGCACGCGT GTGGACACGA
TGGACACGTG GCGATGTTAC TCGGCGCGGC GAAGGTGATA AAGGCGAGGT ACGACGCGGA
TGAGACGTCC GTGCCGGGAG TGGTGCGGTT CATATTTCAA CCGGCGGAGG AAGGCGGGGC
GGGGGCGAAG GAAATGTTGC GGCCGAGCGA CGGGACGACG GGAATGCTGG ATTTGAAACC
GCCGATTGAA AGCGTGTTTG GATTACATAA TTGGCCGTAT CCTGAAATGC CGAGTGGGAC
GATGGGCACG CGCGGTGGAA CGATCATGGC TGGAGCGGGG AGCTTCGACG TCGTCGTCGT
CGGACGCGGC GGACACGCGG CGGTGCCGCA CAACAACGTG GACGTCATCG TCGCGGGTAG
CGCGATCGTC ACCGCGTTGC AGACGCTCGT GAGTCGATTG ACGGATCCGT TAGATAGCGT
GGTGATCAGC GTCACCGTGT TCAACTCGGG AACGGCGAGC AACATCATGC CAGACACGGC
GAGTTTGCAA GGCACGTTGC GAGCGCTCAA TCCGAAGACT TTTGCGAAAT TTCAACAGAA
AATTGCCGAC ATGGCTTCTG CCATCGCCAG CGCGCACGGT TGCACGGCAG CGACATCTTT
TGAACCTGAG CACAATGGCG TGAAGCGAAT TCCGTATCCA CCGACGGTGA ACGACCCGCG
AGCCGCAGGG TTGGCGATGA ACGTCGCCGC GCAACTGTTC GGAAGCGAGA GCACGCGCGA
CGTCGTGCCA GTCATGCCCG CGGAAGATTT CTCGTTTTTC GGGGAAACTT ACCCATCTGC
CATGATGTGG CTCGGAGCGT ACAACGAAAC CGCCGGTGCG ACGCATCCTT TGCATAGCAC
GAAGTACATT CTGGATGAAA GCGTCTTGAC GTCTGGCGTC GCTTTGCACG CGATGTACGC
GCTCGAATTT CTTCATAGTG GGTTGTAACG AACACTCATT CAATTTCTTC ATAGTGGGTT
GTAATGAACA CTCATTCTAT TTTACAT
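
The GC content field can be recomputed from the gene sequence. The sketch below is one straightforward way to do it, assuming GC content means the percentage of G and C among the A, C, G and T bases. The record does not say how its figure was derived or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces, line breaks and any other characters are ignored, so the
    blocks of ten bases shown above can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First ten bases of the gene sequence above: 8 of 10 are G or C.
print(gc_percent("CCGCGACGCT"))  # 80.0
```

Running the function on the full gene sequence gives a value to compare against the reported GC content.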
 
Protein sequence
MLRALFFLAH VPAALALDAT TLRDIAASSS NVDAREILSQ SRATHDYVVD LRREIHKNPE 
LMWTERATAD VIARELDAHG IEYDRVTSTG IVARVGRGER SVGLRADMDA LPLREDTGLA
YASENDGKMH ACGHDGHVAM LLGAAKVIKA RYDADETSVP GVVRFIFQPA EEGGAGAKEM
LRPSDGTTGM LDLKPPIESV FGLHNWPYPE MPSGTMGTRG GTIMAGAGSF DVVVVGRGGH
AAVPHNNVDV IVAGSAIVTA LQTLVSRLTD PLDSVVISVT VFNSGTASNI MPDTASLQGT
LRALNPKTFA KFQQKIADMA SAIASAHGCT AATSFEPEHN GVKRIPYPPT VNDPRAAGLA
MNVAAQLFGS ESTRDVVPVM PAEDFSFFGE TYPSAMMWLG AYNETAGATH PLHSTKYILD
ESVLTSGVAL HAMYALEFLH SGL
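
The protein sequence can be cross-checked against the gene sequence by translating from the first ATG. The sketch below assumes the standard genetic code (NCBI translation table 1), because the Translation table field in this record is empty. It also assumes that the first ATG is the start codon. `translate_from_first_atg` is a hypothetical helper name.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(sequence: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    seq = "".join(sequence.upper().split())  # drop spaces and line breaks
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "CCGCGACGCT CGAGCCATGC TCAGAGCGCT GTTCTTCCTC GCCCACGTCC CCGCGGCGCT"
print(translate_from_first_atg(first_line))  # MLRALFFLAHVPAA
```

The output matches the first 14 residues of the protein sequence. The first ATG starts at base 17 of the gene sequence, so the listed nucleotides include some sequence upstream of the coding region.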