Gene OSTLU_625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_625 
Symbol 
ID5004232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009364 
Strand
Start bp38805 
End bp40775 
Gene Length1971 bp 
Protein Length657 aa 
Translation table 
GC content54% 
IMG OID640419653 
Productpredicted protein 
Protein accessionXP_001420059 
Protein GI145351381 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TGGACGGAGC ATCGCGCGCG ATCGACGGCG ACGGCGAGCG CTCAGGGGTT ATACGATCTT 
CAAGGTTTGC GCGCGCCCGA AGATTTCGAT CGCTTGGCGC GAGAGACGAT ATCGAAGTGC
GAGGCGATGG CGCGCGCGCT CAAGGCGGCG GCGCCGAGCG CGGCGAGCGT CGGGGCGTTG
GATGAGATAT CTGATGAGGT CTGTCGAGTC GTGGACGTCG CGGAGGTGTG TCGTCACACG
CATCCTTCGC GCGAATACGT CATCGCGGCG GAGAAGGCGT ACGTGCGACT ACAAGATTAC
GTTGCAAGCT TGAACGCCGA CGGCGACTTG TACGAGGCGT TGCGGAGCGC GCGAGAGAGG
GACGCAAAGA ATTTGAGCGA CGAGGGCGCG CGCGTGGCGC TTACTTTGCA AGAAGATTTT
GAACGCGGAG GTATTCATCT CGACCCGGCG AGGCGACGGG ATTTCGACGG GAGCTTGTCG
CGAAGTCTTG AGCTCGGGAT GGAGTTTCAA CGGAATTTGT TGCGCCCAGA GTTAGCGAGC
AGAGTATCGC TCGACAAGAG CACGCTGAGT TCGCTCCCGA AGACAGTGCG AGACCAGTTT
CGAAGTAATG ATGGTAAGCT ATGGACGGGG TTGGTAGATT CATCGAACTC GTCGTTGATG
TTGAGACACC TGAGGGATTC AAACGCGCGC CGAGACGTCT TCATAGCGGC GAACACTGGA
CCCGAGCCAA ACAAAGACGT TTTGGCCAAC CTCATATCTT CTCGTGCTGG TGTCGCGCAT
TCTCTAGGAT TCGAAACATA TGCCAAATAT GCCACCGCAC CGTTGCTTGC TCGATCACCC
GACGCCGTTC GCGAGTTCTT GTTGAGTCTA TCGGATGTCC TTCGGCAGAG CGCAAAGGAC
GAGTACGCCG TCATTGAAAA GTATAATCAC GGCAAACACA TATCGGTGTG GGACAAAACT
TACGCGATGG CACAAGCGCG TGGACACGAG TGCGAATTCA ATTCGGCCGC GATTGCGGAG
TATTTCCCTT TGGAAGGCGT CATCGTCGGC ATCGGCGAGC TTCTTGCGAG AGTTCTCGGT
TTACGCATCG AACTGCAGGA GTTGGCACCT GGCGAAGGAT GGACGAATGA CCTGAAGAAA
CTAGTGGTGA AGACTCGTGA TGGAGACATG CGAGGGACGA TTTATTTGGA CTTGCTCCCG
AGGCCGGGAA AGTTTAATCA CGCCGCACAC TTTGTCATTC GGTGTAGCCG CATGGTGTCG
CCCACAGAAC GGCAGCACCC ATCGGTCGCT CTCGTGTGTA ACTTCCCTCC CGTATCGGGC
AAAGGGCGAT CTTTGTTAAG TCACGGAGAA GTGGAAACGT TTTTACACGA GTTTGGTCAC
GCCATGCACT CTGTGCTTTC GGATACAGAA TTCCAACATT TGTCTGGAAC TCGAGCGCCA
ATGGACATCG TTGAGGTTCC GAGTCATTTA TTCGAGCACT TTGCGTGGGA TCCAAGTGCT
TTGAAGCTTC TTGGAAAACA CTACATGACG CACGAGCCCA TACCTGACGC CATGATTTCC
GCGTTGCGCA AGTCGAGGAA TATATTTCGT TCGATTGAGT CGCAACAACA GGTTGTCTTC
GCGTTGACCG ATCTCGAGCT TCACAACCAA ACATCCGAAC TATCGTCAAA ATCGATCGCT
GATCTCGCTG CAACGATTCA AAACGAGCAC AGTATGTTCA AACCTGTGTC CGGTACGAAT
TGGGAGCTTA GATTTGGTCA CTTTGTCGGC TACGGAGCGA CATATTACTC GTACCTGTAC
GCCGATGTCC TGGCCGATGA CATTTGGAAG CGTTACTTTG AGGGCGACAG CCTCGCGGCG
GGCGCGGCGG AGAGTCTTCG TGACAAATTA TTGCGACACG GGGGATCGAG AGATCCAGAA
AAAGTGATTA GAGATTTGCT AGGAAAGGAT TCGTTGATAG AAGTTAATGG A
 
Protein sequence
WTEHRARSTA TASAQGLYDL QGLRAPEDFD RLARETISKC EAMARALKAA APSAASVGAL 
DEISDEVCRV VDVAEVCRHT HPSREYVIAA EKAYVRLQDY VASLNADGDL YEALRSARER
DAKNLSDEGA RVALTLQEDF ERGGIHLDPA RRRDFDGSLS RSLELGMEFQ RNLLRPELAS
RVSLDKSTLS SLPKTVRDQF RSNDGKLWTG LVDSSNSSLM LRHLRDSNAR RDVFIAANTG
PEPNKDVLAN LISSRAGVAH SLGFETYAKY ATAPLLARSP DAVREFLLSL SDVLRQSAKD
EYAVIEKYNH GKHISVWDKT YAMAQARGHE CEFNSAAIAE YFPLEGVIVG IGELLARVLG
LRIELQELAP GEGWTNDLKK LVVKTRDGDM RGTIYLDLLP RPGKFNHAAH FVIRCSRMVS
PTERQHPSVA LVCNFPPVSG KGRSLLSHGE VETFLHEFGH AMHSVLSDTE FQHLSGTRAP
MDIVEVPSHL FEHFAWDPSA LKLLGKHYMT HEPIPDAMIS ALRKSRNIFR SIESQQQVVF
ALTDLELHNQ TSELSSKSIA DLAATIQNEH SMFKPVSGTN WELRFGHFVG YGATYYSYLY
ADVLADDIWK RYFEGDSLAA GAAESLRDKL LRHGGSRDPE KVIRDLLGKD SLIEVNG