Gene OSTLU_10035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_10035 
Symbol 
ID5005257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009368 
Strand
Start bp696437 
End bp698587 
Gene Length2151 bp 
Protein Length717 aa 
Translation table 
GC content57% 
IMG OID640420678 
Productpredicted protein 
Protein accessionXP_001421360 
Protein GI145354159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGAACGCGC CGGAGAAACC TCGAGCGAGG CGCGAGACGC GTCTCGTAAA GTTTGGCGTC 
GTCGAGGGCG AGAACCGCGG CGAGGACGCG ATGGATCCAG CGCGGACGAT AGAGGACGAT
TTGTTTTGGT TGCGAGACGA CGAACGGAAG AGCGATGAGA TTTTAAAGCA CTTGGAGCGG
GAAAATGCGT ACACGGAAGC GCACACTGAA CATCTGAAAA AATTGGAGAC GAAGATTTAC
GAAGAAATCA TCGCGACGAT TGAAGAGACG GATGTGGACG TGCGCTTCGC GTGGGGCGAT
CGTCACGAGT ATTGGGTCAA GACGCAAAAG GGCAAGGCGT ATCCGATCAT TTATCGATGC
GAGCGCGCGA ATGGAAAGCA CGTGGAGAAG GTTTTGGACG TCAACGAGGT CGCCGCCGAG
ATGAAGTATT GCTCGCTCGG CGGGTTCAAG CCGTCGCCCA CGCACGACGT GTTAGCGTAT
GCCATCGACG CCACGGGGTT CGAAACGTAC ACGGTTCGGT TCAAGGACTT GAATACGAAT
GAGCTCTTGG ACGACGTCCT CGAGGGATGC GCGGGCGGCG TGAGCTGGGG AGGCGGCGTC
AATCGCGAGG TGTATTACTC GACGATGGAC GACGCTCATC GCCCGGACAA GGTGTGGCGC
CACACCATGG GAACGCCGCA ATCGAGCGAC GTGTGCGTGC TCGAAGAGCC TGACGAGTTG
TTTAACGTCG GGTTCTCGCG AACGTCGAGC GGACGATACA TGATGCTCGA GTCTGAATCC
ACGGAAGAAA ACGAGTGTTA TGTCATCGAT TTGGACGCCG CGGACGCGGC GCCGGATGTG
CGTCTTGTGC AAAAGCGATC GCCACTCCAT CGGTACTACT TGGAACATCG AGGCGATACG
TTTTACGTGC TGACGAACAA GGACGAGAAG ATAAATTTCG AGCTCTTGAT GACGCCCGTC
GACGCCTTAG GGCAAGACAA CTGGGTGCCA GTCGTCGACG GCGCCGGCGC GCCGGTGTTC
GCGTACGACG ATAAACGCAC GCTCGAAAGT TTCTTCGCCT GTAAGAACCA CCTCGTCATC
GATGGTCGCG AAAACGGCTT CAGCGCGATG TGGGTCGTGC GTCTGAGCGA GGAAAGCGGC
GAGGTGGTCG ACTGGCACAA GACGGAGTGG CCGAGCGAAA ACGCCCTCGT GTACCCAAGC
GTCGCCGGTG AGACGCTTCG ATGTGTCGGC GCCAATCAAG TGTGGGACAC AAACGAGATT
TACGTGTCGT ATTCCTCGCT CAATCAGCCG CGCACAGTGT ACAAGTACGA CATGAACACC
AAGTCAAAGA AGGAGATGAA AAAGACTCCG GTGAAGGGCT TCGATACGAG TAAGTATACA
ACGATGCGAC TCGAAGTCAC CGCGCGCGAC GGCGTCAAGG TTCCGGTGTC CATCGCGTTT
AAAACGAATT TGCGAGCGCA CCGAGGACCA CTGTTGTTGG AGGGTTACGG CTCTTACGGT
ATAAGCAACG ATCCGGCTTT CATGCGCACG GCGGTGCCGC TTATGGACCG CGGCGTTACC
ATCGCTATAG CGCACATTCG TGGTGGTGGA GAGATGGGAC GCGAGTGGTA CGAGAAGCAA
GGCAAGTACT TCACTAAGCT CAACACGTTT CATGATTTCA TCGACGTCGC CGAACATTTA
GTGAGCACGG GGTGGACACA ACCGAGCAAA CTCGCGATTA GCGGTCGGAG CGCCGGCGGT
TTGCTCATGG GGGCCACGCT CAACATGCGG CCGGACTTAT TCCGATGCGT CGTTGCGGGC
GTTCCGTTTG TGGACGTCAT GGTGTCCATG TGCGATCCCT CGATTCCCTT GACCACGGGC
GAGTGGCTGG AGTGGGGTAA TCCGAACGTC GAGAAATATT TCGATTACAT GATGAAGTAC
GCGCCTATGG AAAACATACG ACCCATGGAG GTTGCCCCGG ATGTGCTCAT CACCGCGGGA
TTGTACGATC CGCGCGTCGC GTATTGGGAA TCCGCCAAGT ACGCCGCTCG TTTGCGCGAT
GCCGTCAAAA ATGGTGCGCG CGTGTTGCTC AAGACAGATC TCAGCGCCGG TCACTTCAGC
GCGAGCGACA GATATCAGCA CTTCAAGCAA ACCGCGTTCG AGCACGCGTT T
 
Protein sequence
ANAPEKPRAR RETRLVKFGV VEGENRGEDA MDPARTIEDD LFWLRDDERK SDEILKHLER 
ENAYTEAHTE HLKKLETKIY EEIIATIEET DVDVRFAWGD RHEYWVKTQK GKAYPIIYRC
ERANGKHVEK VLDVNEVAAE MKYCSLGGFK PSPTHDVLAY AIDATGFETY TVRFKDLNTN
ELLDDVLEGC AGGVSWGGGV NREVYYSTMD DAHRPDKVWR HTMGTPQSSD VCVLEEPDEL
FNVGFSRTSS GRYMMLESES TEENECYVID LDAADAAPDV RLVQKRSPLH RYYLEHRGDT
FYVLTNKDEK INFELLMTPV DALGQDNWVP VVDGAGAPVF AYDDKRTLES FFACKNHLVI
DGRENGFSAM WVVRLSEESG EVVDWHKTEW PSENALVYPS VAGETLRCVG ANQVWDTNEI
YVSYSSLNQP RTVYKYDMNT KSKKEMKKTP VKGFDTSKYT TMRLEVTARD GVKVPVSIAF
KTNLRAHRGP LLLEGYGSYG ISNDPAFMRT AVPLMDRGVT IAIAHIRGGG EMGREWYEKQ
GKYFTKLNTF HDFIDVAEHL VSTGWTQPSK LAISGRSAGG LLMGATLNMR PDLFRCVVAG
VPFVDVMVSM CDPSIPLTTG EWLEWGNPNV EKYFDYMMKY APMENIRPME VAPDVLITAG
LYDPRVAYWE SAKYAARLRD AVKNGARVLL KTDLSAGHFS ASDRYQHFKQ TAFEHAF