Gene OSTLU_42867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_42867 
Symbol 
ID5003471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009362 
Strand
Start bp384044 
End bp385183 
Gene Length1140 bp 
Protein Length379 aa 
Translation table 
GC content54% 
IMG OID640418892 
Productpredicted protein 
Protein accessionXP_001419147 
Protein GI145349453 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCTTAC CGACGTTCGT GGACATACAC ACACATATAG ATAAGGCACA CACGTGTGAG 
AGGAGCCGAA ATTTAAACGG CACCCTCGCT GGCGCGGACG CCGCGTGCGC GAACGATTTC
CAACACTGGA CGCTCGAAGA CGCCAAGCGG CGCATGGGCT TTGCGATTCA GTGCGCGTAC
GCGTACGGGA CCTCCGCCAT GCGCACGCAC TTGATGTCGG GTGAGGAGAG ACAGAGCAAA
ATCGCCTGGG AAGCGTTCGG GAAGTTACGC GAGGAATGGA GAGGGAAAGT TGAGTTGCAA
GGCGTGTCAC TCTCGGTGTT GAGCTTTTTC CGCGACGAGA CCAAGGCGCG CGCGCTGGCT
CGCATGGTGA AATCTTATGG AGGTATACTT GGCGCCGCCG TGTCGTGTAG CGATGCAGGT
GGTACACCAC TGGATGTCCA CACGACGTGC GGTGCCGATA TGCCCAAGTT ATTGGACGTC
ATTTTCTCTC TGGCGAAGGA ATACAATTTG GATGTCGACT TTCATTGCGA CGAGAACGGC
AATGAATCCT CCAAAGGCTT GCTCCACATT TCTGAGGCCG TCATTCGAAA CAATTTCAAA
GGATCTGTGG TGTGCGGTCA CTGCTGCAGT CTCGCAGTTC AGCCAAACGA ACAGGCAAAG
CGCATCATCG ACGCCGCCCG CGAAGCGGGC GTCACCGTTG TTTCGCTTCC AATCGTTAAC
CAATGGCTTC AAAATCGTGA TCCAACGAAC GAAGCGACGC CAACGCGGCG AGGCGTTACT
CGGGTGAAGG AGCTTGCCCG AGCTGGCGTG CCGGTGTGCC TTTCGAGTGA TAATACGCGA
GATCAATTTT TTCAATACGG TGACTGTGAT ATGCTCGAGG TATTTCGATC GAGCGTTTGC
ATCGCGCATC TCGATCGACC TTTTGGATCG TGGCCGTTGG CTTTAGCTGC GAACCCATCG
CGCGCGATGC GACTTGGCGA AAAGTCGGGA ATGATTGCTC CAGGAGCGAA GGCAAACTTT
GTCTTGTTTC GCGCTAGGAA TTACAGCGAG CTGTTTTCTA GATCGCAACA TGATCGAGTC
GTCATTCGTG ATGGTGTCCG CATCGCTACT GCTTTACCAG ATTATGAAGA GCTAGACTAA
 
Protein sequence
MCLPTFVDIH THIDKAHTCE RSRNLNGTLA GADAACANDF QHWTLEDAKR RMGFAIQCAY 
AYGTSAMRTH LMSGEERQSK IAWEAFGKLR EEWRGKVELQ GVSLSVLSFF RDETKARALA
RMVKSYGGIL GAAVSCSDAG GTPLDVHTTC GADMPKLLDV IFSLAKEYNL DVDFHCDENG
NESSKGLLHI SEAVIRNNFK GSVVCGHCCS LAVQPNEQAK RIIDAAREAG VTVVSLPIVN
QWLQNRDPTN EATPTRRGVT RVKELARAGV PVCLSSDNTR DQFFQYGDCD MLEVFRSSVC
IAHLDRPFGS WPLALAANPS RAMRLGEKSG MIAPGAKANF VLFRARNYSE LFSRSQHDRV
VIRDGVRIAT ALPDYEELD