Gene OSTLU_87686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_87686 
Symbol 
ID5003020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009361 
Strand
Start bp105539 
End bp106747 
Gene Length1209 bp 
Protein Length402 aa 
Translation table 
GC content58% 
IMG OID640418441 
Productpredicted protein 
Protein accessionXP_001418835 
Protein GI145348807 
COG category[L] Replication, recombination and repair 
COG ID[COG1525] Micrococcal nuclease (thermonuclease) homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.230119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGCT TTCACGGAAC TTTCATTCGC AACTGCATCG TCACGCACGT CATCGACGGT 
AAGACCATCC GCGTGCTTCT CAACCCCGAC GGCGGTCAGC AACGCTCGTC CACGGGCGAC
GGTTGGGAAG ACTCGCTCGT CGTGGGCAGC GAACCCACGT CTCGCATACC GAGCGACGAG
CTCGACGCGT CGAGCGTCGA CCCGACGACG TGGGAGAAAG TGGACGTGCG GTTGATCTAC
GTCGACACCG AAGAATCGCT CGATGTGAAG AAGGAGGAGC AAGCGTACAA ACCGATCACG
CCGGCGGGCG TGGAGGCGTT CGATTGGCTC AAGAAGCGTC TGGGATCGGC GGCGGATGGT
CGATGCGGCG ACGTCGAGGT GGACATCGAA TTCGACACCT GCGAGTTCAT GACTTCGGTG
TCTCGCGCGC GCGAGTACTC TCTCGATAAG TACGGACGCG TTCTCGCGTA CGTATATCAC
AACGGCAACA GCGTCAACGT GGAAACCGTG CTGGCTGGTC AGTCTCCGTA TTTCACGAAG
CACGGCAGAT CGAGACTCTA TCACGGTGAA TTCGCGCTCG CCGAGAAGCT GGCGATTGAG
AACATTCGAG GAATCTGGGA CCCAGCCGGA GCGTCTCTCG CATCGTTTGG CATGTTAGAG
TACAGCCGGG ACTACCGACG ATTGTTGCCT TGGTGGAGAG AGCGAGAATT GTTCATCGAA
GACTGGCGTC ACTGGGGACA TCTCGGGCTG ACGAAGGACA TTCTGAACCC GCGCGACGCG
CGAGATTACC AAAAACTTCT AGTCGCCGCG GCTGCGCGCG AAAAGGCGAC AATTTTAGTC
GATCTTCAGC CGACGCAATC CAATCTTTAC GACGGCGTCA TGCAGCTCAT CCGATACGAA
GGCGGGAAGC AGGGATTGTG CATTTTCGCC GGCACGCGAC GATATCCGTT TAACCTTTGG
ATGGACGACG CGAACTCGAT GGAAAGCGGA CGACTTCAGG CTTTACTCCA CGCGCGCTAC
TGTCAGAACG CTCGCAACTT TTGCTTCATC ACGGGAAGTC TCTTCATCTT TCACGCTAAA
AATCGACCGC AGATGCTTTT GGAATCGTGC GAGCAAGTCA GCGATTTCCC GTTGCGTCCG
GATGACATGT CGCAAAAGCT TCGAGGGCAC CACCGCGCGG GCGCCGCCAA GTCCTCGCCC
GCGGCGTGA
 
Protein sequence
MHSFHGTFIR NCIVTHVIDG KTIRVLLNPD GGQQRSSTGD GWEDSLVVGS EPTSRIPSDE 
LDASSVDPTT WEKVDVRLIY VDTEESLDVK KEEQAYKPIT PAGVEAFDWL KKRLGSAADG
RCGDVEVDIE FDTCEFMTSV SRAREYSLDK YGRVLAYVYH NGNSVNVETV LAGQSPYFTK
HGRSRLYHGE FALAEKLAIE NIRGIWDPAG ASLASFGMLE YSRDYRRLLP WWRERELFIE
DWRHWGHLGL TKDILNPRDA RDYQKLLVAA AAREKATILV DLQPTQSNLY DGVMQLIRYE
GGKQGLCIFA GTRRYPFNLW MDDANSMESG RLQALLHARY CQNARNFCFI TGSLFIFHAK
NRPQMLLESC EQVSDFPLRP DDMSQKLRGH HRAGAAKSSP AA