Gene OSTLU_30367 details

Gene Information

Locus tag: OSTLU_30367
Symbol:
ID: 5001059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009357
Strand:
Start bp: 70436
End bp: 72229
Gene Length: 1794 bp
Protein Length: 597 aa
Translation table:
GC content: 56%
IMG OID: 640416480
Product: predicted protein
Protein accession: XP_001416548
Protein GI: 145344042
COG category: [L] Replication, recombination and repair
COG ID: [COG0272] NAD-dependent DNA ligase (contains BRCT domain type II)
TIGRFAM ID:
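
The coordinate and length fields above agree with each other, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon (neither is stated on this page); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(70436, 72229)
print(length_bp, protein_length(length_bp))  # 1794 597
```

Both results match the Gene Length and Protein Length values listed for this record.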


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00270274
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0884002
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAATG GTAAAGGGCG TTCGCCGCGC GCGCGCGCGG CGCGAGCATT GACGACGTCG 
TCGCACGCGC GCAGGCTCCT CGTGAAGCGC GCGTTCAACA GACGCGCGTT GGAGACGCAC
CCAGACAAAG GAGGCGACGC GGACGCGTTC CAAGCGATGT ACGAGGCGTT TCAAGAGCTC
AAACAAGCGT TTCTCGATGG CTTCTCGGAA TTCGCGAAGT GCATCGAATC GGCGCCGGCG
AAGGCGAAGG CGAAGGCGAA GGCGAAACGA ACGATGGTAG ACGCGCCAGA ACTCATGGGT
AAGCGACGGA CGATTCCACC GTACGAGTTC TTTCAAACCG CCGCTGAACG AGACATTCCG
TTCTGGAAAA TTGAGTTGGC GGCGTCGGGC AGAGCTCGCT GCACCGGCTC GCGCAAGTCC
CTCGGCAGCG TCGGTCGCTG TCGAAACACC AATGAAGGAT CGAAGGTGGA TCAGCCGAGA
AGCGTGGGTC CCGGTCCCGG TCCCGGTCCC GGGTCATCGA GAGCGCTCGT CGCCGACGTG
TTCGGAATCA TCAAGAAGAA CGCTGTGCGC GTGGGCTTAA TGGATCCAGA ATCTGGCGCG
TACTGCCGAT TCGTTCATTT GCGATGTTGG CGTGTGCCTC AGGCGGTGTG GTACAATCTT
CCGCAGTACC CCGACGATAC GACTGAGTAT AGCGTCGATG AGTTCAAAAG ACGCTTGGTC
GATATGGACG GCGTCGTCTT CGTCGGGTTG AACAAACTTT GCGATGAAGA CATCACGCTC
GTCGCCGAAC ACGCGATGAA TCAGGAGGCG TGGGCGGCGT TCAATCAGAA GAAATATGTC
GCCGCTCGAG CTGCTGGCCT TCCGTCATCG CGGGTTCAAC TTCTTTCGAA AGACGAAATC
AAAGTACTGC AGAAACTAAT GTCGAAGGCG ACGTCTTTAG TGCAAGAACC AGAGCAAGAC
GAACCGCTCG CCGCGGTGAC GAAGCAGGAA TCAAAACCGG AACCAAAACG AACGGACCAC
GATGAGTACG ACGTCGTGAC TGATGACGAC GAACACGAAG ATGGCTTGCA AGTGGCGCCG
AATGTCGGCA CCGCGATCGC CGTGCAGAAA CCAAAAGAGG AGAAGCCACC ACCGAAGATT
TTGATTCCCG TCCCTGGCGA AAACGGCGCA ATCGCCAACT CACTTCTTCG CGACAATTCC
AAACCCATGA CGTTCGTGCT GACGGGAATT TTTCCGGATC TCGGAGGCGA TCGAATCGGT
CTGATGCAAG GCAAAGACAT GGCGGAAAAG CTGATTCAGC GGTTTGGAGG CGTCGTGCGG
AGCGCGGTTT CAGGCGCAAC TGATGTTTTG CTCGTCGGCG AAGAGCCAGG CGTTTCGAAA
CTATCAGCTG CGCGCACGAA AGCGAACTGC AAAGTTCTCA ATCTCGAACA CATCCTCGAC
ATGATTCACG GGCGCTCCAT TGAAGGGAAA CGGTTGCAAA TTGAAAGATT CAGCACAGGT
TTTCATGGAA ATTCGTTAGC GTACGATATG ACGCGAGCCG AATTGGAAGA GCTACGGACG
GGAGCCAAAG CGCTTCCTTC GACGATGAAA ACAGAGGAGC CACAACTTCT ACCATCGTCC
AAGTCGACAA CGCGTAAGCG CAAAGCTCTT CCTTTGTCCA CGAAAGACGA AGCGCAAAAG
AATTTGACGT TACCCGCACC TGATAAGCCG CCGCGAAATA AGAAGCCAAA ACGCTCGCCG
AAAGCGCCCG TGCGAAGCTC GCGTCGTCTC GCCGCACTGA CGAACGCCGC GTGA
 
Protein sequence
MLNGKGRSPR ARAARALTTS SHARRLLVKR AFNRRALETH PDKGGDADAF QAMYEAFQEL 
KQAFLDGFSE FAKCIESAPA KAKAKAKAKR TMVDAPELMG KRRTIPPYEF FQTAAERDIP
FWKIELAASG RARCTGSRKS LGSVGRCRNT NEGSKVDQPR SVGPGPGPGP GSSRALVADV
FGIIKKNAVR VGLMDPESGA YCRFVHLRCW RVPQAVWYNL PQYPDDTTEY SVDEFKRRLV
DMDGVVFVGL NKLCDEDITL VAEHAMNQEA WAAFNQKKYV AARAAGLPSS RVQLLSKDEI
KVLQKLMSKA TSLVQEPEQD EPLAAVTKQE SKPEPKRTDH DEYDVVTDDD EHEDGLQVAP
NVGTAIAVQK PKEEKPPPKI LIPVPGENGA IANSLLRDNS KPMTFVLTGI FPDLGGDRIG
LMQGKDMAEK LIQRFGGVVR SAVSGATDVL LVGEEPGVSK LSAARTKANC KVLNLEHILD
MIHGRSIEGK RLQIERFSTG FHGNSLAYDM TRAELEELRT GAKALPSTMK TEEPQLLPSS
KSTTRKRKAL PLSTKDEAQK NLTLPAPDKP PRNKKPKRSP KAPVRSSRRL AALTNAA
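
The gene and protein sequences above are printed in blocks of ten characters, so they need cleaning before any computation. The following is a minimal Python sketch of how the blocked gene sequence could be cleaned, checked for GC content, and translated. It assumes the standard genetic code (NCBI translation table 1), because the Translation table field on this page is blank. The helper names `clean`, `gc_fraction`, and `translate` are illustrative, not part of any database tool.

```python
BASES = "TCAG"
# Standard genetic code (NCBI table 1, assumed), codons ordered TTT, TTC, TTA, TTG, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the 10-per-block layout."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_codons = "ATGCTGAATG GTAAAGGGCG TTCGCCGCGC"
print(translate(first_codons))  # MLNGKGRSPR
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.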