Gene OSTLU_41082 details


Gene Information

Locus tag: OSTLU_41082
Symbol:
ID: 5002488
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009360
Strand:
Start bp: 594749
End bp: 596473
Gene Length: 1725 bp
Protein Length: 509 aa
Translation table:
GC content: 55%
IMG OID: 640417909
Product: predicted protein
Protein accession: XP_001418279
Protein GI: 145347657
COG category: [A] RNA processing and modification; [D] Cell cycle control, cell division, chromosome partitioning; [L] Replication, recombination and repair
COG ID: [COG5049] 5'-3' exonuclease
TIGRFAM ID:
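
The listed gene length can be checked against the start and end coordinates, and compared with the protein length. The sketch below is only a consistency check. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes a stop codon; the record states neither. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 594749
end_bp = 596473
protein_length_aa = 509

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length_bp = end_bp - start_bp + 1

# Nucleotides needed to encode the listed protein (stop codon not counted).
coding_bp = protein_length_aa * 3

# Bases in the gene span that are not translated into the listed protein.
untranslated_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, untranslated_bp)  # 1725 1527 198
```

The 198 bp difference is consistent with the record marking the gene as spliced, but the record does not give intron positions, so this figure should not be read as an intron annotation.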


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.523535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0326326
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCC CGAAGTTTTT CCGATGGCTC GCCGAGCGCT ACCCGCTGCT CCAGCAAGAG 
ATCGCGGGAA ATCAGATCCC GGGGATCGAT AACTTGTACC TGGACATGAA TGGTGTCATT
CACAACTGCT CGCACGGCGC AGGGACGGAT GTGAATACGC GAATGACTGA AGACGAGATG
ATGTCCAAGG TTTTCGCGTA TTTGGATCAT CTGTTTCGAA TGACGCGACC GAATAAGATG
CTGTACATGG CGATCGATGG CGTGGCACCG CGAGCGAAGA TGAACCAACA GCGGAGTCGA
CGATTTCGAA GCGCGGCGGA GGCGGCGAAG GACCGCGAGG AAGCGCGTGC GAGAGGCGAA
CCGGAGCCGG AGGGCGAACC GTTTGATTCC AACTGTATCA CGCCGGGGAC GGAGTTCATG
GCGCGGTTGA CGGAACATTT GAAATTCTAC GTGCGTAAGA AGCAAACGGA AGATCCACTT
TGGGCAAAGG TGACGGTGAT ATTGTCTGGG CACGAAGTCA GAGGTGAGGG CGAGCATAAA
ATCATGGAGC ACATTCGATG GGCGCGAACG CAGCCGGACT GGGAGCCGAA TCAAACGCAC
TGTTTGTACG GCCTCGACGC CGATCTTATC ATGCTTGCTC TCGTCACGCA CGAACCTCAC
TTTTGTTTGC TTCGTGAGGT CGTGAAGTTT GGCGGCGGCG AGAAAGGGCA GCCGAGCCGG
GAGATTTTGT CCAATCCCAC CGACGACGGC TTCATCTTGT TGCACATCGG CTTGTTGCGC
GAGTACTTGG ATTTAGAATT CCGCGAAAAG AATCTTCCGT TCGGATACGA GCTCGAGCGC
GTCATCGATG ACTTCATCTT ACTGTGCATG CTCGTCGGGA ACGATTTCTT GCCCGCTCTG
CCGACGCTGA ACATCGCCGA GGGCGCGCTG AACACGCTCT TCAAAGTGTA CCACGACACG
TTGCCGATGC TTGGAGGTTA TATCACTGGC GATGAAGGGG GGGGTACTTT TAACCCTGAA
CGTTTGGAGA AGATCATGAG CATCATGGCG ACGTTCGAGC GACAGGTGTT GGAAGAGCGC
GCGATGGATG TCGAGAAGGA AGAGGAGAAG AAGTCTCGAC GAAAAGGTCG CAACGGTGGT
TCCGCGTCCG ATCTCACTCC CGAGGAGAAG TTCGACAAGG ATTTGAGTGA GATGAGCGAC
ACCGAAGGCG TGCCACAAGT TTCTGCCGAC CCGACAATGA TGAACGCTGC GAAGCGGGCG
CTGATTCTCG AAGGTGGCGA GGAGGGCTTG CAAGCGTGGA AGGATACGTA CTATCGCGAA
AAGCTCGGTT TGAAGATTGG CGAAGCTGCA CCGCTGGGTG AAATTAGACA AGCTTATTTC
GATGGTTTGA ACTGGGTCTT GCGTTACTAC TATCGTGGTG TTGCGTCCTG GACTTGGTAC
TATCCCTACC ATTACGCGCC GATGGCGAGC GACTTGTGCG CCGGCATGGG CGGTCTCACG
TCTGAGTTCG ATTACGGCGA ACCGTTCAAA CCTTTCGAGC AGCTCATGGC TGTACAACCA
CCATCGAGCT CCAAGTTACT CCCCGAGCCA TTCCGCCACT TCATGGAAGA TCCGCAGTCG
CCCTTGGCTG AGTTCTTCCC GGAAGACATC AAAGTTGACT TTGAAGGCAA GCGCAACGAC
TGGGAAGGCG TCGTGCTGTT GCCCTTTTTG GACGCCGATC GCTTG
 
Protein sequence
MGVPKFFRWL AERYPLLQQE IAGNQIPGID NLYLDMNGVI HNCSHGAGTD VNTRMTEDEM 
MSKVFAYLDH LFRMTRPNKM LYMAIDGVAP RAKMNQQRSR RFRSAAEAAK DREEARARGE
PEPEGEPFDS NCITPGTEFM ARLTEHLKFY VRKKQTEDPL WAKVTVILSG HEVRGEGEHK
IMEHIRWART QPDWEPNQTH CLYGLDADLI MLALVTHEPH FCLLREVVKF GGGEKGQPSR
EILSNPTDDG FILLHIGLLR EYLDLEFREK NLPFGYELER VIDDFILLCM LVGNDFLPAL
PTLNIAEGAL NTLFKVYHDT LPMLGGYITG DEGGGTFNPE LSADPTMMNA AKRALILEGG
EEGLQAWKDT YYREKLGLKI GEAAPLGEIR QAYFDGLNWV LRYYYRGVAS WTWYYPYHYA
PMASDLCAGM GGLTSEFDYG EPFKPFEQLM AVQPPSSSKL LPEPFRHFME DPQSPLAEFF
PEDIKVDFEG KRNDWEGVVL LPFLDADRL
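
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any length or composition check. The following is a minimal sketch of such a cleanup and a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical. The record does not say how its GC content figure was derived, so this is not presented as the database's method.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First row of the gene sequence above, used only as a short demonstration.
first_row = "ATGGGCGTCC CGAAGTTTTT CCGATGGCTC GCCGAGCGCT ACCCGCTGCT CCAGCAAGAG"
bases = clean_sequence(first_row)
print(len(bases), round(gc_percent(bases), 1))
```

Passing the full gene sequence through `clean_sequence` and checking its length against the listed 1725 bp is a quick way to confirm that no blocks were lost when copying the record.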