Gene OSTLU_5620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_5620 
Symbol 
ID5005690 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009369 
Strand
Start bp238577 
End bp239665 
Gene Length1089 bp 
Protein Length333 aa 
Translation table 
GC content61% 
IMG OID640421111 
Productpredicted protein 
Protein accessionXP_001421753 
Protein GI145354983 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.469951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0101798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCA AGGGCCTGCC CGCCGTCCTC GAGCCGTACT GCGAGCGCGT GCACGTCGGT 
GAATACGCGC CCGGCACGCG ATGCGCGGTG GACGCCTACA GCTGGCTGCA CAAGGGCGCG
TTCGGGTGCG TCGACGCGCT CGCGCCCGGA GGCGATCGCG CGTGGGAGCG ACGACCGGGC
GCGACGGCGC CGTACGTGAA ATACGCCGTG CACCGAGCGA ACATGCTGAG GCATCACGGG
ATCGAACCGG TGATCGTGTT CGACGGCGAC AGAGCGCCGG CGAAGCGAGG CGAGGAACGC
GCGAGACGCG AGCGACGGGC GGCGCTGTTG GAGCGAGGAG AGCGGGCGCG CGCGGCGGGC
GATAAGGAGG GAGCGTTTCG GGCGTTTTCG GGGGCGATCG ATGTGACGCC GGAGATGGCG
AGGGAGCTGA TCGTGGCGCT GAAGAGGGAG AAATTCGAGT TCGTCGTCGC GCCGTACGAG
GCTGACGCGA CGATTGCGTC GCTCGCTCTC ACGGCGAAGG AACGGGGGGG GGTAGATTTA
GTGTTCACGG AAGATTCCGA TCTCGTGGCG TACGGGTGTC CGCGCGTAGT GTTTAAGTTG
GAAAAATCCG GCGATGCGAA GGAGCTGAGG TTGGCGAGTT TGTTTGAAGG CGCCGCGCGC
GCGACGACGA CGACGACTAC GGAAACGCCG AGCGATGAAA ACGTCGACGA CAACGCGATC
GGGCGAGCGA ATAAACCGAA AAGCAAAGGT CCGCCGCCGC TGGATTTCAC TGGGTGGGAC
TACGAATTAT TTCTAAGCTT GTGCGTGTTG TCGGGGTGCG ATTTCTTGGA CAACATTCGC
GGCTTGGGTA TCAAAAAAAT GTACAATATT TTGAACAAAC ATCGATGTGT CGACGCGGTG
TTCGCCGAAT TGAGGGCGAA TGAAAAAATT AAGGATTTGA TCGCGGAAGG GTACGAAGTG
GAGTGGAGAA AGGCACGAAT GATTTTCAAG CACGCGTTGG TGTGGGACCC CCACGCCGGC
GCGCTTCGAC ACCTCACGCC GGTTCCCGAG CATTGCGAGT TCGCGAACGA TTTGAGTTTT
CTCGGGCCG
 
Protein sequence
MGIKGLPAVL EPYCERVHVG EYAPGTRCAV DAYSWLHKGA FGCVDALAPG GDRAWERRPG 
ATAPYVKYAV HRANMLRHHG IEPVIVFDGD RAPAKRGEER ARRERRAALL ERGERARAAG
DKEGAFRAFS GAIDVTPEMA RELIVALKRE KFEFVVAPYE ADATIASLAL TAKERGGVDL
VFTEDSDLVA YGCPRVVFKL EKSGDAKELR LATNKPKSKG PPPLDFTGWD YELFLSLCVL
SGCDFLDNIR GLGIKKMYNI LNKHRCVDAV FAELRANEKI KDLIAEGYEV EWRKARMIFK
HALVWDPHAG ALRHLTPVPE HCEFANDLSF LGP