Gene OSTLU_30712 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_30712 
Symbol 
ID5000974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009357 
Strand
Start bp632283 
End bp633323 
Gene Length1041 bp 
Protein Length346 aa 
Translation table 
GC content65% 
IMG OID640416395 
Productpredicted protein 
Protein accessionXP_001416716 
Protein GI145344390 
COG category[F] Nucleotide transport and metabolism
[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0590] Cytosine/adenosine deaminases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.516627 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGCGC GAACGTCGCT GGACGCGCGC GCGCCGCGCG GGATCCCGGC GAAGGCGCCG 
CGGGACGCGG ACGCGGCGTT GCCGACGCGA AGATGCGTCG TGGCGACGGG AATTCCCGCG
CGCGCGACGT CGGCGGTGCT CGCGAGCGCG CGGGCGGCGG CGCCGCTTCC GGCGTCGTTG
GCGCACGTGA AGCGCGCGCG GGCGAGCGCG CGGACGAAGG GGACGACGGA GGTGGTGGTG
AAGTTGGCGA GCGAGGGCGC GAGCGCGAGC GCGCGGGGAG ACGCGGAAAG CGACGCCGCG
GTGCGCGCGG ACGTGCTGGA ACGTCACGCG GATGTGATCG CGAGCGTCGT GTACGCAGAC
GTGCCCGCGG AAGGGCCGGA GGATAGGGAG ACGTGGGAGA AGGCGTGCGC GATTTGGCCG
GTGAGTTTGA CGGCGCCGGC GGAACGCGAG ACGGAGACGC CGAGCGACGA GGAGGCGGCG
TATTTTAGAA AGTGGACGAA GCAGGCGTGC GAGGGGGCGA AAATGAGTGG AAATTGTGCG
ATTATAGTTG ATCCAGCGCG TGATGTTGAG ATCGCCCGGG GCGTGGATGA GTCGGCGACG
CATCCGTTGC GACACGCCGT CATCGCCGCG GTCGATCTCG CGGCGAAGCG GGACGTCGCG
ATGTATCCGG AAAAGGAGCA CGTAGAGGCT TTGATCGAGG CGAGACGGAT GGAAAAGCTC
GAACGCGACG CGCTCGAGAT CGCGGGGGTT GGAGACGACG CAAAAAAACG GAAGCGCGAA
GTACAAACGA AGGGCTCGGC GATGACAGAA ATCATGGGTC GCCCGTACCT GTGCACGGGA
TACGACGTGT TTTTAGCGCG AGAGCCGTGC ATCATGTGCG CGATGGGGCT CGTGCATTCG
AGACTGAAAC GCGTGGTATT TGCCGTGTGC GATAATATCA ATGGCGCGCT CAGCGGACCG
AGTGGCATTC GCCGTCTACA CGGCGTACGG AGCTTGAATC ATCATTATAG CGTGTTTTCG
TTCGATGCGG AAGAGATTTA G
 
Protein sequence
MPARTSLDAR APRGIPAKAP RDADAALPTR RCVVATGIPA RATSAVLASA RAAAPLPASL 
AHVKRARASA RTKGTTEVVV KLASEGASAS ARGDAESDAA VRADVLERHA DVIASVVYAD
VPAEGPEDRE TWEKACAIWP VSLTAPAERE TETPSDEEAA YFRKWTKQAC EGAKMSGNCA
IIVDPARDVE IARGVDESAT HPLRHAVIAA VDLAAKRDVA MYPEKEHVEA LIEARRMEKL
ERDALEIAGV GDDAKKRKRE VQTKGSAMTE IMGRPYLCTG YDVFLAREPC IMCAMGLVHS
RLKRVVFAVC DNINGALSGP SGIRRLHGVR SLNHHYSVFS FDAEEI