Gene OSTLU_43289 details

Gene Information

Locus tag: OSTLU_43289
Symbol:
ID: 5005318
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009368
Strand:
Start bp: 355873
End bp: 357543
Gene Length: 1671 bp
Protein Length: 401 aa
Translation table:
GC content: 56%
IMG OID: 640420739
Product: predicted protein
Protein accession: XP_001421452
Protein GI: 145354354
COG category: [A] RNA processing and modification
COG ID: [COG5239] mRNA deadenylase, exonuclease subunit and related nucleases
TIGRFAM ID:
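
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding region is the protein length plus one stop codon. The variable names are hypothetical and not part of the record. Under those assumptions the gene length matches the listed 1671 bp, and the bases left over after the coding region are consistent with the gene being marked as spliced.

```python
# Values copied from the Gene Information fields above.
start_bp = 355873
end_bp = 357543
protein_length_aa = 401

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1
print(gene_length)  # 1671

# Assumption: coding region = one codon per residue plus one stop codon.
coding_bases = (protein_length_aa + 1) * 3
print(coding_bases)  # 1206

# Bases not accounted for by the coding region (plausibly intronic,
# since the record lists the gene as spliced).
non_coding_bases = gene_length - coding_bases
print(non_coding_bases)  # 465
```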


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGGAT ATCCGCTTCC AGCGACCGCG CACGGATTGG AATTTTGCCA GCACGACGAA 
GTCGACTGGC GATGGTACGC GGACGGCGAG ATCGTCGGCC AAGGGCGCGT GTACACGCCC
ACAAAAGAGG ATTTCGGTAA AAGATTAGCG GTTGAGGCTT TGGACGAGAG GTTTGAGTTT
ACGAACGTTG TGACGCGGCT GGGCGTTGAT CGGTCGGAGG CGTTGCGACG GTTGGAGCCG
AGCGCCGAGA CGTCGGCGGA TTATCGTATC ATGACGTACA ACGTGTTGGC CGATGCGTAC
GCGCACACTT GGGGAACGAT GTTTCCTTAC TTTGACACGG CGTTGGCTAA AGTGGAACGA
CGCTTGCAGC TAGTATTGGA AGATATCTTA CGATCAAAAG CCGACGTCGT GGCGCTACAA
GAAGTGGACA AGAAGTACCA CGAAACGTTG TTCGTGCCCG TACTCACTGC AAACGGATAT
ATCGCGACAG ATTGGGTCGG TAAGAGCGGT CAGACGTTGG AAGGATGCGC GATGTTCTTC
GCATTGAGCA AGTTTGAAAG CATCGAGCGC GAAGAAGCGA TCAAACTCAC GGAAATCGGA
GACAAAGCTT TGCGGCGCTG GATCGCGGAT GACGATAACG CAGAGTTGGC TATGGCTCTC
AAAAAGATCA CGTCCATTGC GCAGCTCGCG CGCGTGAAAG TACGCGCGAG CGGCAAGTCG
TTGTGCGTCG GTAACACTCA CTTATTTTTT CACCCAGGGG CGATGCACTT ACGAGTCTTG
CAAGCTCACG AATTTACGAC TCGGGCAACG GCGTTCGCCG CCGGCGATCC ACTCGTATTG
TGCGGCGATT TTAACGGTGA GCCTGAAGAT GGCGTCATCA GATATCTCAC CAAAGGCGAG
ATTTCCGCGA GTGACGAGGA TTGGGTTCGT GGTAGTCTAT TTCGGTGGGG CGGGACGTCA
TCGCGAGACG CCGCTCGAGA TCTCTTCTAT ATTTGTGACG ATGGTACAGG ATACGAGACA
TTCAATCGCA ACTTTGATGA AGTCACCAAT ACCTTTGAGT TACGAGCGCT CGCAGAACGA
GGTATTTTCA TGTCTCTTTG CGCTCGTACG CTGGATAAAG AGTGGGGCGC GTCGTGTAAA
TGCGACGAAA AATCGAGCGA CGACGCAAAC GCAATCGCAG CCGTGAAGAA GCACGTCAAA
TGCGGATGCA CGTTGAAAAA CTGTCACGCC GTCGCTGCAT TCACACTTCG ACGCGAGAGT
GGACTCGCGC CTGGGTTAAC GATGGATGCC GACGACCACG TGATTGCTCG TCGCGCGTTT
AGTGATATTA AGCGAGCGCA GGATGACGGC TTCGCCGCCG TCTCGGCGAA GCAAGCCGCG
CTCGCTTCAT CCGTCGTCCC CGACGCCAGC GCGGTCCCGA TCGGCTGTGG CGCACACTTG
CGAACCGCGC GACCACTGTT CTCTGCCGGT GGATTCCTCG AATGGACGAA TTACGTCGGC
GGATTCGTCG GCGCGTTGGA CTACGTGTGG TGTTCGACGA GCGATTTCGC GTCGCGCGCC
ACATCGCCGC TCCCCGACAT GTCCGCCGTC CTCGCGCACA CCGCGTTACC AAACGCGCAA
TTTCCATCAG ACCACATCCC CGTGATCGTG GACGTCGACT TAGTTAACTA G
 
Protein sequence
MVGYPLPATA HGLEFCQHDE VDWRWYADGE IVGQGRVYTP TKEDFGKRLA VEALDERFEF 
TNVVTRLGVD RSEALRRLEP SAETSADYRI MTYNVLADAY AHTWGTMFPY FDTALAKVER
RLQLVLEDIL RSKADVVALQ EVDKKYHETL FVPVLTANGY IATDWVGKSG QTLEGCAMFF
ALSKFESIER EEAIKLTEIG DKALRRWIAD DDNAELAMAL KKITSIAQLA RVKVRASGKS
LCVGNTHLFF HPGAMHLRVL QAHEFTTRAT AFAAGDPLVL CGDFNGEPED GVIRYLTKGE
ISASDEDWVR GSLFRAVPIG CGAHLRTARP LFSAGGFLEW TNYVGGFVGA LDYVWCSTSD
FASRATSPLP DMSAVLAHTA LPNAQFPSDH IPVIVDVDLV N
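
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done and how a GC fraction could be computed from the result. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input; running it on the full gene sequence would be the way to compare against the GC content field in the record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and newlines alike.
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGTGGGAT ATCCGCTTCC AGCGACCGCG CACGGATTGG AATTTTGCCA GCACGACGAA"

seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of this one line only
```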