Gene OSTLU_43971 details


Gene Information

Locus tag: OSTLU_43971
Symbol:
ID: 5004500
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009365
Strand:
Start bp: 570214
End bp: 573162
Gene Length: 2949 bp
Protein Length: 962 aa
Translation table:
GC content: 60%
IMG OID: 640419921
Product: predicted protein
Protein accession: XP_001420386
Protein GI: 145352079
COG category: [K] Transcription
COG ID: [COG0557] Exoribonuclease R
TIGRFAM ID: [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
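
The coordinates and lengths listed above agree with each other if the start and end positions are read as inclusive, 1-based coordinates: 573162 - 570214 + 1 = 2949 bp. A 962 aa protein accounts for 2886 coding bases plus a 3-base stop codon, which leaves 60 bp of the gene span unaccounted for. That is compatible with the record marking the gene as spliced, although the record itself gives no exon boundaries. A minimal sketch of this check follows; the function names are hypothetical and chosen only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def noncoding_bases(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases in the gene span not explained by the codons plus one stop codon."""
    return gene_length_bp - 3 * (protein_length_aa + 1)


# Values copied from the record above.
gene_len = span_length(570214, 573162)
print(gene_len)                        # 2949
print(noncoding_bases(gene_len, 962))  # 60
```

A non-zero remainder from the second function is only a hint: it says the span is longer than the coding sequence, not where the extra bases lie.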


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCGCG TCGGGAAGAC GTTCGTCAAG ACGACGAAGC GCGGGCAGGC GGTGACCAAG 
GTGCGACGGG AGCGGTACCT GCGCGACGAC GTGTACGTCG GATGCGGGGA CGCCGCCGTC
GAGGCGCGGT ATCGAGGACG CGACGAGGCG CTGTACGTGC TTCGACCGGA GCGCCGAGGC
GCGAAAAGCG AGCGAACGAT CGCGCTGATC GATTCGAACG TCGCGCTGCA TCAGATGGAC
GCGCTGGCGG ACGCGAGGGT GACGGATGTG GTGATTTGCG CGACGGTGAT GGAGGAGACG
CGAAACAAGT CGCGGACGTC GTACGAGCGG TTACGAGGAT TGTGTAAGGA TAAAGAGAAG
CGGTTTTACG TGTTTAGTAA TGAGAATCAC GCGGAGACGT ACGCGGAAGA CGTCGCGGGG
GAGTCGGCGA ACGATCGGAA CGATCGAGCG ATACGCAAGG CGGCGGCGTT TTATCGCCGA
GCGATTCCGC GCGCGGCGTG CGAACGCGTG ACGCTCGTGA CGAATGATCG AGGAAACGCG
AGAAAGGCGC GAGAGGAGAA TATAGACGTG ATGAGCGTGA TTGAATGGAT ACAGTCGATC
TCAAACGACG CGAGCTTGGT GGATTTGGTC GTCCCGGGCG GAGGGGCGGA GATTCCCGAG
GACGAACGCG ACGCCAAGCG AGCGAAGAGC GGCGGCGGCG CGAGCGCGAG CGTTTTCGAG
GCGTATCTCG ACGCGAATGA CATCAAGGAA GGTTTAGCGA GTGGAAAATT GATCAAAGGC
GGCGTTCGAA CGTCTAGATA TAACCCTTTT GAGGCGCACG TGATGAATGA AGGTACCGGA
GAAGGCGTAC TGCTGAGCGG ACGCGCGGCG ATGAATCGCG CCATCGACGG CGACTTGGTC
GCCGTGGAAA TCTTGCCCGA ATCCCAATGG ATTCGTCCGG GTGGTATGAT CATCACCGGT
ACCGCAGAAG AAGACAAAGA GACGTCCTCC GCGGACGTCG ACGCGAAGGA CGACGCCGGC
GGACTGGCGC CGGAGACTGC GGATGAAACG GACGTCGCTG TCGCGGCGAC GAAGGCGTCG
GGCGTCGTTC CCACGGGAAA AGTTGTAGGT ATCATTCGGC GCAACTGGCG CGAACGCGGT
TATGCGGCGT CGGTCGACAT GGGAAGAGAC GGCAAAGGTC CAACCGCAGC CGGTGGCGGC
TTCGCATCGC GCGTCTTGTG CGTGCCGAGC GATAGACGAT TTCCAAAGAT TCGCATCCAG
ACGCGCCAAC TCGAAGGCTT GCTTGACCAG CGCATCGTCG TCGTCATCGA CGATTGGCCC
GCGGATAGCA TGTACCCCGA AGGGCACTAC GTCAAGTCGC TCGGATTGAT CGGTAGCGTC
GATGCCGAAA CACAGGCACT TCTGCTCGAA AACGACGTTG ACGATCGACC GTTCGCGCCG
GCGGTGTACG CGTGCGTGCC CGCCTTGCCG TGGAAAGTCA CCGACGACCA TCTCGCCGAA
CCCGGTCGCG AGGACTTGCG CGATTTATTG GTGTGCAGCG TCGATCCTCC GGGGTGTAAA
GACATCGACG ATGCGCTGAG CGCGCGCGAT ATCGACGACG AGCGCATTGA AATCGGCGTG
CACATCGCTG ACGTGACGTC GTTTCTGTTC CCAGACACCG CCATGGACGA AGAGGCGGCG
CGTCGAGGGA CGACGACGTA CTTGGTGCAA CGCCGTTTGG ACATGCTTCC GGGAGCGCTG
ACGACGGACA TATGCTCTTT GGTTGGAGGT CAAGAGCGCT TGGCGTTTTC GGCGTTTTGG
ATCGTGCAAA AGTCCACGAT GCTTCCGGAT GAAACCGTCA AACCACGATT TACGAAGAGC
GTCATCAAGA GCGCGGCGGC GTTGACGTAC GAGCAAGCGC AGACGCGAAT CGACGATGCA
TCTTTGAACG ATGATTTGAC GCTCAGCTTG AGGCGACTTC GTGATGTCGC AAGACAGCTA
CGCAAGCGGC GCATGGCGAA CGGCGCGCTG ACATTGGCTT CGCCCGAGGT GCGATTCGAG
ATGGATCAGC AGTCGAGCGA TCCGCTCGAC GTGGGCATGT ACGTCACGCG CGAGACGAAT
CAAATGGTTG AAGAAATGAT GTTGTTAGCC AACGTCAGCA CCGCGGAGCG TATTTTGCAA
GCATATCCCG CGGCGGCGAT GTTACGCCGT CACCCGATTC CCGAGCAGAA AATGTTTGAG
CCCTTACTCA AGGCGGCCAA GGCGTGCGGC GTGGACATTG ACACGCAATC CAACCGCACG
CTCGCGGCGA GTCTAGACGC CGCCGTGCGA CCGGAAGACG CGTATTTCAA CACGCTATTG
CGCTTACAAG CGACGCGGTG CATGTCACAG GCGGTGTACT GCAGCTCGGG CCAATACGCT
GGACCAGAGC GCATGCACTA CGGTCTCGCC ATGCCGTTGT ACACGCACTT TACGTCGCCG
ATTCGTCGGT ACGCCGACGT AATCGTACAC AGATTACTGA GTGCGGCGAT CGGGCTGTGT
CCTCGTCACA AGTCGTTGGA GGACTCCGAT CACGTCAAAT CCGTCGCAGA CGTCTGCAAC
GTGCGTCATC GCAATTCTCA GCAAGCCGGT CGCGCGTCCG TAGAGTTACA CACCTTGGTG
TTTTTCCGCA AGCGCAAAAT TGTCGCCGAC GCGCGCGTCT TCAAAGTGCG CGCAAACGGA
CTCGTGGTTT TTGTCCCAAA GTTTGGCATC GAAGGACCTG TGCTCTTCGT CGAGGGCGAA
AATAGCGACA CCGCGACTTG TACGCTCGAC GAAGACGCTA TGACGGTGAC GCACAAAGGA
AAGACGTGGA AAGTTTTCGA CCGGCTCACC GTGCGAGTGG AGGTCGAACA ACTCCCTGCG
CATCGGAGTC GTTTGCTCAT CACCATCGTT CCTGACGACA CTCCGCGCGG GGAGATCGCG
AGCGCGTGA
 
Protein sequence
MHRVGKTFVK TTKRGQAVTK VRRERYLRDD VYVGCGDAAV EARERTIALI DSNVALHQMD 
ALADARVTDV VICATVMEET RNKSRTSYER LRGLCKDKEK RFYVFSNENH AETYAEDVAG
ESANDRNDRA IRKAAAFYRR AIPRAACERV TLVTNDRGNA RKAREENIDV MSVIEWIQSI
SNDASLVDLV VPGGGAEIPE DERDAKRAKS GGGASASVFE AYLDANDIKE GLASGKLIKG
GVRTSRYNPF EAHVMNEGTG EGVLLSGRAA MNRAIDGDLV AVEILPESQW IRPGGMIITG
TAEEDKETSS ADVDAKDDAG GLAPETADET DVAVAATKAS GVVPTGKVVG IIRRNWRERG
YAASVDMGRD GKGPTAAGGG FASRVLCVPS DRRFPKIRIQ TRQLEGLLDQ RIVVVIDDWP
ADSMYPEGHY VKSLGLIGSV DAETQALLLE NDVDDRPFAP AVYACVPALP WKVTDDHLAE
PGREDLRDLL VCSVDPPGCK DIDDALSARD IDDERIEIGV HIADVTSFLF PDTAMDEEAA
RRGTTTYLVQ RRLDMLPGAL TTDICSLVGG QERLAFSAFW IVQKSTMLPD ETVKPRFTKS
VIKSAAALTY EQAQTRIDDA SLNDDLTLSL RRLRDVARQL RKRRMANGAL TLASPEVRFE
MDQQSSDPLD VGMYVTRETN QMVEEMMLLA NVSTAERILQ AYPAAAMLRR HPIPEQKMFE
PLLKAAKACG VDIDTQSNRT LAASLDAAVR PEDAYFNTLL RLQATRCMSQ AVYCSSGQYA
GPERMHYGLA MPLYTHFTSP IRRYADVIVH RLLSAAIGLC PRHKSLEDSD HVKSVADVCN
VRHRNSQQAG RASVELHTLV FFRKRKIVAD ARVFKVRANG LVVFVPKFGI EGPVLFVEGE
NSDTATCTLD EDAMTVTHKG KTWKVFDRLT VRVEVEQLPA HRSRLLITIV PDDTPRGEIA
SA
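
The sequences above are printed in blocks of 10 residues separated by spaces, so they need to be joined before any length or composition check. The sketch below, with hypothetical helper names, strips the block formatting and computes a GC fraction; it is one simple way to compare a pasted sequence against the Gene Length and GC content fields, not the method the database itself uses.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGCACCGCG TCGGGAAGAC GTTCGTCAAG ACGACGAAGC GCGGGCAGGC GGTGACCAAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_content(seq):.1%}")
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the 2949 bp stated in the record.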