Gene OSTLU_119465 details


Gene Information

Locus tag: OSTLU_119465
Symbol: TraS1
ID: 5000407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009356
Strand:
Start bp: 339054
End bp: 341447
Gene Length: 2394 bp
Protein Length: 797 aa
Translation table:
GC content: 53%
IMG OID: 640415828
Product: DodoPi transposase-like protein
Protein accession: XP_001416135
Protein GI: 145342107
COG category:
COG ID:
TIGRFAM ID:
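
The start, end, and length fields above are consistent with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS length includes the terminal stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming it ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(339054, 341447)
print(length)                  # 2394
print(protein_length(length))  # 797
```

Both values match the Gene Length and Protein Length fields of this record.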


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00345853
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGACG CCCGCGCGGA GGCACCGCGT CGTGGGAAAG ATTTGAAATT CGAATCGGCG 
TCGCGTCGAA CGAGGGCGAG GGCGTCGCGC GAAGCGTTCT CGCCCGGGTC GAGCGCGTGC
GCGTCCGAGC GTGGAAGTTT CGCGACGACG ACGACGGTCG AGAGCGCGTC GCGACGCGAT
GCGCGGGAAT ACACGCCGCC GCCGTTGACG GCGGAGGAGA TCGCAGAGAT GAACGAAGCC
GTATCTGAGC TCGTCGTCGA GAGCTCCATC CCGTTCGCGT GGACGAGCTC GCCGGCGTTC
TTCCGGTTCA TGCACGCGCT TAGGCCTGAA CTCTTCAGCG CCGCGGAGCG CGCGCGCATC
GAAAGCGGCG CGGAGCTCGT CGACCAGCCC GAGAACCTTC GATGTCGCCG TTGGCACTCG
ACAGCCGGCG TCGACGGGTT GTACGAGAAA CGTATGAAAC GCGCCCTCCG TGCGCTTGAA
GACTGGGAGG AGTTGGCGTT ATCGGCGGAT GGTTGGCAAT CTGAAGATGG CGTGAAAGTA
CTAAACTTGT CCATAACGGT GAGAGAAACT GGTCGCGAGT ACTACTGGAA GAGCGTCGAA
ATAGAAACGG AGTCAGAAAG CGCAGTGTTC ATGCAACAGA CCGCCGCGCA AGTGATGAAA
GACCTCCCAC TGAAGAAGTT CAAATGCATC ATTGGGGACA ACACCTCGCA TGTCGTCAGC
TTCTTGGAAG CTATCGCGGC TACTCCAGAG ACATCTCACA TTACTCCGAT TGGTTGCTAT
GCACATCGTT TCAACTTGCT GGCTGGAGAC GTCCACGAGA CGTTCAAATT CTTATTTCTG
GATCTAGAGC GGGCCATCAA CAAGCTCCGT GTTCAGTCTC GAATGCGCGC GCTTTTCGAC
GCGGCGCGTA AAGAGAAGAA TAGTAAGAAA CAACTTGAGA CGTACTGCCC AACGCGATGG
GCGTCCGCGC ATGCATGTCT ACACAGCTAC GTACTGAATA TCGACGTGTT CAGATTTTTG
GACGAAGACG ACGGTGTGAT CGCAAAAAAT TTCCGTAAAA TGTGTGAAAA AGACTCGGAA
TCAATCTTTA AAGTATTCTT TGACCGGGAT TCGCAGATGG CAGTCCGTAG GTTAGAGCCC
TTGTTCCGAG GTTTGGCGGC GGCAAACAAG TTTATGGAAG CTACTGGCGC TCATCTTGCG
GAAGTCTTCC CGCTGTGCTG GGCACTAGAA AAAGACTTGA GACAGTGGCT CGCGAACGTG
AAAACGAGTG CTCAACAGTC ATGGATTGAA CGTGCGACGC GCCAGTCATG GATTGAACGT
GCGACGCGCC AGCCGACACG CGATGGACTC ACGACGCCTG AGCAATTATC AGACGTACTT
CACCGAGTTT ACAAAGTGCG ATTCGAAGGA CTGGAGGCAC AGCCAGGTAA CCCAAGCACG
AAGCGCACGC CGTTGCGAGA CGAACCCCTC GTACATCTTG CATCATATTT GAGTTACATG
ATGCATGCGA GAATGCGTAT CAAACATCAG TTCGTCGTGC CGGAAGTATA TGAAGCGAAA
AAAGGTATCG GAAAGATTCA TTTCCACTTG GAGGAAAAGA AAGCAGACGT GGAAGCAGCC
AAAACGGCAT TAGAAAAACT GAACGCGGTC ACCTGTCCGA AAGCGGCACG ACCCTACCGC
GACGACTATC GCGTTCGCTT CATCGAGAAT GACGCGTACA TCCAAGCTTG CGCCGACAAA
ACGTTTGTGG ACGTTACACT GGAGCAGCTT GAGTACATCA GGGCACACAA CTGGTGGCGC
AACTTCTTGG AGCCAGAATG CGTGCAAACC TGGCCTGCAT GGAAAGATCT AATTCCTTTC
GCACGTCGTA TAAACGCGAT TGTTCCACAT AGTGCGTCTG TTGAACGCAT GAACTCGTCT
CAGAAACTTG TGCACAACAA GAGGCTCAGC TTGAGCCACG CGAATGTCCA GAAGTTGTCT
TTCATTTACT TCAATGCTAG GCATGAGCGC CGGTTATTCG CGAGTCCCTT TCATCGTTTG
GTATCAGAGA CATACGACGC AGCAGAAAGT CAAGTAGCCG CGAGTCCTGT GGTAGTTGTG
GACAGCGACA GTGATGATGG AGTTGGTGAA GATGAAATCC TCTCGTCAGA TGACGACGAC
TTAGACGAGT ACTTAAAAGA GGTAGACGAG CTCGTCATGT CGCCACACGC AAACGTTTCG
AATGGAACGG AAATGACAAG TTTAGGCAAC CTCGATGAAA CCGTGCCGGT ACAGGATGAT
GTCTCGCATC CCACTCGGAA CAAGGCAAAG GAAACTTTTG CACAGAAGAC AGCTCGCAAG
GCATTAGAGC GTCAAGAGAA GAAGAGACGT AGTAAGAGTA GAAGATTGCA ATAG
 
Protein sequence
MRDARAEAPR RGKDLKFESA SRRTRARASR EAFSPGSSAC ASERGSFATT TTVESASRRD 
AREYTPPPLT AEEIAEMNEA VSELVVESSI PFAWTSSPAF FRFMHALRPE LFSAAERARI
ESGAELVDQP ENLRCRRWHS TAGVDGLYEK RMKRALRALE DWEELALSAD GWQSEDGVKV
LNLSITVRET GREYYWKSVE IETESESAVF MQQTAAQVMK DLPLKKFKCI IGDNTSHVVS
FLEAIAATPE TSHITPIGCY AHRFNLLAGD VHETFKFLFL DLERAINKLR VQSRMRALFD
AARKEKNSKK QLETYCPTRW ASAHACLHSY VLNIDVFRFL DEDDGVIAKN FRKMCEKDSE
SIFKVFFDRD SQMAVRRLEP LFRGLAAANK FMEATGAHLA EVFPLCWALE KDLRQWLANV
KTSAQQSWIE RATRQSWIER ATRQPTRDGL TTPEQLSDVL HRVYKVRFEG LEAQPGNPST
KRTPLRDEPL VHLASYLSYM MHARMRIKHQ FVVPEVYEAK KGIGKIHFHL EEKKADVEAA
KTALEKLNAV TCPKAARPYR DDYRVRFIEN DAYIQACADK TFVDVTLEQL EYIRAHNWWR
NFLEPECVQT WPAWKDLIPF ARRINAIVPH SASVERMNSS QKLVHNKRLS LSHANVQKLS
FIYFNARHER RLFASPFHRL VSETYDAAES QVAASPVVVV DSDSDDGVGE DEILSSDDDD
LDEYLKEVDE LVMSPHANVS NGTEMTSLGN LDETVPVQDD VSHPTRNKAK ETFAQKTARK
ALERQEKKRR SKSRRLQ
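
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration only: the record leaves the Translation table field empty, so the standard genetic code is an assumption here, and the `translate` helper is hypothetical. It translates the first row of the gene sequence, ignoring the spaces between 10-base groups.

```python
# Standard genetic code (assumed), written in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between base groups
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon terminates translation
            break
        protein.append(amino_acid)
    return "".join(protein)


first_row = "ATGCGCGACG CCCGCGCGGA GGCACCGCGT CGTGGGAAAG ATTTGAAATT CGAATCGGCG"
print(translate(first_row))  # MRDARAEAPRRGKDLKFESA
```

The output matches the first 20 residues of the protein sequence listed above (MRDARAEAPR RGKDLKFESA).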