Gene Information |
Locus tag | OSTLU_119465 |
Symbol | TraS1 |
ID | 5000407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009356 |
Strand | + |
Start bp | 339054 |
End bp | 341447 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | |
GC content | 53% |
IMG OID | 640415828 |
Product | DodoPi transposase-like protein |
Protein accession | XP_001416135 |
Protein GI | 145342107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
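The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length(339054, 341447)
print(length)                           # 2394
print(expected_protein_length(length))  # 797
```

Under these assumptions the result agrees with the Gene Length (2394 bp) and Protein Length (797 aa) fields.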
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00345853 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGACG CCCGCGCGGA GGCACCGCGT CGTGGGAAAG ATTTGAAATT CGAATCGGCG TCGCGTCGAA CGAGGGCGAG GGCGTCGCGC GAAGCGTTCT CGCCCGGGTC GAGCGCGTGC GCGTCCGAGC GTGGAAGTTT CGCGACGACG ACGACGGTCG AGAGCGCGTC GCGACGCGAT GCGCGGGAAT ACACGCCGCC GCCGTTGACG GCGGAGGAGA TCGCAGAGAT GAACGAAGCC GTATCTGAGC TCGTCGTCGA GAGCTCCATC CCGTTCGCGT GGACGAGCTC GCCGGCGTTC TTCCGGTTCA TGCACGCGCT TAGGCCTGAA CTCTTCAGCG CCGCGGAGCG CGCGCGCATC GAAAGCGGCG CGGAGCTCGT CGACCAGCCC GAGAACCTTC GATGTCGCCG TTGGCACTCG ACAGCCGGCG TCGACGGGTT GTACGAGAAA CGTATGAAAC GCGCCCTCCG TGCGCTTGAA GACTGGGAGG AGTTGGCGTT ATCGGCGGAT GGTTGGCAAT CTGAAGATGG CGTGAAAGTA CTAAACTTGT CCATAACGGT GAGAGAAACT GGTCGCGAGT ACTACTGGAA GAGCGTCGAA ATAGAAACGG AGTCAGAAAG CGCAGTGTTC ATGCAACAGA CCGCCGCGCA AGTGATGAAA GACCTCCCAC TGAAGAAGTT CAAATGCATC ATTGGGGACA ACACCTCGCA TGTCGTCAGC TTCTTGGAAG CTATCGCGGC TACTCCAGAG ACATCTCACA TTACTCCGAT TGGTTGCTAT GCACATCGTT TCAACTTGCT GGCTGGAGAC GTCCACGAGA CGTTCAAATT CTTATTTCTG GATCTAGAGC GGGCCATCAA CAAGCTCCGT GTTCAGTCTC GAATGCGCGC GCTTTTCGAC GCGGCGCGTA AAGAGAAGAA TAGTAAGAAA CAACTTGAGA CGTACTGCCC AACGCGATGG GCGTCCGCGC ATGCATGTCT ACACAGCTAC GTACTGAATA TCGACGTGTT CAGATTTTTG GACGAAGACG ACGGTGTGAT CGCAAAAAAT TTCCGTAAAA TGTGTGAAAA AGACTCGGAA TCAATCTTTA AAGTATTCTT TGACCGGGAT TCGCAGATGG CAGTCCGTAG GTTAGAGCCC TTGTTCCGAG GTTTGGCGGC GGCAAACAAG TTTATGGAAG CTACTGGCGC TCATCTTGCG GAAGTCTTCC CGCTGTGCTG GGCACTAGAA AAAGACTTGA GACAGTGGCT CGCGAACGTG AAAACGAGTG CTCAACAGTC ATGGATTGAA CGTGCGACGC GCCAGTCATG GATTGAACGT GCGACGCGCC AGCCGACACG CGATGGACTC ACGACGCCTG AGCAATTATC AGACGTACTT CACCGAGTTT ACAAAGTGCG ATTCGAAGGA CTGGAGGCAC AGCCAGGTAA CCCAAGCACG AAGCGCACGC CGTTGCGAGA CGAACCCCTC GTACATCTTG CATCATATTT GAGTTACATG ATGCATGCGA GAATGCGTAT CAAACATCAG TTCGTCGTGC CGGAAGTATA TGAAGCGAAA AAAGGTATCG GAAAGATTCA TTTCCACTTG GAGGAAAAGA AAGCAGACGT GGAAGCAGCC AAAACGGCAT TAGAAAAACT GAACGCGGTC ACCTGTCCGA AAGCGGCACG ACCCTACCGC GACGACTATC GCGTTCGCTT CATCGAGAAT GACGCGTACA TCCAAGCTTG CGCCGACAAA ACGTTTGTGG ACGTTACACT GGAGCAGCTT GAGTACATCA GGGCACACAA CTGGTGGCGC 
AACTTCTTGG AGCCAGAATG CGTGCAAACC TGGCCTGCAT GGAAAGATCT AATTCCTTTC GCACGTCGTA TAAACGCGAT TGTTCCACAT AGTGCGTCTG TTGAACGCAT GAACTCGTCT CAGAAACTTG TGCACAACAA GAGGCTCAGC TTGAGCCACG CGAATGTCCA GAAGTTGTCT TTCATTTACT TCAATGCTAG GCATGAGCGC CGGTTATTCG CGAGTCCCTT TCATCGTTTG GTATCAGAGA CATACGACGC AGCAGAAAGT CAAGTAGCCG CGAGTCCTGT GGTAGTTGTG GACAGCGACA GTGATGATGG AGTTGGTGAA GATGAAATCC TCTCGTCAGA TGACGACGAC TTAGACGAGT ACTTAAAAGA GGTAGACGAG CTCGTCATGT CGCCACACGC AAACGTTTCG AATGGAACGG AAATGACAAG TTTAGGCAAC CTCGATGAAA CCGTGCCGGT ACAGGATGAT GTCTCGCATC CCACTCGGAA CAAGGCAAAG GAAACTTTTG CACAGAAGAC AGCTCGCAAG GCATTAGAGC GTCAAGAGAA GAAGAGACGT AGTAAGAGTA GAAGATTGCA ATAG
|
Protein sequence | MRDARAEAPR RGKDLKFESA SRRTRARASR EAFSPGSSAC ASERGSFATT TTVESASRRD AREYTPPPLT AEEIAEMNEA VSELVVESSI PFAWTSSPAF FRFMHALRPE LFSAAERARI ESGAELVDQP ENLRCRRWHS TAGVDGLYEK RMKRALRALE DWEELALSAD GWQSEDGVKV LNLSITVRET GREYYWKSVE IETESESAVF MQQTAAQVMK DLPLKKFKCI IGDNTSHVVS FLEAIAATPE TSHITPIGCY AHRFNLLAGD VHETFKFLFL DLERAINKLR VQSRMRALFD AARKEKNSKK QLETYCPTRW ASAHACLHSY VLNIDVFRFL DEDDGVIAKN FRKMCEKDSE SIFKVFFDRD SQMAVRRLEP LFRGLAAANK FMEATGAHLA EVFPLCWALE KDLRQWLANV KTSAQQSWIE RATRQSWIER ATRQPTRDGL TTPEQLSDVL HRVYKVRFEG LEAQPGNPST KRTPLRDEPL VHLASYLSYM MHARMRIKHQ FVVPEVYEAK KGIGKIHFHL EEKKADVEAA KTALEKLNAV TCPKAARPYR DDYRVRFIEN DAYIQACADK TFVDVTLEQL EYIRAHNWWR NFLEPECVQT WPAWKDLIPF ARRINAIVPH SASVERMNSS QKLVHNKRLS LSHANVQKLS FIYFNARHER RLFASPFHRL VSETYDAAES QVAASPVVVV DSDSDDGVGE DEILSSDDDD LDEYLKEVDE LVMSPHANVS NGTEMTSLGN LDETVPVQDD VSHPTRNKAK ETFAQKTARK ALERQEKKRR SKSRRLQ
|
| |
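The gene and protein sequences above can be checked against each other by translating the DNA. The Translation table field in this record is empty, so the sketch below assumes the standard genetic code (NCBI translation table 1); that choice is an assumption, although the first ten codons of the gene sequence do translate to the first ten residues of the protein sequence under it. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and `gc_fraction` is included only as a way to recompute the GC content field from the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI table 1, assumed), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGCGCGACG CCCGCGCGGA GGCACCGCGT"))  # MRDARAEAPR
```

Passing the full Gene sequence field to `translate` and comparing the result with the Protein sequence field (after `clean`) gives a complete consistency check.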