Gene Information |
Locus tag | OSTLU_24979 |
Symbol | |
ID | 5003148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ostreococcus lucimarinus CCE9901 |
Kingdom | Eukaryota |
Replicon accession | NC_009362 |
Strand | - |
Start bp | 654431 |
End bp | 657658 |
Gene Length | 3228 bp |
Protein Length | 1065 aa |
Translation table | |
GC content | 57% |
IMG OID | 640418569 |
Product | predicted protein |
Protein accession | XP_001419448 |
Protein GI | 145350074 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
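The gene length listed above agrees with the start and end coordinates if they are read as 1-based and inclusive. That reading is an assumption here, not something the record states. A minimal Python sketch of the check, where `gene_length` is a hypothetical helper and not part of any database API:

```python
# Coordinates copied from the record above.
# Assumption: they are 1-based and inclusive on the replicon.
START_BP = 654431
END_BP = 657658


def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length for 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end coordinate must not precede start coordinate")
    return end_bp - start_bp + 1


length = gene_length(START_BP, END_BP)
print(length)           # 3228, matching the Gene Length field
print(length % 3 == 0)  # True: the span is a whole number of codons
```

The strand does not affect this calculation, since the length depends only on the span between the two coordinates.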
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.42763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGACGA CGCGGAACAC CCGACCGCAC ACTCGACGAA ATGTGCGCGC GGTGGTGACG ATCGGGCGCG CGCTCGCCGT CGCGCTCGCG TGCGTGACGG TGTCGATCGC CGCGGTGGGG ACGCTCGGGG CGTACACGCA CGTGGATGAC CCCAGAGTGC CGCACCCGGG ATGCATGAGG TACAGCCATC GCGGCGTGAC GGACCGCTTC GATAGATGTC CCGGGATCAT GATGCGCGAA AACCCGCGAT ACGCGAACTA CGACACGCAA ATAATGCAAC CTTTTAACTT GCCCAAGTTT CAGAGTGCGC TCACGGCGTA TGTGAGCGAG CATTACGCTG GAGACGCCGA TTATGACGCG TGCGTCATCA CGGGCGCGGA TGACGACGGC AATCGGAACG AGGTGTGGAA AACGTACAGC CCAGATTACT TCGGTAGTCT TCGACTCACT TACACGTGTC CTTGGATGAC TGAATCAGTG CTGAGACCGG CTTCCGGATA CGGTGATCCG AGCGATACGA CTACGTTCGT GATGGACATG CGTTACGCGG CGTTTTTGAT GATAAACGCG CGCGTGCAAC GGGATTTGCA AGCTTGGCTG ACGAGACATG CGATCAGCGA TGCCGATATC GCCAGTGATA CTAAAGACTA TCTATTCGAG TTTTTCCGTC TTTACTATCG CGCGGGGGAT GTGTCGAAAT TAGGCAAGTT TTACCGCAAA GGTTTGTCGA TCAGCGCTTC GACGAAGAAA TGGATGGATC GATTTGAACA ACTTTCCGGT GACAAGCACC TCGAGCTCCA AATCGACGAA CAAGCCGCCT TGGACGATTT CTTGCTTTCA AACGCCGCGA CTGGGTATGC GCCATCGGGT TGGCCGATCC GTGACGTGGT CAACGATCTG GAGGGATACA ACGCAGCGCT CGAGTCGGCC TACGACGCCC ATTACTGGAA TACGGAGAGT TCTCGTGTTT CGACGTGGGC GTCTACGCAC TCGTACAGCG TGACCACTCC AGAAGCAAGA TTGCAATCCA ACGACGACGT GACCAGTACG AACGCTGCAG TGAAAGAACG CTACATTACG ACGAAGGTGG ACGAATGGCA GTCTTCGCAG TGCACAATAA ACACGTACCT CTGTGATCTT TCACTAGATT CAGAAGCTGC GGCTGCAGAT GTTTGGTCGT TCAACCAAAA TGTCAATATA TCTGCCGATG AAGTGTACCA ACAACACGTT GTAGACACAA TCGAGGCCTG GAAAACGACG AATAGTTATT CATTCTTGAC TGATTTGGAT GCGCAGCTCG CGCAAGACGA CTTGGCGACT TTTAACTTGG AAGCAGAAGA GAGGTATCTC GAATATCACA ACGCAGTGCT GGCACCTAAT GCCGCCGCCG CCCTGGCTGC CGCACCGACG CCGACGCCGA CGCCGACGCC AACACCGAGT CCGCCTCCGA GCCCGCCTCC GAGCCCGCCT CCGAGCCCGC CTCCGAGCCC TCCTCCGAGC CCTCCTCCGA GCCCTCCTCC GAGCCCTCCT CCGAGCCCTC CTCCGAGCCC TCCTCCGAGC CCGCCGAGCC CGCCTCCGAG CCCGCCTCCG AGCCCGCCTC CGAGCCCGCC TCCGAGCCCG CCTCCGAGCC CGCCTCCGAG CCCGCCTCCG AGCCCGCCTC CGAGCCCGCC TCCGAGCCCG CCTCCGAGCC CGCCTCCGAG CCCGCCTCCG AGCCCGCCAC CACCGCCCCA CTTTGCGCCA TTCGTGCCGC TCGCGAGTGT TCCGGAATAC GACGAGCACG AATCGTTCGA AATGGAACCT 
TTCGTCGTGT TTGTCGCTAA CCCATACGTC GTTCCGGTGT ATGGGCACCG TAACAATCAC ACACACATTC CCATTCTCCC AGTTCCATTG GCGAGTATTG TTGATGATGT GGCTGACGAC ATTACGATGC CGGCGTATCT GTCGAAGTAT CATCGCAATA GTCTCAGCAC CTATGGAGCT CAATGGCATT TCAGATACGG CAGAAACTGC AAGGCTACGT GCGCCGGGTC TCGAACGCAA CTCGACGTTC AAGGTCCTGT CGGCGCGCTC TGGAACTGCG ATTCCGATGA CTTGGTTTGC CCTCAAGACA ACGATACGAG CGAGTTGGAA GGCATTCTTG AGGCGCTCTC AGCTTCAGAT CACAAACATC CGAGTTGTCC GGCTGCTGAT CCATGCATGG TCGTTCCTCC GCCGTGCTAC TACGTCAGTA CGGAGTGCAT TCCACAAAGC GTAATAGAGT ACACACTGAA CAACACGTCT ACGCGAGAAG CCGAGGAAGG CGTAGACGAG CGTTATCTTC ACGTCCCGCT GGCAGTCGAT GATTTGACGC AAGTCAAGCT CGTATCACAG AGACCACCTC TTGGTAACCG CATGAGGTGG CCGTACAGCA GAGGTATCGC CATGACTCCG ATAACCGGGC GCGATGGAAA TGATTACTAT CATACATGCA CTTACGGCGC CACACTAAGT GGTTGGTCTG GTTCCGTCGC TTACGGGTTC GCGCTGGCGG ACGCCGAAGG CAATCACGTC TACGATTCGG GACACATTTT CGGTCACGAA ATGTGGCAAG CCTACTACGG AGAGACTGCG TCTCCGGCGT TCACGTCCGA CTGTACGGAC GTGCAGGCGG GAAAGTGGAG AAATCGAGTG CAGTACCACA CCCCGAATCT CGCGTCGCAA AATTTCGTCT TCGGAACGTG TGCAGGACAG TGCTCTGCGA TTCTTGGTTC GGTGCCAGAA GACCAGATCG TTGCTCCTAT TGGTTTAAGT AAAGGGCATC TGCGAGATAG GCTGATTGAA GCGAATGATG CAGGACTCGG CGCCGCAGAG AAAAACGTAA TCACCGCTCA CGAGCTCGTG CGAAGAGCAG CGGCGGCTAC CGTCGCGCTC GGCGTGAAAG AGCTCGGCGT GAAAGATGTC GGAGTCAAAG ATGAGCACCA CGAAAAAGCG CGTGGTGACG CCGTCGACGA CGTACAAGGT AAACACGCTT CCCTCGGCGA TGTGCGCGAT TTTGAGCGTC CATTTGGTTC GTTCGTCGCG TTCGCGGCTC CATTCGCTTT CGTGTGCGTC ATCGTCTTAG CGGTGTTTAA ATCGCGCTCG ATTCTCGACT TCGCTCGTGA GCGTCGCGAG CGCCGCACGC CAGAGGGGCG CGCCGAGCGT GCAAATCTCG TCGTGTGAAC GAGTACGTTT TGTATGCAAT CGAGCATT
|
Protein sequence | MTTTRNTRPH TRRNVRAVVT IGRALAVALA CVTVSIAAVG TLGAYTHVDD PRVPHPGCMR YSHRGVTDRF DRCPGIMMRE NPRYANYDTQ IMQPFNLPKF QSALTAYVSE HYAGDADYDA CVITGADDDG NRNEVWKTYS PDYFGSLRLT YTCPWMTESV LRPASGYGDP SDTTTFVMDM RYAAFLMINA RVQRDLQAWL TRHAISDADI ASDTKDYLFE FFRLYYRAGD VSKLGKFYRK GLSISASTKK WMDRFEQLSG DKHLELQIDE QAALDDFLLS NAATGYAPSG WPIRDVVNDL EGYNAALESA YDAHYWNTES SRVSTWASTH SYSVTTPEAR LQSNDDVTST NAAVKERYIT TKVDEWQSSQ CTINTYLCDL SLDSEAAAAD VWSFNQNVNI SADEVYQQHV VDTIEAWKTT NSYSFLTDLD AQLAQDDLAT FNLEAEERYL EYHNAVLAPN AAAALAAAPT PTPTPTPTPS PPPSPPPSPP PSPPPSPPPS PPPSPPPSPP PSPPPSPPPS PPSPPPSPPP SPPPSPPPSP PPSPPPSPPP SPPPSPPPSP PPSPPPSPPP SPPPPPHFAP FVPLASVPEY DEHESFEMEP FVVFVANPYV VPVYGHRNNH THIPILPVPL ASIVDDVADD ITMPAYLSKY HRNSLSTYGA QWHFRYGRNC KATCAGSRTQ LDVQGPVGAL WNCDSDDLVC PQDNDTSELE GILEALSASD HKHPSCPAAD PCMVVPPPCY YVSTECIPQS VIEYTLNNTS TREAEEGVDE RYLHVPLAVD DLTQVKLVSQ RPPLGNRMRW PYSRGIAMTP ITGRDGNDYY HTCTYGATLS GWSGSVAYGF ALADAEGNHV YDSGHIFGHE MWQAYYGETA SPAFTSDCTD VQAGKWRNRV QYHTPNLASQ NFVFGTCAGQ CSAILGSVPE DQIVAPIGLS KGHLRDRLIE ANDAGLGAAE KNVITAHELV RRAAAATVAL GVKELGVKDV GVKDEHHEKA RGDAVDDVQG KHASLGDVRD FERPFGSFVA FAAPFAFVCV IVLAVFKSRS ILDFARERRE RRTPEGRAER ANLVV