Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47570 |
Symbol | |
ID | 7202634 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 141549 |
End bp | 143571 |
Gene Length | 2023 bp |
Protein Length | 609 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182010 |
Protein GI | 219123393 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0215149 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCAAG CCTACATTGA TGGACGACAA CCGTACTGTT GTCCATCAAA CCCTTCTACC GCAGCATCCG GCCCTATCGT CTACGAGGAG CAGCAGACAC TGTCGACGTC GAGTACCTTG GCTGCTCCGT ACTCGTCCCA ACCGACTACC AAACAGACTC CTCCTACCGT TGTAAGATCG GGCATCCCGT CTGATCATCG GAAAGGAGAT CAAGAATATC AAATCCACCA AATCGCCGCA GCCTCGCATA CTAACCGATC CGTTCGGGAC ACCGCTCGCT TAGTTCATCG CGTTCTACAG CAACAGGAAC ACCGGCCTTC ATCACAGGCA CCGACTCCGC TGGAAGCCTT GCCTACCCTG CTCTACAATC CAAGAAGCTC TTCGCTAAAC GAAGCGTTAG CCTCGGCCTC CACCGACACA ATTGTCATTT CCGCCGGACC AGAAGTTGCC GTGGAGGATA TCTTGTGGAT GCCGGTCTCG TGTGGTATGA CCGCCCTACA CGTCGCCGCA CGCATGTTGG TCTGGGTGCA ATATGTCGGC GATGACGATG CACACCAGCC ATGGAACGTG CGCTTTGTTG CCAACGTTGT CGAACGCCAG CTTGCACAAC GACAATCACG ATTGCAAAAT GTGCCGTATC CGTGCACAAC CAACACCGCC GCACGCAACT GGTGGCGCTG GCTCGTTCAA CAAGCGTCCC AAACGGACCA AGCCCGTCGC AATCTTGTGC GCAAGCGAGA TTCCGGTGGC GATTCCGTGT TTGATACCTT TTGGGCCACG TGGGCGCACC CAACGCCGTC GACAGCGGGT GCCGGCCGAA CTGTCACTGA CTATGGCACA CGGCCGGCTC CGCCTCCGGC GCCCTTTGTG GCCCGCTTTC CAGCGGCACT GGATCAAGTT CTAGCCTCAC CGCACCACTT ACATCTTTTG CAGATTTGTA TACAGCGCCA ACGGCAAGCC AGTATTCAGA ACATTACGTT TGCAATGGAT GAGACTATCC CCGACGCTGT CTACGTTGTC GCTCGCTTCT GGTGGGCCTT GACGATTCTG TTGGACGGGG CACGGGGCGA AAACGTGCAG TCGAGCGCTA GCGCTAGGGA GAGCAAAGTT GCCTCTACTC CCGCCGTGTT GCCTATTGTC CCCTTTCTGG CATCAACTGG GTCTTGTCCG GAGCGTCTCG CTCGTTTGGT AGTCTGTTTG TTTCCCGAGC AGCTTTCGTA CATTGTTCCG GACACTGGCG CCACAGTGTT GCACCTGTGG GCTGCCGCTC CCAACCCGAT TCATGAAGAA CGTGACGGTA TGCTGATCCC TCTCTTACGC ACTTGTCCGG CGTTGGCTGC TGTGAAAGAC AACCGAGGCC GTTTGCCTAT CCACGTAGCC CTGCATTGGA ACAAAGCCAT GCCGGACGTT CGTGCGTTGT GGGAACACGC CCCCCATACC GTTCACGTTC CAGATCCGCT CAAGCCGACT TTACCGTTGG TTGTCCTCGT CACGTTGACT GCCCGCAAAC AGAAGATGAA CACCTTGCGC GCGCTGGAGC GACGCAACCC CGCCGTATCG TTGGCGGAAT GGTTGGACGC AACGCGCGAT ACCGAAATGT TACAAGGTAC GCTTCAGTGT CAGCTGCTGA GTTCGATATA CGACGTGTTG CGGACGTTCC CGCAGGCGTT GCAATCGAAT TAGGCTTGTA CAGCGAGTGC TTTGTGCTTG GGGTTTATTC AGCGGACCTT TGTTGTTCGG CACGTACACG ATGAGTTGCC ATTGGAAAGT GATTATCCGC CGCAACGGGC CCCAACTAGC GTTCCAGCTA GTCCTCTTAG GGTATCGACA CGAGTAGTCC TGCAAAATCC GTGGAGACGG CGATCTATTG AAAGCTCCAT CTTGACTTTT ACTTATCCAC AGCTTTCCCC GAGTTAGGAA ATCCGTTGAT CGTTGTCTCA GACATTCACT CATATAGAGC TTGGATGAGA CAGTGCTCCC TTCCTATCAC TGTGGCAGTT ATCCTGGTGG TCCAAACATC CCC
|
Protein sequence | MIQAYIDGRQ PYCCPSNPST AASGPIVYEE QQTLSTSSTL AAPYSSQPTT KQTPPTVVRS GIPSDHRKGD QEYQIHQIAA ASHTNRSVRD TARLVHRVLQ QQEHRPSSQA PTPLEALPTL LYNPRSSSLN EALASASTDT IVISAGPEVA VEDILWMPVS CGMTALHVAA RMLVWVQYVG DDDAHQPWNV RFVANVVERQ LAQRQSRLQN VPYPCTTNTA ARNWWRWLVQ QASQTDQARR NLVRKRDSGG DSVFDTFWAT WAHPTPSTAG AGRTVTDYGT RPAPPPAPFV ARFPAALDQV LASPHHLHLL QICIQRQRQA SIQNITFAMD ETIPDAVYVV ARFWWALTIL LDGARGENVQ SSASARESKV ASTPAVLPIV PFLASTGSCP ERLARLVVCL FPEQLSYIVP DTGATVLHLW AAAPNPIHEE RDGMLIPLLR TCPALAAVKD NRGRLPIHVA LHWNKAMPDV RALWEHAPHT VHVPDPLKPT LPLVVLVTLT ARKQKMNTLR ALERRNPAVS LAEWLDATRD TEMLQASALC LGFIQRTFVV RHVHDELPLE SDYPPQRAPT SVPASPLRVS TRVVLQNPWR RRSIESSILT FTYPQLSPS
|
| |