Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46212 |
Symbol | |
ID | 7201285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 631019 |
End bp | 633478 |
Gene Length | 2460 bp |
Protein Length | 768 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180675 |
Protein GI | 219119847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGGAGTGTAG AGAAACGAAC GAAAACACAA GTACTTCCGA GAGTACTTCC GAGACCCAAC GTTAGACAGG CAGGCAGGCA CGCACCTTTC CTCTTCCTTT CTCCGAACAC TTCCCGTTCG TACATCCATT TCCGTTTGTT CGATACTCGT ACAATGAATC GTTCGTACGG AATGGCAGCT CACGGAGCGG ATGGTCCGGA CAGCGACGAC GATGACACCA TGTCGGATGT GTTCACCATG GATGGGGACT ACACGTTGTC CTCCGGGATG CCGCCGCCGC CTGCCACGGA GCGCAGCCAT ACGCACATTC CCTTGACCGT CGAGTCGCCC TCCGCCACGC GTTCGCAAGG ACTGAATCCT TACTCCCCCC GGACGGACGC CGCAGCCTAC GACGTGTCGA CGGATGACGG CTCCGTGTCG GGCATCGAAT CCATGTCGGG CTTAATGACG ACCGACGGAT CCTACTTTCA AAACGATACC GTCGTGCACG ACGACGAAAT GTCCTTCCAA GCCAAGGTGC AAGCCGAAAC GAACAAGATT CTGAGTGAGT ACGACATGGA GGAAGACTTG AATCGATCCG GTGATTCCAA GCTCGGACGC TACCATAACC GTACTTCCGT GTCGGCTCCA CGACAAGCCG CGCCCCCCAT GACACCCACG AGCTCGATGG ACGACGACAG CGACAGCGAG GCCGCACGAG CGAGGCCCGA CAACCAGACG CGACACAGCA AAAAGGACCC AAGCTCCGCC GTTGGCCCCC CTCCCGTCTC TTCCAATGGA GCCAGGGGCA GCTCCGTGCG CCATTCCGTG CGATCCCTAC GCAAGCAATC CAACAACAAA CCCGAGGCGC CAACGAAACC GGCCGTCTAC GACTTGTCGA CGCCGGAAAC ATCCCCGACT AATCGTAAAG CGCCGGTTAC GCCACAGACC AAGACCCCCC AGCGGTCCAA CACCACGGCC CGGCCCCCCA ACATGATCGT TGTGGGATCC TCGACTACTA CCGAGTCTTC GGAAAAGAAC ACCCGAGCCG CCGCCGCAAT CGGAGCGTCA GCCGCACCCA CCAGGAAAGG GTGGTTAGGA AGTCGCTTGA GCTTCGTTAT CATCGTGTTG TCGATTATTC TACTAACGGC AATTTGCGTT GCCGTGGGAG CCTTGGTCGC CGCTGGCAAT CGCAATAGTG ATTCTTCCAG CGCATCTTCG TCAGAAGGGG GCTCGAGCAA CGAAATAGGA GCCCCGCCGG TCGATTTCCT TCCCTTCTTG CCGCCGACGA CGGCGCCGAT AGCTTCCGAA ATCCCTTCCA CCGGATCTCC GACACCGGAG CCGAGCGCGT GGAATCGTGA TGTGTGTGGG ATCGATGATC CCGACAAGTT GGTCTTTTTG GGTGAAGGGA CCGGTGACCG CTCCTGTGAG TGGTTGAGCG GGCGTCCCAA TGTCCGCGAC CAATTTTGTC AACCCGGACT CGAGCCTTTT ATTTATTGCC GGGAAACTTG CAACAATTGT GGACCGGCAG AGGGCGACAC GACGGATGCT CCGACCGCAA TGCCTCCGCC GACGAGTACA CCCACATCCG AGCCAGTCAT GATTACAGCC TTTCCCACCC GGGCACCAAC AAGTGCTCCG ACGCCGGCGC CCACCTTGGT AGCGACGATA GCACCAGTGA TCATTGCCAC GCCGGCTCCC ATCGCAGTTG CCACCCCACC TCCTACCGGA ACGTCGACGG TTGGGTCGGC TATTACGAAT GCCGCACCAG CTTCCACAGC CGCAGCCCTG CAAAATCCGG CTTCAGCGCA GTCGAGAGCC TTGGCGTTCG TAGAAAACTC TAGTGCGACA GCGGCTTTGC CAGAAAGCCG CATCGTACAG CAGTTTGCTT TGGCCACGCT GGGCTTTCGA ACCGGCCTAG CGGGACGGCG ACTCGAGGAA GCTCGTCAAT TGCAATCTTG GATGAGCGGC ACGAACGAGT GTTCCTGGTC GGGTGTCACG TGTGATAGCC AATCTAGTGT CATCGGAATC AATTTAAGTG GACGAGGCTT GAGGGCATCG TTGCCTGGGG AATTGGCCAT GCTGTCGAAT TTGGTCACAC TCGATGTGAG CGTCAACGAA TTTTTTGGTA CGATACCCTC GGATTTCGGT AGAATGGTTA ATCTGCGTAC ACTACGCATG GAACAGAATG GTTTGACGGG AAGCATTCCG AATACCATGC GGAATATGCG TATTCTACGT GAGTTTTATG TGGAATGGAA CGAGCTCACC GGAGATTTTC CGAACGATGT AGTTTTGGCG ATGTCGAGTT TGGAGGAATT AAGTATCTAC CATAACAATA TCGCAGGGTC TGTTTCGGAC GCGGTTTGTG CTCTGGGCCT GGACGAATTG TGGATTGACT GTCGTGAGGT CAATGTCGAA ATTGGTTGCT GGACCCGGTG TTTCTTTCAA TGTGGAGGGT CCACTGGTGT CGCTTGTTAA
|
Protein sequence | MNRSYGMAAH GADGPDSDDD DTMSDVFTMD GDYTLSSGMP PPPATERSHT HIPLTVESPS ATRSQGLNPY SPRTDAAAYD VSTDDGSVSG IESMSGLMTT DGSYFQNDTV VHDDEMSFQA KVQAETNKIL SEYDMEEDLN RSGDSKLGRY HNRTSVSAPR QAAPPMTPTS SMDDDSDSEA ARARPDNQTR HSKKDPSSAV GPPPVSSNGA RGSSVRHSVR SLRKQSNNKP EAPTKPAVYD LSTPETSPTN RKAPVTPQTK TPQRSNTTAR PPNMIVVGSS TTTESSEKNT RAAAAIGASA APTRKGWLGS RLSFVIIVLS IILLTAICVA VGALVAAGNR NSDSSSASSS EGGSSNEIGA PPVDFLPFLP PTTAPIASEI PSTGSPTPEP SAWNRDVCGI DDPDKLVFLG EGTGDRSCEW LSGRPNVRDQ FCQPGLEPFI YCRETCNNCG PAEGDTTDAP TAMPPPTSTP TSEPVMITAF PTRAPTSAPT PAPTLVATIA PVIIATPAPI AVATPPPTGT STVGSAITNA APASTAAALQ NPASAQSRAL AFVENSSATA ALPESRIVQQ FALATLGFRT GLAGRRLEEA RQLQSWMSGT NECSWSGVTC DSQSSVIGIN LSGRGLRASL PGELAMLSNL VTLDVSVNEF FGTIPSDFGR MVNLRTLRME QNGLTGSIPN TMRNMRILRE FYVEWNELTG DFPNDVVLAM SSLEELSIYH NNIAGSVSDA VCALGLDELW IDCREVNVEI GCWTRCFFQC GGSTGVAC
|
| |