Gene Information |
Locus tag | PHATRDRAFT_48680 |
Symbol | |
ID | 7194863 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 588147 |
End bp | 590220 |
Gene Length | 2074 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183248 |
Protein GI | 219125984 |
COG category | |
COG ID | |
TIGRFAM ID | |
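The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length) and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 588147
end_bp = 590220
protein_length_aa = 571

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1          # 2074

# Assumption: one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)         # 1716

# Bases in the gene span that are not coding sequence
# (introns and/or untranslated regions; the record says the gene is spliced).
noncoding_bp = gene_length_bp - coding_bp       # 358

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the span contains 358 bp beyond the coding sequence, which is consistent with the "Is gene spliced | Yes" field. The record itself does not say how those bases are split between introns and untranslated regions.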
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGACCGCTTT CTCACTCGGA GCCTCGTCGT GTCGGTCGTC GAACAAGCAG ACCACTCACT GTTAGATCCA TAAACACACT TGTCTATCCA AGTCCTCTCG CAGTAACTCC TCGACATCAC AATCATGGTA TCTTGGACGA TTGCTGCCGC CTTTCTCTTC ACGCCGACGG CGGCTTGGGT CTCCCGACCC GTGTCGCAAG TTCGATCGTC TCCGTCGATG CTGCGCATGT CCAGCGATCA GTTCACGGTC GCCTGTCTAG GCGACTTGCA CTTGGATCCT CGGTACATGG AAGACTACTA TACTGGTCGT GATCAATGGT TGAATATCAT CGATGAAGCC AAGGCCGATC ACGGCAACGT TGCGCTCGTG TCTCTCGGCG ACTTGGGCGA ATCCAAGTCG GTCCGTCCGG AAGAAACCTC GGAACTCTTC GCCGGCACAA CGGAATGTCA CGAACTCGCC GCCGAGTTCT TGCAATCCTT CGGTGTTCCC TACGAAGTCG TCGGCGGCAA CCACGACCTG GAAGGTATTG ACGAATTCGC CACGGACAAG GAGAATCTCA AAATGTTTCT AGAAAAGCAC GGCAAGCCCA CGCCGCATTT CTCCCGGAAG ATCGCCGACA AGACGCTCTT GGTCGGCCTC TCGTCGACTG TCTTTCGGGA TGCCGAATAC ACCTCGCACG AAGTTACCAT TGACGACGCA CAGATGGCCT GGTTCGAACA GACCCTGAAG GATCATCCCG CCGCGGACGG ATGGCGATTG TTCGTCTTTA CCCACGCGCC ACCCAACGGC TCTGGATTGC GCGTCCTGCA AGAGAACCAC GTCGTTAACG GTTGTTGCTG GCTCAACCAT TCCAACGACA AAAAGTGTCG CAAGTTTATC GAACTGGTAC GCGAGCACCG TTGCATCAAG GCGTGGTTTT CCGGTCACTT CCATTTGGGA CAGGGTACGT AATCTCGGGG GAAAAACACT GCCAGAGGCC GACGCGGCCA CGGCCCCAAC GCAACCCGAC CCTCCAAATC TCAATTGCTC TTTTTCGTTT GTTGCAAAAC CCAATCACAG ATTACCAAGA TTCCATTACC TTCCCCACCA TTGACCCCAA GGACGGCCCT TACCCCAACC GCGGTTCCTG CACCTTTGTC CAAACTTCGG TTATGCGTTC GGGATCCTCG CGGGACGGTC GCCAGCAGTC CCGCCTGATT CGCGGAAACA AGGACGGCTT CGAAATCTGC ACCGTCGACC ACAAGGAGGA TGGCAAGGTC CGTGTGGACG CCACCATTAG TTACCGTGGT GACACCAACG AAGTCGGTAT CTACGAACAC GAAGACGAAG TCAAAAAAGG CGACGAACTC TTCAAGGTCT ACGCCCCATC CGCCGGTGAC GACTTGCACG CTCCGGATGA AGGCCACGTG CGTTACAACG ACGATGGTCA AGTCAAGATT GATCTCGACG TGACCGAAGA CACCAAGGCA TGGTGGTACA TGTCCGACGG CCGTGTCCTC GGTATGCTTA AGGGCATGTT GATTGAATAC GATCGATCCA CCCTGGCTCC GCTCGGTTTG GTCGTGGGCG CCGACGAGCT GGTCGGCAAG CGCGTGGCCG TCATCGACTC GGGACTCGAC GACGAAGAGT GCGTTATATC CGACGAGCTT GGGATGGAAG GCATCGAATG TGGCGGGGAA CCAGGTCGCG AACAGGCTGT CATTCTCGTC GACAAGGACG ACGGTTCGGT CGTTGTGGTA CAACCCAACG AGGATGGCTC GTACTGGCGT AAAATTGTCC GCAACAAGAT GATCCGTATG 
AAGGAAGTGC GCCGGGTCAA GGCGGCCAAG GAATTTGCCA AGTCGTTGAT GGACAAAGAA GTGGAAGTAG TCAGCTCCTG GGGTCCCTAC ACTACTACCA GTGGCACCGC CAAGAAAACG GGAGTCCAAG GTTTGACGGC ACCGGCACGC AAGTAATTAC GATCTTCATC GTCGCTGTGC CTAGTAGTGT CAAGAATGGA GCTTAGATGA TAATAGGAGA TTGTCACGTA GTAGTATAAC ATTATACGTT AGCAAGGAAC ACAGAACGTG ATAC
|
Protein sequence | MVSWTIAAAF LFTPTAAWVS RPVSQVRSSP SMLRMSSDQF TVACLGDLHL DPRYMEDYYT GRDQWLNIID EAKADHGNVA LVSLGDLGES KSVRPEETSE LFAGTTECHE LAAEFLQSFG VPYEVVGGNH DLEGIDEFAT DKENLKMFLE KHGKPTPHFS RKIADKTLLV GLSSTVFRDA EYTSHEVTID DAQMAWFEQT LKDHPAADGW RLFVFTHAPP NGSGLRVLQE NHVVNGCCWL NHSNDKKCRK FIELVREHRC IKAWFSGHFH LGQDYQDSIT FPTIDPKDGP YPNRGSCTFV QTSVMRSGSS RDGRQQSRLI RGNKDGFEIC TVDHKEDGKV RVDATISYRG DTNEVGIYEH EDEVKKGDEL FKVYAPSAGD DLHAPDEGHV RYNDDGQVKI DLDVTEDTKA WWYMSDGRVL GMLKGMLIEY DRSTLAPLGL VVGADELVGK RVAVIDSGLD DEECVISDEL GMEGIECGGE PGREQAVILV DKDDGSVVVV QPNEDGSYWR KIVRNKMIRM KEVRRVKAAK EFAKSLMDKE VEVVSSWGPY TTTSGTAKKT GVQGLTAPAR K
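The sequences above are printed in space-separated blocks of 10 characters. A minimal sketch of how one might strip that layout and recompute the GC percentage reported in the GC content field follows. The function name `gc_percent` is hypothetical, and the sketch assumes the input contains only the bases A, C, G, T plus whitespace.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    clean = "".join(seq.split()).upper()
    if not clean:
        raise ValueError("empty sequence")
    gc = clean.count("G") + clean.count("C")
    return 100.0 * gc / len(clean)


# Small illustrative input, not taken from the record.
example = "GGCC AATT"
print(gc_percent(example))
```

Pasting the full Gene sequence field into `gc_percent` should give a value that can be compared with the GC content listed in the record.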