Gene Information |
Locus tag | PHATRDRAFT_46440 |
Symbol | |
ID | 7201543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 345965 |
End bp | 349057 |
Gene Length | 3093 bp |
Protein Length | 1030 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180990 |
Protein GI | 219120506 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
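The coordinates, gene length, and protein length listed above are mutually consistent, and a short check can confirm this. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in exactly one stop codon; the function names are illustrative and are not part of the record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(345965, 349057)
print(length_bp)                  # 3093
print(protein_length(length_bp))  # 1030
```

Both values match the Gene Length (3093 bp) and Protein Length (1030 aa) fields of the record.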
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTTC CGCCGTTATC GGTGCGCAAA CGCGGCGTCA AGGATCCGCG GCCGTACGAG CCAACAACAA CCCAGGCGGC GGAGGATGCA GTCGCCCGGA TTATGGATTC GTTGGGGACT GCCGTTTCCC CACACTTGGG GAAAGTGAAT TTACAGAAAG GACTGTACAA TGAGAGCAGT GGCACATCAA CACGGTACCG GTACGAGCGT ACGGAAGAAA GTTGGTTGGA ACGCGCGTTA ACAACGGACG ATGACGACGC CATGGAGACG GACGAACTCG TTACGGGCTC GGCAAATTCC AAAAAGCCGC GGATTCTGGA ACGCGCCACG ACGCCGATCA TAAGTAACGC GAGTCACAGG TCGCAGGACC GCGACGAAAT CATGTCTCCC GTCCCTCTCC GTTCACCCTC TCCCACGGAT GATGCTGGGA TCACCGGGAA CGACACTTAT CTGCGGCGTA CCCCTACCAC GCTGCGTACG GGACACGTGG CTGGCCCATC GTCTCGGCAC GATGCTCTCG AGTATGGGCG CTATCACGAT GCCGTATTGC GCTTCGTACA AGTCAAACGT CGTGTCACGG AACGCGTCGA ACTCAAACAA CGTTCGTTGG CACTCCAGGA GGAGAGAACC TCACCGCGCG TACCGTTCGA CGACGCCATG GAGCTTGTTC CCGACGACCT TTTCATCGCG AACCGGGATG GAGATGCGTT AATCTTGGAC GAAACACGAG CGGACGTTGC GTTCTTGCGT TCCTTACACA ACCTCTCCTC GGACCATGTC AATGCAAGCG ATGCTCGTTT GGCACGCAAA GAGGGCAACT TCTGGCTCCT CCTGTCGCAT TTACGAGAGT TGAGTTTAGA TACCTTGATC TGGGCTGACG ATGCTGCCTC TCTACACCAG CACGAGAGTT CACTGTTGGC CTATGTTGAT TCCCTCGCTG CCAAGGTCAA CGCCGCCCCA CTGGAACTCA CGCAGGCATT GCAACTGAAT ACGTCTGAGT GTCCATCATT GCTTCGTCGT CGTCAGTGCA TCATGCGTTG GCTGGAAGCC TGTTTTCATC AGCAGTTGCC GACAGGATCC ACCCGCGCTC GCCATGCCAC CATAATTTGT GATTCCAAGC TCCTGCGGCA AGAAGGTTTG CCCGAGACGG ACAAGGATGC GGAAATACTG AAAAATGCGT TGGCCTTGAT TTTGGCCGGA CGAGAGGAGG ACGCACAAAT ATTGGCTCGT GATAGTGGCG CTCCTTGGCG AGCGGCGCTT TGGAGTGGAG GGAAACCACA AGGTGTCGTG CACAAGCCGA ACCTCGCAAC CGAGACGATG GATCGGATCC CCACGGGAAA TCCCCGCCGT GCTTTGTGGA AGCGCATGAT GTGGAAAAAT GCCGAAGCGC TGCATCAAAA GGGAAAGGCG GCGGCAGACG AAGCTGCGAT TGCCGCCATT CTTTCCAATA ATCTTAAGAT TGCCCTCTTG AACCCATCCC TCAGGACGTG GGAGAAGTGT CTGTATGTTG CCTTCCGATG CATGATCGGT CGCACCGAAG ATGACCTGCT ACACAAACAC AACAATCTTC GTCGGCAATA CCGCCCTCCC TTCCCTGGAA CACAGTTCGC ACAACACGAA CTCCAGCAGC TTCGTGACAC TGCTGACATT GCTGCTGAAG ATGAGGCCTC TGTTATTCAT AATATCTTAC CAAGCTCTGT TTTTGACGAA GTCAAAGATG ACGATGTAGT GACGGATGCC ACTTCGAGTT TTCTGGTTGG CAAAAGTTCG ATTGCGTCTT ACCTACAGGA CTCAATGGCG 
GACTTGGACG ATGCAGGAGA AATGCAGCTA CGGTTCCTCA CCCACCTGGG TTTGTACTTG GATTCACTAG CGGTAGGCAC AACGCCAATC TTCATTCAAG GAGTTTCGGA CTGGAAGAAC AGAATGTTGT TGAAGTACTT GCAGTATTTG TCCACCCGCG AAGAACTATG GCATTTGCTT GTACTTTACG CGTCGTTGCT ACCGGAGTCG GTCTTGACGT CTCAACTTCC CAATATGCTG CAAAGCCTTG ACAGTCAAGA AGGCCGCAGA ACGATTGTTG AACAGATGCG TGAACTTTTA CCGCGTGCAG GTTTAGATTT GGTAGTTCTT CAGAATGTGG TACAAGCTAC TTTGCAAACG ACGGATACAG AGAATTCCTG TGTCGACACT CCCACTCGTT TGGATGTCCA AAAGATGCGC TCGATTGCGT GGCTTTCTTA CAGTCAAGAC CATACGGTGG ATGCTCTCGT ATTTTCTAAC TCACTGCTAC GGCACTTCCT TCTTGGGGGT CGTCGAGCGA GTGCCATACT TTTCGTCGAA GACTTTCGGC TGGAAAGTGT CTTGGAGTTA GCGGAAGGCA ATGGAGAAGA TGAAGCAGCG GATGTAGACA CGTGGCGACG TGAACACATG GCACTGCAAT ACTACTTGGA TGCGACTCAA GCTATTGACC ACTGGCGTGA GATTATTTCT ACTGTTGAAA GCACCACCAA ACTTATTGAT GATCGGATCG ACATTGGTCG GTTGGATGAA ACAGACGTCT CGGTGGCATT GAAGATCGAA CGCCGTGCTT TGCTCGAAGA GAAGCGCAAG GCTAGCTTCT CTGTCGTGAG GGCGTCTAAT AGCGCACTCA AAGCTCTTTC CGCCGTCCTA AAGTATAGAG GCGGGTGGTT GTTACTGGAC TACGATGCCG CATCAAGGCA CTGCGACCAA AACGCTCGTT CGGCCGAACT CGATGCCTTG CGCCGCAAAA TTTTACCGCA TTGTATTTCT AGCTATTGCG AAGTCTGCAT GGAGACGGCA GTGTGGATGT CATCCTCCAT GGACGATGCT GTTGCTCAAC TGAGTGAAAG CCCTTCCGCT GTGTTGGATT CGCTCGACAG TCCCCAGGAC GGTGAAATAT CTCCAATCGC CCCTTTCTAT TGGCCACAGC AAGCATTGGA AATTGCCAAT GTTGTTGCAT CAGAAACCTA TGGTATCCTC TCTGCTTTCG GGACGGCAGA AAAGAAACAG CTCGTTTCCG ATTTAGCGGA GGCGTCTGTA GCCAACCTCT TTTATACTAC GAAAAGAGAC TAG
|
Protein sequence | MSLPPLSVRK RGVKDPRPYE PTTTQAAEDA VARIMDSLGT AVSPHLGKVN LQKGLYNESS GTSTRYRYER TEESWLERAL TTDDDDAMET DELVTGSANS KKPRILERAT TPIISNASHR SQDRDEIMSP VPLRSPSPTD DAGITGNDTY LRRTPTTLRT GHVAGPSSRH DALEYGRYHD AVLRFVQVKR RVTERVELKQ RSLALQEERT SPRVPFDDAM ELVPDDLFIA NRDGDALILD ETRADVAFLR SLHNLSSDHV NASDARLARK EGNFWLLLSH LRELSLDTLI WADDAASLHQ HESSLLAYVD SLAAKVNAAP LELTQALQLN TSECPSLLRR RQCIMRWLEA CFHQQLPTGS TRARHATIIC DSKLLRQEGL PETDKDAEIL KNALALILAG REEDAQILAR DSGAPWRAAL WSGGKPQGVV HKPNLATETM DRIPTGNPRR ALWKRMMWKN AEALHQKGKA AADEAAIAAI LSNNLKIALL NPSLRTWEKC LYVAFRCMIG RTEDDLLHKH NNLRRQYRPP FPGTQFAQHE LQQLRDTADI AAEDEASVIH NILPSSVFDE VKDDDVVTDA TSSFLVGKSS IASYLQDSMA DLDDAGEMQL RFLTHLGLYL DSLAVGTTPI FIQGVSDWKN RMLLKYLQYL STREELWHLL VLYASLLPES VLTSQLPNML QSLDSQEGRR TIVEQMRELL PRAGLDLVVL QNVVQATLQT TDTENSCVDT PTRLDVQKMR SIAWLSYSQD HTVDALVFSN SLLRHFLLGG RRASAILFVE DFRLESVLEL AEGNGEDEAA DVDTWRREHM ALQYYLDATQ AIDHWREIIS TVESTTKLID DRIDIGRLDE TDVSVALKIE RRALLEEKRK ASFSVVRASN SALKALSAVL KYRGGWLLLD YDAASRHCDQ NARSAELDAL RRKILPHCIS SYCEVCMETA VWMSSSMDDA VAQLSESPSA VLDSLDSPQD GEISPIAPFY WPQQALEIAN VVASETYGIL SAFGTAEKKQ LVSDLAEASV ANLFYTTKRD
|
| |
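The gene sequence above begins with ATG and ends with TAG, so it appears to be given in coding orientation even though the gene lies on the minus strand. The Translation table field of the record is empty, so the following minimal sketch assumes the standard genetic code (NCBI table 1); the helper name `translate` is illustrative. It translates the first three 10-base blocks of the gene sequence, which can be compared with the start of the protein sequence.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    Codons that are not in the table are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGTCGCTTC CGCCGTTATC GGTGCGCAAA"
print(translate(first_blocks))  # MSLPPLSVRK
```

The output agrees with the first ten residues of the protein sequence in the record (MSLPPLSVRK).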