Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44133 |
Symbol | |
ID | 7203886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1075030 |
End bp | 1078470 |
Gene Length | 3441 bp |
Protein Length | 1012 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186179 |
Protein GI | 219113191 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGATGACTTC CTAGACACAG CTTGGTGAGG GCACTACAAC AGTAGTACGC TGTTTGGATG CGACCTAACA CTGACTGTAA GTAGTAGCCT AGATCTTACA CAAGAGTTTC GCCCGTTGTG CCAGTCCTGT CTGTTAGGCA CGACGGTGGG TCCTTAAATC CTCCCGTTTC CAAAGACTTC CAAGAGAGCG TATCGGCCTT CGTCATGACA ACAATCCACA TTATTGATAG CTGGGCCGAT CTTCTTGGAG GAATCACGCT ATCGGATTGT CGCAACGTCA CTGACCCATC CTGTGTCGGC GGGGTTGCGA ATGTTGCCGC CTTTGCGCGG CACTTGACCC GGAGCGGGGC GAGCGTTACC GACAGTCCGA GCGGTGGGAC TGCGGAAGAT ATTGAGGAAC CGTACATAGT GACGCCGCGG CTCGATCGGT ACTCGCCATT CGTGCAAATG CATCCACTTG GATGGAGTGT GAACCGGTTG ATCTTTGTTG AAGTATTGCA ATGGAAATTC TTTATCTCGA GTCCCGCCTT GTTGTATTTA AGCAGCACTG CTCGTGGGAG CACTAACGGT AGCAGCACGA TTTCGACGAA CTCCTCGTTG ATGTCGGATT TGCAAACGGA CGAGTTGCCC TTCTTACTGA CTAACGTGGC TGTGCCGCCA GAGAACGACT GGTACCCCTA CACAAAGGCG ATATACTTTG ACGAAACCAC GCACCTGGCT TTTTTGTCGG TCGCGGATTC TGACGAGCCC TTAAATGCGC CGCAGATAGA ATCTGCAACG GGAGCCTTGA ATTATATTGC ACGTTTGAAC AAAGAAAGTG ACTGCGATGC ATTTGTTGAC GATGGGGGTG GAACTGCGGT AGCCCTGGGG AGTAGCTGGG ACAATCGAAC CTGTTGGATT CCAGTGGTTG CCTTTGGTGA TTCGGAAAAT CGATGGAACA ATTTCCTATT GGCGATGACT GCATTAGAAA ACCCTCCGTC TCTAATAATG GATATTGAAG GACACGATGC GCGATTTTTT ACACCGCAAA AGTACAACCA AACTTGGGTT TCTAGTTACA AAATGAATTC TACGACGTAT CGTCAACAAA GCATTGTTCT AGCAGAGGAC AGCCGAACAA TTGTGAGCGT GACGGCGACA GCCATACCCC TAAATGAGTT GCCCGACGAG TTCAAAGACA ATATCTACAC ATCCCACGTA ACCCGCATGT ACGAGTTGGG GCAGGAAGCG GCAAACAAAG ACCCAATTGT GGGAACAAGT ACTTTCATGC CAATTGCTAG AATTGACCAG TATAGACGGT GTATGGGTGG TGAATGCGAG ATCGGAAATC TCTTTACAGA TGCTCTGCGA TGGTATTCTA GCGCGGATGT GGCATTTGTT TCGAGTGGAG GATTGCGAGG CCAAGGATGG CCGGAAGGAG TTGTTCAAAT GTCCAATTTA TGGGAATCAT TGCCGTTTCC GAATACCCTA TGCTCCGGAA CCATGACTGG AGTATCGTTA TTCAAGCTGT TCAACTATTC TACTAGTGTT GCAAACTTTG AAGTCAAAGA AACAGTTTCG GGGGGACAGC TGCTGCAGGT GTCTGGGGCG CGACTTCGAT ATAATACGAA ACTCCCCCAA GGAGCGTCGC GAATGGTTAG GCTGGAGATT TGGGACAAAA ACGCCAATCA GTATGCACCA GTGCAACGCC TCCAATTGTA CAAGTTCGCA ACCGATAATT TTTTATGCGA AACCAACATA CCGTATCCTG AATTGTTGGG GCAAAATTTT TACATCGATG GGGAAGTTCC GGGTGTTGTG CGTGACGATT TACATCAAAA CATTGTTGCG GACTATCTTA CACAATTGAA CACAACTTAC CAAGCGACTA TTGAGGGACG ACTCATCAAT GACACTACTG TTCTGGATGC AATGAATTTG GTACAGATCG AAGGAGGCTG TGGACAAGGA ACTTATTGGG TTTTTACGCA ACAGAGTTGC AAGGTTTGTC CGAATACTGA TCAAGTCTAT TTTGGAAAGA AAGAGCTTGA ATTTGCGAGT GAAAGCGGAT CGTCGAAACC TGTTGAGGGG CGTTTCGAGA TACTGAATAA TGCGGGTTTC CCTGTATCGG TGGGTCCCAG ATCGTTTCCA TTCTGGGTGA CTTTGACAGT TTTCTTGTGC AATGGTACGA TCCCAATTGA TCCAATTCCA GCAGGTGTAA CGCGCGTGTT GCAATCCGGC GAGAAATTGA CGGTAGGTTT ATCTATCTCG TCTGAGCAAC TCGAAGCAGG AACAGCGGTT GCGACTGGAT CTTTTTCTGT GGTGGATGGC GGGAGCTTTC CTGGATGCAT TGGCAATGAA ATTTCTTTTG ACATTCTCGT TCGTGTGGAT CCCAGTCGGG AACTCAATCA GATAGGAGGA ATTCGATGGG TTGGCTGGTC GCTATTCATG GTCCTTGTTT TTTCTGCGAT GTTCTTTTAC ACTTGGGTAT GCCAGCACGA GCGAATTGCA GTTGTCCGTG CTATGCACCG CTTGTTTCTC AACACGGTTT GCCTAGGCAT TGTTGTTTTG GGCTCTGTGT TGATTCCAGT GGGTTTTGAC GACGGTGCAT TCTCGGAAAA CATATGCAAC AGCGCGTGTG CTTCAATCCC ATGGATCAGC GCAGCAGGGT TGAGTACAAT ATTTGCAGCT ATGTATCGCA AGCTGGGATC GATTGTCGGA AAAAATGAAG ATGCTCGTGA ATTTCGTGGT CGCCGTGTTG TTTTAACATT TGCCGTTTTC TTTGGCCTAA ATGCGTCGAT TCTAGTGCCT TGGAGTATTT TAGCCCCATT GCACTGGGAT CGAACGCCAC TCGTTCAAGA AGAATGGAAG AGCTACGGTC GTTGTTCAAC GAGCGATACG TCAAGTTTGG CTTTTGTGGT TATGGCCGGT GTTCTGAATG TGTCGGGATT TGTCTTGATA TGCCGGTTGG CTTATAAAGC CCAGCAAATA CAAGATAGAA GGGACCAATT CGACCAGGCC AAAAGCATTT CATTGGCCCT GTACAGCTGG ATTCAATTAG CAGTCGTGGG AATTCCTGTT CTCTCTCTGA TTAGCGCGGA AACCACTCGT GCTCGATATT TTATGATCGT TGCGCTCATC TTCGCCTTGT GTATCTCCAT GCTGCTTAGC TTGTTTATTC CGATGCAAAT GCAGAAAACG ATGCAAAGCG TGCGAACATT GAGTTTAGGC AGTCGCTTCC TGAGCTCATT TCGATCTAGA TAATCTGCGA CGACACAAGG AGGCGGGAAA CTCAACACAC AGCATTAAGA ATCAAACCGG TTTTACATAC CTTCCTTCCG TGCATTGGGG CAACAACCTC GGAGCCGATT CGAATTAGAT GACATAAGCG GCACCGTGGA CGACGCAGTC ATTCTGCAGT GTCCGGACCA AGTGCGATCA ATTTTCGGAC ATGAGGTTCG C
|
Protein sequence | MTTIHIIDSW ADLLGGITLS DCRNVTDPSC VGGVANVAAF ARHLTRSGAS VTDSPSGGTA EDIEEPYIVT PRLDRYSPFV QMHPLGWSVN RLIFVEVLQW KFFISSPALL YLSSTARGST NGSSTISTNS SLMSDLQTDE LPFLLTNVAV PPENDWYPYT KAIYFDETTH LAFLSVADSD EPLNAPQIES ATGALNYIAR LNKESDCDAF VDDGGGTAVA LGSSWDNRTC WIPVVAFGDS ENRWNNFLLA MTALENPPSL IMDIEGHDAR FFTPQKYNQT WVSSYKMNST TYRQQSIVLA EDSRTIVSVT ATAIPLNELP DEFKDNIYTS HVTRMYELGQ EAANKDPIVG TSTFMPIARI DQYRRCMGGE CEIGNLFTDA LRWYSSADVA FVSSGGLRGQ GWPEGVVQMS NLWESLPFPN TLCSGTMTGV SLFKLFNYST SVANFEVKET VSGGQLLQVS GARLRYNTKL PQGASRMVRL EIWDKNANQY APVQRLQLYK FATDNFLCET NIPYPELLGQ NFYIDGEVPG VVRDDLHQNI VADYLTQLNT TYQATIEGRL INDTTVLDAM NLVQIEGGCG QGTYWVFTQQ SCKVCPNTDQ VYFGKKELEF ASESGSSKPV EGRFEILNNA GFPVSVGPRS FPFWVTLTVF LCNGTIPIDP IPAGVTRVLQ SGEKLTVGLS ISSEQLEAGT AVATGSFSVV DGGSFPGCIG NEISFDILVR VDPSRELNQI GGIRWVGWSL FMVLVFSAMF FYTWVCQHER IAVVRAMHRL FLNTVCLGIV VLGSVLIPVG FDDGAFSENI CNSACASIPW ISAAGLSTIF AAMYRKLGSI VGKNEDAREF RGRRVVLTFA VFFGLNASIL VPWSILAPLH WDRTPLVQEE WKSYGRCSTS DTSSLAFVVM AGVLNVSGFV LICRLAYKAQ QIQDRRDQFD QAKSISLALY SWIQLAVVGI PVLSLISAET TRARYFMIVA LIFALCISML LSLFIPMQMQ KTMQSVRTLS LGSRFLSSFR SR
|
| |