Gene Information |
Locus tag | PHATRDRAFT_50372 |
Symbol | |
ID | 7199148 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011697 |
Strand | + |
Start bp | 124660 |
End bp | 128164 |
Gene Length | 3505 bp |
Protein Length | 1020 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185284 |
Protein GI | 219130254 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
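The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which matches the recorded gene length) and that the coding sequence includes one stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_bases(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information table.
gene_len = span_length(124660, 128164)   # Start bp, End bp
cds_len = coding_bases(1020)             # Protein Length in aa

# Bases in the gene span that are not part of the coding sequence.
non_coding = gene_len - cds_len

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length agrees with the listed 3505 bp, and the gene span is longer than the bases needed for a 1020 aa protein, which is consistent with the gene being marked as spliced. The record does not say how the remaining bases divide between introns and untranslated flanking sequence.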
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCTCTCAC CGACGGTGAA AAAAGTTTGG GTCTTGTGGT ATCGTCTTTG CCTGCGGACC TGTACGGGAC CCATCACAGT CACTACTTCT GTCTTGGCGA CGCTACCTTG CTATCAAAAT GACTTCTGGG CAGACGAGTA TGCCGGGAAA GGAAGAGGAC GAAGAAGAAT GGATCTTTAC TTTGCGGATT CCTTCGTCCT CCGCCAAGGA TGCTGCTGGC GGGGTCTACG AACTTAGTCA ATCCCAGCCG CCACCGACCT TATCTACTCA CGCTAGCAGT AGTGGTAGCT TCATGAACAC CAACAATAGC AATGATTCTG TTTCTGTAGA CGAAACGGCT GAGCAAGAAG CCCTCTTGGT TGCGGGGGCT GCAACACAGC AGATAATTCG ACCGCCTTCT CAGCACTCAT CCGCGAACCG TAACACCGAC TGCGCTACCG ATATCCCGTC TGGTTGGACT TGGCAAAGCC CTTATTGGAG TTTGCAGTCT GATGTTGTCC TGCAAGCGAC TTACCGTCCG TATTGGCAAG CCCTGCCCGA GTTCACCAAT TGTCGGGCCT GCCGAGCATC AGCCTACTGT CGGACTCGTT GTTGTCGGAC GTGTACTGGA ACGCGCGCTT TGGAGTTTTA TACCACTGTT TGGACGACTC AGGATGTCGT GAATGTGGAG GTGGTAGAAG AATGTATTGA AGTCGCAGGG GTGGGCCCGT GTCGGGCTTT GCGAGTGCAT TTGACACCGT TGGATACAGC CTGTTTGCCG GTCATTCCAG ATGCATTGCA GAAAGCCTGG GAGGGCCACG TCCATAATAT GCGCCAGGCC CACGATTCCA ACCTTAGCAA CAAAAATTCC TCCTCTCACA CCGCAAGCAA GTCAGATACG GTCGCCGGTC CTACTATAAT CTTTTGTTTG CCTACCAAAC AAGTTGATCC CCCTTTATTA TGGAAGGTAG AATTTGGTGG TGCTAGCAAT ATGTCACAAA CGCCTATTGC TCTGGAAAAT CTGACTGTTT TGGACGGGCA ACTGCAGCAA TTTAGGAACA CCGTTCCCTC ACCCACACAT ATTTACATTC ACGGATATCA GTCTTGGTCT TTTGCCGGTT CTATTGTGAA AGGGCAAGAT CAACCGCAGT CGGCCATGCC CGACTTTTTA AGTCGGGCCT TCAATTATGG CGGTTCGCCT CCACCCGTCT CCGACGATGT TCTTACGTAC GTGCCACCCT TGTCGCATAA TCATATTCAT CGCGACAACG ACGGCGATGC CTACACGGGC CCGCAATCCT GGAAAACACA TTACCAGTCC GACTTTTTTA CTTGCGTCAC GTCCGATGGA ACGATTCCGT CGTTTTGGAC ATCACGTCGC GAAAAGCAAT TTCCTTTTCA AGCTTTGGAC GAGACAGGTG GACCCGGACT AGTATTGGGG TGGCTGTCAC AACGCGAACA GTACGGCGTC ATTATGGCCG ATGTGGATTT AAGGCGGTAT GCCATGCACG TTTCCGGGCA CGGACAAATA ATTTGGGGCC GTGGCTCTGG TTCGACAACC ACAAATACCA TTGCTCTAGA GACTGATTGG GCCTATGCAC AACTGATCGC CCCACATTCA TACGACGAAG AACCCATGGT GCACTACCTC GAAGCAGCAG CGGGTTACAA TCAAGCCCGA CCGCTCCGCA ACGGCTCGTT ACTCACCGGG TGGTGCTCAT GGTACCATTT TTACGAAAAT ATTACGGCCG GTACGCTAAG TGAAAACGTA TCCAAACTAG CAGCTCTGAA GAACCGAGTC CCAACGAATG TGGTTGTAGT 
TGATGATGGG TATATGACAG CTTGGGGCGA CTGGGATTCA GTCAAACCCG GTGCCTTTCC GCAAGGCATG GCGGCTGTCG CTCGTGATAT TGTTGCACAA GGTGGTATGC GCGCCGGATT GTGGTTGGCC CCGTACGCAG CCGACAAGCA CTCTCGTTTG GTAAAAACCC ATCCCGATTG GATTATTCGC AACGACTCTG GTATTCCGGC AAATTCATCC AATTGTGGAA AGTTTTTTTA CGGACTGGAC GCCACCAATC CAGCTGTCCG GACCTATGTG TACGAATGCA TCCGACGTGC GGTACACAGT TGGGGTTTCG ACGTCCTTAA GATTGACTTC CTTTACGCCG CTTGTCTGGA AGGCAACGGT AAGCACGATT TGTCGCTGAG CCGGGCGCAG ACGATGGATC TCGCCATGCA AGCCATTCGA GATGCTGCAG GTCCCAATGT GTTTTTGATA GGCTGTGGTT GCCCAGTTGG ATCTGGTATC GGCTACGTGG ATGGCATGCG GGTTTCAGCT GATACAGGTC CAACTTGGTA TCCCGCTTTA CCGCTTCCGT GGTGGGATCA TGGGACTCTA CCTTGTTTGC GGTCAATGGT CCGTAACAGT ATGAGCCGAG CACCGTTGGG ACATCGATGG TGGCACAACG ATCCGGATTG CTTGCTATTG GGTGAAAGCA CCCGCCTTAC GGACGAAGAA GTAGCGAGTG CGGCCTCGGT TGTGGCCATG ACGTGCGGTA TGATGCTTCT TTCAGATGAT TTGACCAAGG TCAGCGTGGC GCGGACCAAT ATTTTGACCA AAATTTTCCC CATGACTGGT GTTACGGCCG TTGTCTTAGA TTTGCACAGT GCGAGTGACG GCTTGCCAAG TTTATTGCGA CTCTGGTGTA CGGACAAGTA CGACTTGTTG GATTCGTTTC GTGAGCGTAT GGTGGTGAGC GCGCAAGACC ACAATGCGGA AGCGACTTAC TTTGCGCGGC AGTCGTCATC GTATCATCCT GATAAGGACC AACAGCATCC GATCGAACGC CAACGTAGCT GTATTCACGT GACAAAAGGT CTGGGAACGT GGACGGTTGT TTCGGTCAGT AATTGGTCCG ATCGGACCGC TGTGGTTAAC TTACCTCCTC CTGCTTTGTT ACCTCCGCCC ATGACTGGAT GGGAGCAAGG CGATGAGGAG CCAGAATCAT TTTTGCAGAC TCCTGAAGAA GTTGACTGTG AGCAGCACGG TTGCCACGCC TTTGGATTCT GGTCGTCCAA GTACACTTGG TTGCCCAACC AAAAATACAA CGACAATGGG CAAGGCCCGG AACGAATTCT TCGTCGAAAG CTAGTTGCTC ACGAGACGGA AATCTATCAT ATTAAAGCCG TAACACCTGA CGCAGCGCAA TACGTTGGCA GTGATTTGCA CTTTTCCTGC GGACACGAGG TCCTGTACTT TCGGGCACAG AAGAATCAAG TCAGTGTCAC TCTCAAAACG AAATACCATC GTGTCGGGCA CATTTTCCTT TTCCTCCCCT GTATTAATAC GAATTCTGTC AAAGTGACTG TCAACGGAGA AGCTGGGCGA TGGCATGCGG TAGGAAATGT ACCGAACGGG CACGACAACG GACATGCCCA ACTGATTGGA CGAGTCTTCC GGGTGGCGGT CGTCGTGCAC GCCAATGGCC GTCCACAAGA TGGACAAGTC AAGGTTGAGT TTTAA
|
Protein sequence | MTSGQTSMPG KEEDEEEWIF TLRIPSSSAK DAAGGVYELS QSQPPPTLST HASSSGSFMN TNNSNDSVSV DETAEQEALL VAGAATQQII RPPSQHSSAN RNTDCATDIP SGWTWQSPYW SLQSDVVLQA TYPCLPVIPD ALQKAWEGHV HNMRQAHDSN LSNKNSSSHT ASKSDTVAGP TIIFCLPTKQ VDPPLLWKVE FGGASNMSQT PIALENLTVL DGQLQQFRNT VPSPTHIYIH GYQSWSFAGS IVKGQDQPQS AMPDFLSRAF NYGGSPPPVS DDVLTYVPPL SHNHIHRDND GDAYTGPQSW KTHYQSDFFT CVTSDGTIPS FWTSRREKQF PFQALDETGG PGLVLGWLSQ REQYGVIMAD VDLRRYAMHV SGHGQIIWGR GSGSTTTNTI ALETDWAYAQ LIAPHSYDEE PMVHYLEAAA GYNQARPLRN GSLLTGWCSW YHFYENITAA WGDWDSVKPG AFPQGMAAVA RDIVAQGGMR AGLWLAPYAA DKHSRLVKTH PDWIIRNDSG IPANSSNCGK FFYGLDATNP AVRTYVYECI RRAVHSWGFD VLKIDFLYAA CLEGNGKHDL SLSRAQTMDL AMQAIRDAAG PNVFLIGCGC PVGSGIGYVD GMRVSADTGP TWYPALPLPW WDHGTLPCLR SMVRNSMSRA PLGHRWWHND PDCLLLGEST RLTDEEVASA ASVVAMTCGM MLLSDDLTKV SVARTNILTK IFPMTGVTAV VLDLHSASDG LPSLLRLWCT DKYDLLDSFR ERMVVSAQDH NAEATYFARQ SSSYHPDKDQ QHPIERQRSC IHVTKGLGTW TVVSVSNWSD RTAVVNLPPP ALLPPPMTGW EQGDEEPESF LQTPEEVDCE QHGCHAFGFW SSKYTWLPNQ KYNDNGQGPE RILRRKLVAH ETEIYHIKAV TPDAAQYVGS DLHFSCGHEV LYFRAQKNQV SVTLKTKYHR VGHIFLFLPC INTNSVKVTV NGEAGRWHAV GNVPNGHDNG HAQLIGRVFR VAVVVHANGR PQDGQVKVEF
|
| |