Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43161 |
Symbol | |
ID | 7196915 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2203104 |
End bp | 2208142 |
Gene Length | 5039 bp |
Protein Length | 1601 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176931 |
Protein GI | 219110359 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.496244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAACAAAA AGGCTCCTCG TTGGGACGAA GCCATGCGAG GGATATAGCC TCCGTTCGAA CTAGCGATAC TCTCTCTCTC TTTGCACAAA ACTGTAAACG CTGAGAGATA AACTGATCTT GTTATTCGCT GTACCTTTTC TGTATCGATT AGTACCCCCA CGGTTTGGAC AAACTTTCAA CAATGGCGAA TACCGAAGGT CTTTCGGACG AAAACGTCGA GTCGCTGCTC CGCTTCGACT CCTCTGATCA CACGCGGGAC CGTCCCAGGA ACAACCCATC CGCACGGTTA CGCGAGCGGA TGGATGCTCC CATGGGCGAA CGACCCGCGG CGACAGTGAT CCGTCGATCG ACGGACAGTG GTCGACGACC TCGAGGCCGT GCCATCGTGG AACCCACGGC ACCGACCCGT CTCTCGACCA GTGCACACGG GTTACCTCAT GCCGACAACG GCAATGCTCC CTCGCCGCGT GCCGCAGCAG CGTCATCCCA GTATTTACCG CAAAATCGTC ATCAAGAACA ACGTCAGCCA GAACTAGAAC AAGCAGAATT GAATTTCCCG GTCGTGGGAT CGATTGTGGA ACGACCACGG AATCGCACGC CAGGCGCTTC TTCGTCCGAG CGCACACGTC CATCCAAATT TGCTCAGGAA AGGCACGTAC GCGCCGGCGA TGCAACGTCC TTGAATCAGG GCTTTCCGTC ACTGAACGTA CCGCTCGGAA CCTTTGTCAA GGACAACCAC CAACAACGAC GGAACGCAAC GAAAACAACG ACCAGTATTC CTCCCACATC CCCTCTACTA GTAGAGTCAG CAGAGGTGCC AGCGAGAACG TCCAATTCGA TTGACGAGCT ACGAACTATT AGTCAGCGAG ACGCCCAAGG TATGCTACAG AACATGACAC CTGTGGAGAT CCAATCGCAC GTACAGGACC TACGACAGGC GTTGCCGTCG TCTACACTCG CCTTCTTGCA ATCGCGATCC AAAAAGTCAC CGGTTGCAAA ACCTCCGACC GTACTTGCCG AAGCAAAGGT CGAACATTCG GAACCAGCAC GTGTAACACG TGACGATCGC GTGCTAGGCA AGGAAGAATT CTCACAGTTG CTCTCATCCA TCAGTACCTA CGAGGATCTA GATGCCGCCT ATGCGACGTA CGTACCCACT GAGCTACAGG AGAAGGCGGC CAGTACGCTC GATCAACAGC AGGACGATTC TTTTCCCATG GCGTGTGATC TGTTGCGCTC TACCGCTCCT CGACAAAATT TATGGGCCGC GCGGGCTGTG TATCATCGAT TACGTGACGA CTGGGAGGCT GGTCAGTTTT GCTCGCTCGA GACGGATCGA GATGTCGTTT GGCCATATCC GGTAGTGCTG CCAGTTTCGT TGCGGTGCTT GCTGGATAAT TCGTTCCAAC GAACCAATGG GTTTGTGCTA CATACCTACG TATTGCAATC AATTTACATG TTGCTCTTGT TGCGGTCGAG CCGTGAGCAT GTTTTGGACG TAACTGGCCA GTACGTGTCA TCCAGTATGG TGTATCAAGA ATGCGCTTTA GACGATGCCG TGCCGACGCC CCCGACTTCT GCATATTATC GCTCAAACCA AGTTACCCCT CTTTCCGTCG GTGCGCACGA AAATGTAGCG TACTCCACTT CGTCGAGCTC GGAGTCGGCT ACAATGGACG GACAAGCGTT TCAATCGGAT CCAATGTGGA CCTTGCTATC CAAAATGAGA ATCGTACCGC GCCTGGCAAG TTTGCTTTAT TGTCCAGATG TTCCACTTAT ACTGCTTCCT GCGAAAGCTT TAGTTTCGAT ATGCGGGATT TTGGCTATGA TAGGACAGCG GTCGCCGGGA GCGGCGTCGG CTATTGTTCA GCACCCGACG CTCTTGTCTA ATTTAGTGGA ACGAACCTGC GTACCATTCG ACGGAGAAAG TCGCTGTAAC CCGAATGTCG CTCTTCCAAC TATTTTGTTA CTTTGCACGT TGGCTCGGCA ATCGCGAGTG GCAGCTTTGG GCATTTCAGT GGATGCGGTG CTGCTGCAAA TTTTGTCACG CAAGGCTGAA CACGAGTCCG AGTATCGGCT ACAACGGTGG ACAGTTATTT TATGGCGTAC CTTGCTTCGA TACGGTCACT GCTTATCCCC TCTATCCGCT CTAGTGCCGT TAGCTGCTCC GCATCTATCA CTAGGAGTGT CTTATCGTTT TTCTTTGGCC TCCGATTTTG CATCTGCATT TGCGACGGTT CAACGCTGTG CATGGGTAGC TTCGCTGATG GACGAAAATG ATAAACGCAT TACTGCAGAG CAGCAAACAA TTTTGGCGCA CTCTGGTGCT TGGTTAGCTT CTTCGATTCG AAACGCAATT GTCAATGTTT CCACTCAAGT TGCCGTTTCT TCGATCGATG AAAGTACCTC GGATTGGATT GCCCGTCTTC GATATGGTTC CAGCTGCTTT CAATTGCTCG CTTCTTTTCT GGCCATTGCT CAAGACAATT CTACAAATAC TGAAGAATGT AAATCAGACA ACTTGACTGT TTTGGAAGAA AGAGACTGTG TGTCTGCGTT GGTATCTATC GAAAATTCGT GTGTCATGCA ACGAATCCTT GATGTTTCGA TGCGTTCCGC CTTTGTTGTC GAATTTTCTG GAAATGCTCC ATTCAACGAA GAAGAGGAAG CCTGCGCTTG CACCTTTCTC GTTGAGTATT TGAAGCTACT GGAATCTTTG AGAGAACGGT TCGATGTCGA CATTGATTCA ACCGAAACTG GGCGCAATCT CCGCTTACTT CTGGAAAGGA TTACTCGTCT TTTGGAAGAC AAATTTCTCT TCGCACGGCT GCGTGGCGAA GTCTCAGAGG ACGGTGCTGA GTCAAAGAAG TATAACCGCC AGCGGCGGGC CTGGTTAAAC CGGTGCCACA GTGGCATTGT GAAATTTCTT GTCACACAGC ACAGCACTCT TTCTGAGCTC TCTGGGCTGG CATGCACAAT TATCGGTCGT CTCGAGAACG GTGACGAAGC TCTGGCTGCG AACCTACTGA GTCATGACTG TCTATTCTCG TCGCAGAAAC AACAATTCCC AAGAGCGGTC TCGCCTATTT CAACAATGAT GGTTCGAGAA CTATGTCGTA CGCAGATAGG CAAGACTCAA CTAGATCACA GTTTCAAGCT TCAACATGGT TTAGGAATCA CGGCAGGAGG CGTGGGACAA TTTGACATAT CGTCTCTGTT GAGTCAAGCG GACGCTGGTC CTGCCCCGGA TCAAACTTCG CCTGATGCGT TCTTGCCTGT TGGTAAACTT TGGGTCTGGC AAATTCTTTC TGGTGCGTCG GCTCTCGGTG TTGGTCAAAT TAGACTCCAG GCGGGACATG AGGAAGTGAT TTGTATTCTT ATGTCATGCT TGGACCTTAT TCAAGGTTTT GAAAAGTACC AATACATCGG TGGCGAGGGA TATGGAAACA GTCTTCCACG GGGTGAAAAG CTATATCACT TGATGAATAT TTGCCTTCAG ACAGAAATTG TCCTTTCCGA AGGTATTATT TTAGCGGCTG CCACAAATTT GTTTGTTGAA TATACCCAGA GGAACTGCAA TCAGAGCTTC GTTGCCAACT TTTGCGACGC ATGCTTGCTT CATTCGACTG CAAGAAGTCG CCGGGAAGTC AACGAAAAAA AGACAGAAGA AGAGACGAAT CTTCTGGCAT TACTTGATGA GCCGTCGGAC ACTTTACTCT TGACGAAGGC GTCGATGAGG AGTTTGTCGG ACTTTGTCAT TGATTTATAC GAGTCCTTCG TCGATCATGG TGCACAGTAT CCCTTCTTCA CGAAATGTAT TCGCACTTTT CTCCTCCCTA CTTTTCCAGC AAGAATACGA TGCGAATTGC TAGGTCGCTT GCGGGGTCTA CTACACTTGC TTTCTCTCGA AGGGGACAAC ATGTCGGAGC TATTATACTT TTACCTATCG GCGAAATCTC CACTTGTTGA TGAGTCCTCT CGCGACGATC CAGAATTTCT GGATGAGATT ACTAGTGCAT ATGCGATAGG AAGTGGGACT CGCGGAGACG AGGGGTTCGT CTCCATGTTT GCAGTGGCAG CTTTGACAAA GAGTTTGGCA ATATCGCTGG CCTTTCCAAG TAGCGGACAA TCAGCCTCGA AGCGGCGAAT TCAACGCCTT GAGCATTCTG TGGCGATCAA AGTTCTTCAA AGCGCTGCTC GCTTTGCGTC GTCACGAAGG ACTTGCTCCG CTCTCGTATC TGCTGTCATG CAAGGAGATG CAAATGACGA TACATGTAAT CTGAATTCGG AACTGGTTTC AGCACCAATT GACGATGAAA AATGGAACGC AATTATAAGC GTTCTTCGGA ATCGATTCGC TGACTCTGTA AAAATTATGT TGAATGAAGG AACCGAAATT CGTGCCGACA TGGATCTGCT TCGCAACTTC AGTGGCTACT TTAACGCTCT GTTGGACGGT TGTTTTGCGG AATCTCGATC GCGGTGTGTG GAGTTGTTAA GCTTGGACAA GGCAGCTGTG CAAACTCTGA TCGATTTTGC CTACGATACA AACAGCGTCG ATTGGGACGC GGTAGAGGAT CTCTCTTTGC TTGATGAGGC AGCTGACTAT CTTCAAATTC ACAACGAACA GCTGGATGTT ACACTGCAGG CTTTGCATAT TGTGCAAGAT TGTGTGGAAC AAGGGTGGGT GTCCTTTACC AAGCCACGGC ATGTATTGCA ACTACCCATC AGTAAAAAGA ATCGTGCCTG GCGGCTCTGG GTGTGCTACG TGCTTTGTCT CGAGCTGCTG CCGCACCCCG AGACACACGT CGACCAGCCA ACATTTACTC AAATGCTTCA GAAGATAACC GGCGATGCCG TTGCCCTTCG TCGAGAATTG ATCGAATACG GGCTGGTGCT GAGAGAAGGA GACGGATCTT CGTATTGGCG TCCCCATTAT AGCGTAAGTA GGCTCCAAGA GTGGATAGCC GGCATTCCAC GTCGGGTCAT ATAGATTTGC ATGCAAGCCA ATACGAGCGA CGAAATTCGC AGCTTGTAG
|
Protein sequence | MANTEGLSDE NVESLLRFDS SDHTRDRPRN NPSARLRERM DAPMGERPAA TVIRRSTDSG RRPRGRAIVE PTAPTRLSTS AHGLPHADNG NAPSPRAAAA SSQYLPQNRH QEQRQPELEQ AELNFPVVGS IVERPRNRTP GASSSERTRP SKFAQERHVR AGDATSLNQG FPSLNVPLGT FVKDNHQQRR NATKTTTSIP PTSPLLVESA EVPARTSNSI DELRTISQRD AQGMLQNMTP VEIQSHVQDL RQALPSSTLA FLQSRSKKSP VAKPPTVLAE AKVEHSEPAR VTRDDRVLGK EEFSQLLSSI STYEDLDAAY ATYVPTELQE KAASTLDQQQ DDSFPMACDL LRSTAPRQNL WAARAVYHRL RDDWEAGQFC SLETDRDVVW PYPVVLPVSL RCLLDNSFQR TNGFVLHTYV LQSIYMLLLL RSSREHVLDV TGQYVSSSMV YQECALDDAV PTPPTSAYYR SNQVTPLSVG AHENVAYSTS SSSESATMDG QAFQSDPMWT LLSKMRIVPR LASLLYCPDV PLILLPAKAL VSICGILAMI GQRSPGAASA IVQHPTLLSN LVERTCVPFD GESRCNPNVA LPTILLLCTL ARQSRVAALG ISVDAVLLQI LSRKAEHESE YRLQRWTVIL WRTLLRYGHC LSPLSALVPL AAPHLSLGVS YRFSLASDFA SAFATVQRCA WVASLMDEND KRITAEQQTI LAHSGAWLAS SIRNAIVNVS TQVAVSSIDE STSDWIARLR YGSSCFQLLA SFLAIAQDNS TNTEECKSDN LTVLEERDCV SALVSIENSC VMQRILDVSM RSAFVVEFSG NAPFNEEEEA CACTFLVEYL KLLESLRERF DVDIDSTETG RNLRLLLERI TRLLEDKFLF ARLRGEVSED GAESKKYNRQ RRAWLNRCHS GIVKFLVTQH STLSELSGLA CTIIGRLENG DEALAANLLS HDCLFSSQKQ QFPRAVSPIS TMMVRELCRT QIGKTQLDHS FKLQHGLGIT AGGVGQFDIS SLLSQADAGP APDQTSPDAF LPVGKLWVWQ ILSGASALGV GQIRLQAGHE EVICILMSCL DLIQGFEKYQ YIGGEGYGNS LPRGEKLYHL MNICLQTEIV LSEGIILAAA TNLFVEYTQR NCNQSFVANF CDACLLHSTA RSRREVNEKK TEEETNLLAL LDEPSDTLLL TKASMRSLSD FVIDLYESFV DHGAQYPFFT KCIRTFLLPT FPARIRCELL GRLRGLLHLL SLEGDNMSEL LYFYLSAKSP LVDESSRDDP EFLDEITSAY AIGSGTRGDE GFVSMFAVAA LTKSLAISLA FPSSGQSASK RRIQRLEHSV AIKVLQSAAR FASSRRTCSA LVSAVMQGDA NDDTCNLNSE LVSAPIDDEK WNAIISVLRN RFADSVKIML NEGTEIRADM DLLRNFSGYF NALLDGCFAE SRSRCVELLS LDKAAVQTLI DFAYDTNSVD WDAVEDLSLL DEAADYLQIH NEQLDVTLQA LHIVQDCVEQ GWVSFTKPRH VLQLPISKKN RAWRLWVCYV LCLELLPHPE THVDQPTFTQ MLQKITGDAV ALRRELIEYG LVLREGDGSS YWRPHYSICM QANTSDEIRS L
|
| |