Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_20872 |
Symbol | |
ID | 7201683 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 628326 |
End bp | 632721 |
Gene Length | 4396 bp |
Protein Length | 1317 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180870 |
Protein GI | 219120255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.358862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGAAGCCTC GTACGGCCGA GCCACCCAAT AACATACACA GCTTCGAACA CCAAAGCCGT CCACCGATTT CCATTTCCCC TCCGTTCTTT AGTATTCTAC GAGGAAGCAT CGTGGGACTG AAGAATCAAC GTTAGTACTC TAAAAGGAGC TTACTTTATC ATCAAGTCAA TTCCCTTGGT GTCGCTATAT ACAGCTGGGA ACAACACAAC GACCATGGAA GAATCAGCGC GAAAAGATGC CTCTTCGGAA CGCAAACGCA TCCCCGAAGA GGAAGCGTCG CTACCTTCCC ATCTCTTCTT CTTTTGGGCT CGCGGTCTCT TTCAAAGAGC TTCGGTGTTG AGCAAGCAAG GCAAAGCTCT GGAACACGAA GACTTGCTCC CTCTCCCTAC GATTGACTAC GGTAAACGCA TTGGACCGGC GTTTGCCAAT GCCTGGAACA AGGAAGAAGA GCACATGCAA AGCGAGCAAA AAAGACACAG CGCGTCGGAA GCGCCCACGG TAATTGGTGC GGGTTTGGCG GATGCTGTGG ATGGTAGCTA TTCCACAACC CGTGTTCGCC ACGCCATTTT TGCGGTAATC GGTAGGCGAT TTCTATTTGC TGGTCTCATC AAAGTTCTAA ACACGGCTTT GCAATTCAGC TTCCCCCTAC TTCTCAACGA GATCCTCGCA TTTATAGAGG ACACGCAGGC TGGAAGAATT CCCGAAGATG CTTCGTGGGA AGACAAGTAC CGTGGCTACT GGCTATCGGC TATTTTGTTT GCCGCGATGG CAGCGAAAGC CATTACGGAA AACGTATACT TTCATAAGGT ATACCGAGCT GGCTACCAGG CTCGAGTTGC TGTTTCCGCG GCTGTTTATA ATAAGGCTCT ACGTCTAGCA AACGCCGAGC GCCAAGGCAC TACATTGGGC GAGCTTATCA ATTTAATGCA AGTCGATGCC ACCAAGATTG AAATGTTTGT ACCTCAAATT CACGTTTTGT GGGATGGTGT GCTGCAAATT TGTGGATACA TTACTATCCT GTATACGCTA ATTGGATGGC CGTGTTTTGC GGGTCTAGCA ATCATGATGT TTGCTGGCCC GGTACAAGGA ATCATCATGA AGCGATTGTT TGCTTTGAAT CGTACTATGG TCAAGCACAC GGATTCTCGC ATCAAGACGA CCAACGAAGC TTTACAAGGC ATTCAATGTG TGAAGATGTA TACTTGGGAG GAGTCCTTTC AACGCGAAAT TGGTAAAGCC CGGAACGAAG AATTGGATAA TCTGAAGGGT GTCGCGTACT TGCGCGGTTT TTCGCGGGCA TACATGGGTG CCTTGCCGGG AATTGTGGCG GTCGCATCGT TCATTGTTTT CGCGGCCGCC AAGACTGGCT CCACTATTTC CGCATCAACC CTATTTGCTG CTTTAGTGGC GTTCGATCAG TTGCGATTTC CTTTGCTCTT TTATCCTTTG GCGTTGGCAC AGCTTGCACA AGCTAACGTT AGCGCTCGCC GTGTTGAAAT CTTTTTGCAA ATGCAAGAGA TTGGAAAAGA CGATTTGAAA GACGGAGGTC TGTACTTCCG TGACGACAAA AAAGCAGAAG GCGGTGGCGA AATTGCTGTC AAAGATGTGA ACATTTACTG GAGTGATCCC AATGTGCCAA TTGACGCTAG CGATGACGAC AATCATTCCG TAACTACAAA AAGTGAAGTG AGTTCAATGG ACGAAGCGGA AACCCCAACC AAACGCTTTC CAAAGGCCAT TCTTGAGAGC GTTTCTCTCC GAGTGGCCCC GGGAGAGCTC TGTGCGGTTG TCGGCCGTGT CGGTAGCGGC AAAAGCACAC TTTGCTCTGC AATTTTAGGT GAAACTCTAT TACAGAGCGG AGAAGTTCAA GTTAAGGGGA AAATTGCATA TGCATCGCAG TCAGCCTGGA TACTGAATGC GACACTGCGT GACAACATTC TGTTTGGTAT GCCATTTGAT CAAGAAAAAT ATGATAAAGT GCTGAAAGCT TGCCAGCTCT CACATGACCT GGATATGCTT GACAACGGTG ACATGACCGA AATTGGAGAG CGAGGCATTA ATCTGTCTGG TGGTCAAAAG CAGCGCGTTT CAGTTGCTCG TGCGGCATAT TCGGATGCCG ACCTTGTAGT GCTGGACGAT CCCTTGTCCG CTCTCGATCC TGAAGTGGGA CGCCAGTTGT TTGAGGAGTG CATTGTTGAT CTAATGAAGG AAAAAACTCG ACTCTTCGTC ACAAATCAAC TCCAATTTCT TCGGTATTGC GACTCAGTTG TTGCTCTTGG GAAGCGAAAG GTCATCGAAC AGGGAACGTT TGACGACCTG AATGCTGCCG AAGGTGGAGA AGTGAGGCGA CTTTTGAACG AGTTGAAGTC TTCCGAACAG TCACAAAACC ATGAACAGGA GGAGAATTCT AAGGTGGCAA CTGTTGCGAG AACGGCATCC GCCGCAAAAG ATCCCTCCGT CAACAGAAAG AAAGAGAAGA AAAGCGATGC TGGTCTCGTG ACAAAGGAAG AGCGGAATAT TGGGGCAGTG TCATGGGAGG TATACAAGAA ATACGTACTA GCCGGGGGTG GTTATTTCAA GTTCTTCTGC GTGTATTTTG GATTTGTTCT CTCCGCAGCT AACGGTTTGG CCAGCACATC CTGGGTGTCT TTTTGGACGA GCGATTCTGA GTACGAAAGG AACTCACAGG TGTTTTACCT TAGTATGTAC GCTATGCTTG CAGTTACTCT CGGACTGTTC ACCTACATGC GAGCTTTCCT CCTCGCTCGG TTCGGCGTTC GTGCCGCGGA AAAGTTTCAC AAAGACTTGC TGGAGTCCGT TCTCCAAGCA CCCCAAAGTT TTTTTGATAC TACACCTGTG GGTCGGATTC TTTCTCGATT CTCGAAGGAC ATGTATTCGA TCGATGTTGA GTTGAGCGAC TATTTCGATT TTTTCCTTTT CACGTCACTT ACTGTCGTCG TTTCTCTGGG AACAATCATG TTTGTGACGC CCTGGTTCGG AGTTGCTATT CTACCACTGG GACTTGTCTA TTTTCGTGTG CTTAATTACT TTCGGAACGT CTCTCGTGAG ACCAAGCGCT TGGAAAGTAT TTCGCGCTCT CCTGTATACG CTCATTTCTC TGAAACCCTC GGTGGGCTTT CCACCATTCG AGCCTATGGA CAGTCCATTC GCTTCATGGA AGATTTTGAA GGCAAAGTTG ACTACAATAC TCGTGCTTAC TATAGCAATA AGACGGCTGA CCGATGGTTG TCAGTTCGTC TTGAATTGAT CGGTGCAACG ATTGCAGGGC TTGCAGCGGT ATTCTCCTCC AACGTTGCTA TTTCTGATTC TGTTTCCGGT CAAGACAGCG ATAGCAATTT TGCTTCGTTG GCCGGTTTGT CTCTTTCCTT TGCTATCTCT TTAACCAGTT TACTAAACTG GTGCGTACGC TCGTTTGCGC AGCTCGAAGC CGCCATGAAT GCGTGTGAAC GTGTGCTATA TTACACGGAG AACATTCCGC AAGAAGCCCC GCGTACTTCG GACGAATTGG AAGACGCTAC CTCCTCGTCT ACCGAGCACT CTTTGTCGAA TCCCGCTGTT TATGCCACTT CAAAATCTGG TGGAAAAGCG GATCGTGCTG CGTTTAAGTG GCCCGACAAG GGAGAAATTA CGTTGAAGAA TCTTCGAATG CGGTATCGAG CGGAAACACC GCTGGTCTTG AAGGGACTAA ACGTGACCAT TCATGGTGGA GAAAGGATTG GAGTCGTAGG ACGTACGGGG AGTGGCAAGA GCTCACTCCT GTTGACTCTG TTGCGTTTGG TGGAACCTTC CTTAGAAGAG GGAGATTACC AAGCTCCTCT TTCAATCGAT GGAGTTGACG TGCTTCGCAT CGGCCTGAAA GATCTCCGCT CCAAGCTTGG TATTATTCCA CAAAACCCTG TTTTGTTTTC CGGTACCGTT CGTAGCAACA TTGATCCGTT CGACGAATAC TCCGACAAAC AAATTTGGGA TGCCTTGTCC CGATGCGGAA TGAAAGAGTC GGTCGAAAAT ATGCCGGGTA TGCTGAATGC TAGTATCGCT GAATACGGAG AGAATTTATC GGCCGGAATG CGCCAGATGC TGGTCCTTGG TCGTGCTTTG TTGAAGCAAT GCCGTATTTT GCTCTTGGAT GAAGCCACTT CGAGCGTGGA CTACGAGACG GATCGTGAGA TCCAAAGAAC GCTGCGGGAA GCCTTTAATC AGTGCACCAT TCTCACTATT GCTCATCGCA TCAATACTAT TATGGACAGC GACAAGATTC TGGTCATGAA GGATGGATAT GTGGAGGAGT TTGCCCCTCC TCAAGAGCTT CTCAAGGACG AGAATTCCAC CTTTTCGGAA ATTGTACGAC ACGCCAAGTC CGGAGAGCAT CAGTAG
|
Protein sequence | MEESARKDAS SERKRIPEEE ASLPSHLFFF WARGLFQRAS VLSKQGKALE HEDLLPLPTI DYGKRIGPAF ANAWNKEEEH MQSEQKRHSA SEAPTVIGAG LADAVDGSYS TTRVRHAIFA VIGRRFLFAG LIKVLNTALQ FSFPLLLNEI LAFIEDTQAG RIPEDASWED KYRGYWLSAI LFAAMAAKAI TENVYFHKVY RAGYQARVAV SAAVYNKALR LANAERQGTT LGELINLMQV DATKIEMFVP QIHVLWDGVL QICGYITILY TLIGWPCFAG LAIMMFAGPV QGIIMKRLFA LNRTMVKHTD SRIKTTNEAL QGIQCVKMYT WEESFQREIG KARNEELDNL KGVAYLRGFS RAYMGALPGI VAVASFIVFA AAKTGSTISA STLFAALVAF DQLRFPLLFY PLALAQLAQA NVSARRVEIF LQMQEIGKDD LKDGGLEVSS MDEAETPTKR FPKAILESVS LRVAPGELCA VVGRVGSGKS TLCSAILGET LLQSGEVQVK GKIAYASQSA WILNATLRDN ILFGMPFDQE KYDKVLKACQ LSHDLDMLDN GDMTEIGERG INLSGGQKQR VSVARAAYSD ADLVVLDDPL SALDPEVGRQ LFEECIVDLM KEKTRLFVTN QLQFLRYCDS VVALGKRKVI EQGTFDDLNA AEGGEVRRLL NELKSSEQSQ NHEQEENSKV ATVARTASAA KDPSVNRKKE KKSDAGLVTK EERNIGAVSW EVYKKYVLAG GGYFKFFCVY FGFVLSAANG LASTSWVSFW TSDSEYERNS QVFYLSMYAM LAVTLGLFTY MRAFLLARFG VRAAEKFHKD LLESVLQAPQ SFFDTTPVGR ILSRFSKDMY SIDVELSDYF DFFLFTSLTV VVSLGTIMFV TPWFGVAILP LGLVYFRVLN YFRNVSRETK RLESISRSPV YAHFSETLGG LSTIRAYGQS IRFMEDFEGK VDYNTRAYYS NKTADRWLSV RLELIGATIA GLAAVFSSNV AISDSVSGQD SDSNFASLAG LSLSFAISLT SLLNWCVRSF AQLEAAMNAC ERVLYYTENI PQEAPPDRAA FKWPDKGEIT LKNLRMRYRA ETPLVLKGLN VTIHGGERIG VVGRTGSGKS SLLLTLLRLV EPSLEEGDYQ APLSIDGVDV LRIGLKDLRS KLGIIPQNPV LFSGTVRSNI DPFDEYSDKQ IWDALSRCGM KESVENMPGM LNASIAEYGE NLSAGMRQML VLGRALLKQC RILLLDEATS SVDYETDREI QRTLREAFNQ CTILTIAHRI NTIMDSDKIL VMKDGYVEEF APPQELLKDE NSTFSEIVRH AKSGEHQ
|
| |