Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47974 |
Symbol | |
ID | 7203223 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 598961 |
End bp | 602043 |
Gene Length | 3083 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 60% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182428 |
Protein GI | 219124264 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0286581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCTCCATC GTCGTTTTGT GTTCTTCAAT AGAACTACGA AGCAACCCCT CCGTCTCTCT GCTCCCAGCC CTGCCCTACC GTCCATCCGC CTTATCCTCA GTCCCAATGT CGACCTCCGC TCAATTCAAA CTGAGCGACT TTCCTCACAA AGTCCTCGAA CCCATTGCCA CCCTCACCGC CCCACCGACT TACGCCACTC TCAAAGTGGC CCAACGTCAA CTCAGTACCA ACGCCGCCGC CATCCCTACG CTCAACGGCG GCGGCGCGCA CGGCCACATG GCCCTCACAC TTACTGCCCG CGCCTACGCC GACATCAGCG ACGTCCCGTT CGACATCCCC GTCGCCCCTC CGGCCAACCC TCCCGTCGGC ACCACGCAAC CGCAAATCAC CGAGTTCAAC CGCATCCACC AACGCGATGC CGACATTTAC AACCTTTATG TCGCCGTCAA TAACGCCCTC CGTCAGCAAC TCCTCGACGC CATCCCGAAA ATCTACGTAC GCGCCCTCGG GCATCCCATT TTCGAATTCA GCACCGTCAC CTGCCTCGAT CTGCTCTCTC ACCTTTGGAC CAAATACGGC ACCATCAAGC CCGCCGACCT TCAGAACAAT TTGCAGTCCA TGTATACCCC GTGGAACACC GCTGAGCCCC TTGAGACTGT TTTCCTTCAG CTCGACAACG CCATTGCGTT CTCTATTGAC GGCAACGACC CCATCTCTGA GGCCGCCGCA GTTCGTGCCG GCTACGATGT CCTTGCCCAC TCCGGCCTTT TTCCCCAAGA CTGCAAGGAG TGGCGGAAAT TACCCCTTGT TTCTCACACC CTTGCCAACT TCCAGGCCCA CTTCACTCTT GCCGACAAAG ACCGGCGCCT CACGGCCACT ACTGGATCCC TCGGCTACGC GAATGTGCTC ACGGCCACTC CCTCGCTTGC TCCCACCATT AGCTTGGACC CTCTCAGCCT TCCTTTCTCA GCTCTCTCTA TGTCGGATTC CTCTGTTCAC TCGCCTGCCA TGACCTACTG CTGGACTCAC GGCACCAGCA AAAACCGGCG CCATACCAGC ACCACTTGCA AGAACAAGGC ACCGGGCCAT CGCGACGACG CGACGGCGAC CAACACGCTT GGCGGCTCCA CCAAAATTTG GACAGCCCCC AAGCCCCCTG AATAGGAAGG AGGGACGGCT ACGCCGATGA CTAACTCTAG TAATATCGAT CATTTAAATC ATATTACTAG TCTTAATTCA TCTGTAGTCC CCTCCCCGCC TAGTCCACAC ACCTCAGCAA TTGCCGACAC AGGCTGCACC GGTCATTACA TCACGGTTGA CTGCCCCCAC CAAAACAAAC AACCAGCAAA CCCAAGCCTC TCCGTCTGTG TCCCCAACGG CTCCGTCCTC CGCTCAAGCC ACGTTGCCAC CCTGGCCCTT CCTGGTTTCT CCCCTACCGC CTGCCAAGCC CACATATTTC CTGGGCTCGC TTCACATCCG CTCCTCTCCA TTGGCCAACT ATGCGACGAC AGCTGCACAG CCACCTTCTC AGCCACTCGC CTCGACATTC ATCGCGACCA AACGCTGCTG CTCTCCGGCA CCCGCTCCCG CCACACCGGC CTCTGGCACC TCGATCTCGC CCCATCCCCT CCTCCCGCCA CAGCCCATGC TCTCATCCCT CATTCCTCCC TGACCGACCG CATTGCCTTT ATCCATGCCT CCCTCTTCTC CCCGGCTCTC TCTACTTGGT GCAACGCCAT CGACTCCGGT CATCTAACCA CCTTCCCGGA CATCACCGCC CGCCAAGTAT GCAAATACCC ACCGGCCTCC CCTGCCATGA TTAAAGGCCA CCTCGATCAA CAACGGGCCA ACCTCCGGTC CACCAAGTCT CCTCCCATTG GTCCCCTGGC GGCCCCCATT GCCCCTACTG CCACCTCAGC TCACGACCGC CCACCTGTTG CTCGCACGCA CCACGTCTTT GCCACCCATC AGCGCGTCAC TGGACAGATC TACACTGACC AACCAGGCCG TTTTCTCACT CCTTCCAGCG CTGGACACAC CAACATGCTC GTCCTCTACG ATTACGACAG CAATGCCATT CACGTCGAGC TGATGAAGAG CAAATCCGGT CCCGAGATCC TTGCGGCGTA CAAACGTGCA CACACGCTTT TCACCGAACG CGGCCTCCGA CCTCAACTCC AACGCCTGGA CAACGAAGCC TCTGCAGCCC TCCAAACCTT CATGACTTCC GAACATATCG ACTTTCAGCT TGCTCCCCCG CATCTGCACC GTCGCAATGC AGCCGAACGG GCCATCCGCA CCTTCAAGAA CCACTTCATT GCTGGCCTCT GCAGCACTAA CCCGGATTTC CCGCTCCACC TGTGGGACCG ACTCATTCCC CATGCCCTCC TTACCCTCAA CTTACTCCGT AGCTCCCGCC TCAATCCCAA GTTATCGGCC CACGCTCAGC TCCACGGTGC CTTCGATTAC AATCGCACTC CACTCGCTCC CCCTGGCACT CGCGTCCTCG TTCACGTTAA GCCCGCCGTT CGCGAAACAT GGGCACCCCA TGCAGTCGAA GGTTGGTATC TCGGCCCAGC TCTGCACCAT TATCGCTGCC ATCGCGTCTG GATCACCGAA ACACGGGCAG AACGTGTCGC CAACACCCTT TCTTGGTTAC CTAGCCAGAT CCCCATGCCT ACCGCCTCAT CCAACGACCG TGCCCTGGCC GCCGCCCGCG ATCTGGTCCA TGCGCTCCAA AATCCCTCCC CTGCTTCCCC GTTTGCACCT CTCGACGCAC ACCAGCACCA GGCCCTCACC CACCTTGCCG ATCTCTTTGC CACCATTGCC GCACCTGCCT CTGCCGCCCA GACACCTGCT CCCGTCCCCA CGGTCCGTCC CCCTGACCTA CCCGCCACCC CACCTCAGGT CCGCTTTGCC GTCCCGCTGG TCACCGCTGC ACACGCCCCT GCCCTTCCGA GGGTGCCAAC ACCCTCGCCC GCACTTCCGA GGGTGCCCAC CATGGCCACC TATTGCTCTC GCACAGGTAA CCCCGGCCGC CGACGCCGCA CAGCACGCAA ACAGCCACCA ACCCCAACCC TAG
|
Protein sequence | MSTSAQFKLS DFPHKVLEPI ATLTAPPTYA TLKVAQRQLS TNAAAIPTLN GGGAHGHMAL TLTARAYADI SDVPFDIPVA PPANPPVGTT QPQITEFNRI HQRDADIYNL YVAVNNALRQ QLLDAIPKIY VRALGHPIFE FSTVTCLDLL SHLWTKYGTI KPADLQNNLQ SMYTPWNTAE PLETVFLQLD NAIAFSIDGN DPISEAAAVR AGYDVLAHSG LFPQDCKEWR KLPLVSHTLA NFQAHFTLAD KDRRLTATTG SLGYANVLTA TPSLAPTISL DPLSLPFSAL SILNSSVVPS PPSPHTSAIA DTGCTGHYIT VDCPHQNKQP ANPSLSVCVP NGSVLRSSHV ATLALPGFSP TACQAHIFPG LASHPLLSIG QLCDDSCTAT FSATRLDIHR DQTLLLSGTR SRHTGLWHLD LAPSPPPATA HALIPHSSLT DRIAFIHASL FSPALSTWCN AIDSGHLTTF PDITARQVCK YPPASPAMIK GHLDQQRANL RSTKSPPIGP LAAPIAPTAT SAHDRPPVAR THHVFATHQR VTGQIYTDQP GRFLTPSSAG HTNMLVLYDY DSNAIHVELM KSKSGPEILA AYKRAHTLFT ERGLRPQLQR LDNEASAALQ TFMTSEHIDF QLAPPHLHRR NAAERAIRTF KNHFIAGLCS TNPDFPLHLW DRLIPHALLT LNLLRSSRLN PKLSAHAQLH GAFDYNRTPL APPGTRVLVH VKPAVRETWA PHAVEGWYLG PALHHYRCHR VWITETRAER VANTLSWLPS QIPMPTASSN DRALAAARDL VHALQNPSPA SPFAPLDAHQ HQALTHLADL FATIAAPASA AQTPAPVPTV RPPDLPATPP QVRFAVPLVT AAHAPALPRV PTPSPALPRV TPAADAAQHA NSHQPQP
|
| |