Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37580 |
Symbol | |
ID | 7202428 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 467472 |
End bp | 470546 |
Gene Length | 3075 bp |
Protein Length | 982 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181563 |
Protein GI | 219122461 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.252616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCGT CTGCCGATTT CACCATCTCC GACTTTCCTC ACAAAGTCCT CAATCCAATC GCCACCAATA CCATCGCACC CTCCTATGCG TCGCTTCTCC TCGCCCAACG CCAGCTCAGC GCCAATGCAT CCGCCATCCC CAGCCTCAAT GGCGGTGGCG CCCATGGCCA CATGGCCCTG ACGCTCACCG CCGCCGCGTA CGCCGAACTG TCCGACGTCC CCTTCGTCAT TCCCGTTGCT CCCCCGGCCG ACCCCGAACC GGGCACCACG CAACCTCAAA TCACGGAAAA TAATCGGCTC CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCAG TCAATAATGC CCTCCGTCGC CAGCTCCTCG ACGCCGTTCC TCGCGTCTAC GTCCGGGACT TGGAGCACCC CCAGTTCGCG TACAGCAAAG TCACCTGCCT TGACCTGCTG GACCATCTCT GGCGCAACTT CGGCACCATC TCCGCTTCTG ATTTAAAAAA CAACATCCAG TCGATGTACA CCCCCTGGAA CCCAGCCGAC CCGATTGAAA CCATCTTTCA CCGCCTGACC GACGCCATTG CGTATTCGAC GGCAGGCCGC GACCCCATCT CCGAAGCCGC TGCCGTTCGC GCCGGCTACG ACGTTCTCGA GCATTCCGGC TTGTTCCCTC GTGCCTGCGA AACTTGGCGC ACTGCCTCGC CCGACACGCA TACGCTTGCC AACCTCCGTA CTCTCTTTAA GGTTGCCGAC ACGGACCGCA AGCGCACCGT CACCACGGGC GCCCTTGGCT ACGCCAATGC CCTTTGTGCC CCTTCCTCTG CCCCCCCTTC GATTGTGTCC GACACCCTCA GTCTTCCCTT TTCTGCTCTC TCTGTGTCGC ATTCCTCTGC CGCCACCACG GAGAAAACAT ATTGCTGGAC CCATGGCTCC AGCAATAACC GTCGGCACAC AAGCGCCACC TGCAAGAACA AGGCTCCTGG GCACCGCGAC GACGCCACTG CTGCCAATCC ACTTGGTGGG TCAACCAAAA TTTGGACTGC CCCCAAACCC CCTGAATAGG TCAGAGGGAC GGCTACACCG ACACTTAACA CTTGTAATAA CGATCTAATC AATCATATTA CTAGTCTTAA TTTGTCTGTA GTCCCCTCCC CGCCTAGTAT TACAACCTCG GCCATTGCCG ACACCGGGTG CACAGGACAC TACATTACCG TGTCCTGCCC CCACTTCAAC CAACAGCCAG CCTCCTCTCC ACTCTCTGTC CCCGTTCCCA ACGGCGCTAC CCTCCGTTCC AGCCACACGG CCACTCTCGA CCTCCCTGGT TTTTCCCCTG CCGCTTGCCA AGCTCACATC TTTCCCGGCC TTGCTTCACA CCCCCTCATT TCCATCGGCC AGCTCAGCGA CGACGGCTGC ACCGCCACCT TCTCCGCCAC CCGACTCGAC ATCCACCGGG ACACCACCCT GCTTCTTACA GGCGCTCGAG CCCCCACCAC CGGCCTCTGG CACCTCGACC TGACCCCAGC CAAGACCGCC AATGCCCTCC TTCCCGACAC CTCCCTGGCC GACCGCATCG CTTTTGTCCA TGCGTCCCTT TTCTCCCCTT CTCTCTCCAC TTGGTGCACC GCTCTCGATG CCGGGCGCCT CCCAACCTTC CCCGACATCA CGTCCAAACA AGTGCGCAAG TATCCTCCCC GCTCGATGGC AACCATCAAA GGCCACTTAG ACCAACAACG CGCAAATCTT CGCTCCACTA AGCCCTCCCC CGTTCCGCTG GTGGCCTCAC CCAACCCTCT CCACGAATCC CCGCTTGACT TCTGCCCGGC TCCGACCACT CCTCCCGCTG GCCGCACTCA CCATGTCTTC GCCGCACACC AACGAGTCAC CGGCCAAATC TACACAGATC AACCGGGCCG TTTTCTCACT CCCTCGAGTG CAGGCCACAC GGATATGCTC GTGTTGTACG ACTACGACAG TAATGCAATT CACGTTGAAC TAATGAAGAG CAAGTCCGGT GCCGAGATCC TGGCCGCCTA CCAGCGCGCC CACTCCCTCT TCACCCACCA TGGCCTCCAG CCGCAGCTCC AGCGTCTGGA CAACGAGGCA TCCACCGCTC TCCAGTCATT CATGACCGCC CACCAAGTTG ACTTCCAATT GGCGCCACCC CATTTACACC GTCGCAACGC CGCCGAACGC GCCATACGTA CCTTCAAGAA CCACTTCATA GCCGGTCTCT GCAGCACGAA CCCGGATTTT CCGCTTCATC TTTGGGATCG CCTCATTCCC CACGCTCTGC TTAGTCTCAA TCTCCTCCGC GGCTCCCGCA TCAACCCCAC CCTCTCAGCC CACGCCCAAC TCCACGGCGC GTTCGATTAC AACCGCACCC CGCTTGCCCC CCCCGGTACT CGCGTCCTCG TGCACGAAAA ACCCGCCGTC CGAGAAACTT GGGCGCCCCA TGCTGTTGAA GGCTGGTATC TTGGCCCGGC CATGAACCAT TACCGCTGCC ATCGCGTTTG GATCACCGAG ACACGTGCTG AACGCGTTGC TGACACGCTG GCATGGTTCC CCAGCAAGAT TCCCATGCCC ACCGCCTCTT CCACGGACCG CGCCCTGGCC GCCGCCCGTG ACTTAGTGTG TGCCCTCCGG AATCCCGCTC CTGCTTCACC GTTTACGCCC CTCGACGCCA ACCAGCACCA GGCCCTCACC CAACTCGCAG AACTCTTTGA GTCCGTTGCT GTCCCGGCCT CTCCCATCGC CGCACCCGCT CGAGCGCCCC CGGTCCCCGC CCCTGTCCCA GCACTTACCC CAGCACAGGT CCGCTTTGCC GTTCCCATCG TCACAGCCGA GCACGCCCCC GCACTTCCGA GGGTGCCCAC CCTTGCGCCG CCACCTCCGA GGGTGCCCCC CACGGCCACC TATCACTCTC GAACCCGAAA TCCCGGCCGC CGCCGTCGCA AAGCACGCAA GCCCCCGCCA ACCCCAACCC TAGTTCCGGC TCATCCACAC AACACCCGCA CCCGACCCTT TCTTGCCCCG CCTCCGCCAA CGCAGTCGTC GACCCCGCAA CCGGCGCCTC TTTAG
|
Protein sequence | MSPSADFTIS DFPHKVLNPI ATNTIAPSYA SLLLAQRQLS ANASAIPSLN GGGAHGHMAL TLTAAAYAEL SDVPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR QLLDAVPRVY VRDLEHPQFA YSKVTCLDLL DHLWRNFGTI SASDLKNNIQ SMYTPWNPAD PIETIFHRLT DAIAYSTAGR DPISEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA NLRTLFKVAD TDRKRTVTTG ALGYANALCA PSSAPPSIVS DTLSLPFSAL SVSHSSAATT EKTYCWTHGS SNNRRHTSAT CKNKAPGHRD DATAANPLVP SPPSITTSAI ADTGCTGHYI TVSCPHFNQQ PASSPLSVPV PNGATLRSSH TATLDLPGFS PAACQAHIFP GLASHPLISI GQLSDDGCTA TFSATRLDIH RDTTLLLTGA RAPTTGLWHL DLTPAKTANA LLPDTSLADR IAFVHASLFS PSLSTWCTAL DAGRLPTFPD ITSKQVRKYP PRSMATIKGH LDQQRANLRS TKPSPVPLVA SPNPLHESPL DFCPAPTTPP AGRTHHVFAA HQRVTGQIYT DQPGRFLTPS SAGHTDMLVL YDYDSNAIHV ELMKSKSGAE ILAAYQRAHS LFTHHGLQPQ LQRLDNEAST ALQSFMTAHQ VDFQLAPPHL HRRNAAERAI RTFKNHFIAG LCSTNPDFPL HLWDRLIPHA LLSLNLLRGS RINPTLSAHA QLHGAFDYNR TPLAPPGTRV LVHEKPAVRE TWAPHAVEGW YLGPAMNHYR CHRVWITETR AERVADTLAW FPSKIPMPTA SSTDRALAAA RDLVCALRNP APASPFTPLD ANQHQALTQL AELFESVAVP ASPIAAPARA PPVPAPVPAL TPAQVRFAVP IVTAEHAPAL PRVPTLAPPP PRVPPTATYH SRTRNPGRRR RKARKPPPTP TLVPAHPHNT RTRPFLAPPP PTQSSTPQPA PL
|
| |