Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40300 |
Symbol | |
ID | 7198230 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 55851 |
End bp | 58053 |
Gene Length | 2203 bp |
Protein Length | 707 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184294 |
Protein GI | 219128175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGATAT CTCCGCAGGA AAGAGAGGCC CGAAGGGGAG TGCAAAGGGA GAATTCGCTT GGCCTCATCG ACATCTTCGG GGATCGGGCG GTCGTCGACC CCTACGCCAC GAGTAACCAG GAATTTTCCA AGTTGGCATC CAAAAACCCC ACCGAAAGGA GGTCAAACAA TTTACAGGGC TCTAAGTCGC TGCACGATTC TCGCACCAGC GCCAGTCTAG GTCCAAATCA TCGACGGCGA CAAATGCAGG TGGAATCGGA TGCAATTATG TCTGGGGCGG AAACGGAGGA CACTTGTCGT TCTTCGGCAA GCCGGATACG ACGAAGTGGA GAGAGGGTTC GGCGCAGTAA AGGGCGGGAA CGGAAAGCTG GAGTCAGCCG CACACGATCG TCCGGTACGC CGCGTTCTCT AGAGGCAGGT CTAAAGACTA CCCTTTCCCA CGAGAGCAAT CAGGATAGTG CACAAGCGGC TGACGATGAA ATGCTGCTGG AATTGGTTGA ACATCGCACT AAGGAACGCA GTCGTACTGG CTTGGTAAAG GTAACGCAAC GTACACTTTC CGTGCGAGGC GAAAGTAGGA GTGCACGAGG TTATAGATCG TCCGAACTAG CACCAGTGCC TCGTTCCAAA AGTTCCCGAG TCGAATCTAC CGACAAGTCC AGCGCCAAAG AATCGGTCCG GCTCCGAAAA ATTCCAGGTG AGCCTACTTC GACGTCGACT AGATATTCCT CAGGACGTAG AACACAGTCC GCCCGAGGTA CACGCAGTAG CAATTTATCG TGTGAGTTGA CGTCACACCT GCAGGGAAAA GCGAGAGTCT CAAAAAGGGG ACCTTCCTCA GTGACTGGAC ATCGCCGTCG TACCGACTCT AACGATTCAG CACGGTCGCA CGAGAAACCT CGTCGGTCAC GAACTCGAAG GGAACACACG CATGCTTATG CAGAAAAAAC GGGAGCTGGC AAAGGGGAAA GCGCTTCTCC TCGTCGTCGA CACCGCTCAC ACACTCATTC CCCTGTACGA CCTCCGAAGT CACCCCGTCG CCATCGACAG CTTTCTCCCC GAAAAGTCTT GGACTCGCCC AGTCTAGCCG GAGACGAAGA AAAATTTAGA CCTAGGCTCA CTTTGAAAGC CCCTTCATTG AAGGATTTAT CTTTGGATGA GCAATCCCCG AGATCACTTA CTGAAGCAGA AGGGGCATGG CTAGAGAATG CTAGCAAGGG AGACAACAGC ATGAAGCAGA AGAGTCAGGC TAGTTCCGAA TACGAATCCG AAGAAGAAAG TACCGCTGGA AGGAATAGCA ATGGCTCCGT GTTACAATTT GATCCTAGCC AATCGGATAA TGTATACCGT GTCAAGCAGG TCACAAGGAT GGATTCGGGG CTGCAAATCG GAGAGAACGG TCAGAGTGTA GACGTTTCGA TTGCGCAATT ATGCGATCCT TTGGGTGACG CCAAACCAGT TCTACGGGCT AGAAAAGAAT TGCCAATATT TGACTTTTTG GAAAGTAACG ATGATGCCAT GATGACATTT ACCGATACGG ACCCGATGGA AGTCAGCCAT AAAGCCGTTG GTCTTGACAG TCCTTCAGGC GGCGAGAATA GTGACAGTGA TGGTATTGAT GTGAATAAAT TTCAGATGCA TGTGATGAGT CCGGGCGTCT ATCATTCTAG CGACGAAGAT AGACGCGAGG CTCGGGCTCT TCGACCTCAC ACCAATCTTA CGTTTCGTAG CTCCGGAGGT GAAAGAGGAG CCGATGAAGC TATTTTGATG GCTAGTCCGA ACAAAGAATT ACCTATTTAC CCATCCGGAG GTGAAGAAGA GGAAGAGGAT GCAGTTCGGG AAGAAGATAA GATGGATTAT ATGGTGTCCA GGAAAAGAGA TCTGGGTCTG CCAAAAACAA TGGACTGCGA AAGCGAATCG TTCGACTTTG TGTCGTACGG AAGTGACTAC GCCAGCGAAA CGGACCGGTC ATCCCGCGGC ATCCGTAGCA AAAGTCGAGC TAAATCGCAG CCATCTGTGC TGGGATTGTC TTCTGGTGGC CAAGGGGACG AAGTCGGCAA AGTGCGGGTA AAAAAGCAGG GCCAAAAGGG GCGCTCAAGA CAGTCCGCGG TGAGAAAGAA ATCAAATGAC GGCTCTCCAG CGCGTCCATC CGACACTCAA GCGGACGTCG ACGACACTCG AAAACGCCGA CCTCGGAAGT TGA
|
Protein sequence | MTISPQEREA RRGVQRENSL GLIDIFGDRA VVDPYATSNQ EFSKLASKNP TERRSNNLQG SKSLHDSRTS ASLGPNHRRR QMQVESDAIM SGAETEDTCR SSASRIRRSG ERVRRSKGRE RKAGVSRTRS SGTPRSLEAG LKTTLSHESN QDSAQAADDE MLLELVEHRT KERSRTGLVK VTQRTLSVRG ESRSARGYRS SELAPVPRSK SSRVESTDKS SAKESVRLRK IPDIPQDVEH SPPEVHAVAI YRGKARVSKR GPSSVTGHRR RTDSNDSARS HEKPRRSRTR REHTHAYAEK TGAGKGESAS PRRRHRSHTH SPVRPPKSPR RHRQLSPRKV LDSPSLAGDE EKFRPRLTLK APSLKDLSLD EQSPRSLTEA EGAWLENASK GDNSMKQKSQ ASSEYESEEE STAGRNSNGS VLQFDPSQSD NVYRVKQVTR MDSGLQIGEN GQSVDVSIAQ LCDPLGDAKP VLRARKELPI FDFLESNDDA MMTFTDTDPM EVSHKAVGLD SPSGGENSDS DGIDVNKFQM HVMSPGVYHS SDEDRREARA LRPHTNLTFR SSGGERGADE AILMASPNKE LPIYPSGGEE EEEDAVREED KMDYMVSRKR DLGLPKTMDC ESESFDFVSY GSDYASETDR SSRGIRSKSR AKSQPSVLGL SSGGQGDEVG KVRVKKQGQK GRSRQSARVH PTLKRTSTTL ENADLGS
|
| |