Gene Information |
Locus tag | PHATRDRAFT_1785 |
Symbol | |
ID | 7196749 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2143053 |
End bp | 2145266 |
Gene Length | 2214 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177456 |
Protein GI | 219111409 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
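The Start bp, End bp and Gene Length fields above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that the listed 2214 bp length is consistent with); the function name `gene_length` is hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
length = gene_length(2143053, 2145266)

# A 433 aa protein needs 433 * 3 coding bases (stop codon not counted).
coding_bases = 433 * 3

print(length)        # 2214
print(coding_bases)  # 1299
```

The 1299 coding bases implied by the protein length are fewer than the 2214 bp gene span, which fits the record's "Is gene spliced: Yes" entry, although the exon boundaries themselves are not given here.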
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.287483 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGTATTTTCG AGTGCGATTA TTGTCATGCT GACATTTCCC AATTGCCGCG AATACGTTGT GCGGTGTGTG TCGATTTCGA TTTGTGCCTG GACTGCTTTA CTTCGACTGA CCATGCAACG GCAATAGCCC GTCTAAAGGC GGCACCGGAC CAAAACGGTA TGGCAGCCTT GTCCGCCATT CAACACGAGG CTACACACGG CTATCGTGTT TGTGATAGCA CGCGTTATCC ACTTTTTTCC ACTGCTCGCA CCCGAGTTAA CAGATTGAAT ACCAATGTCG AGAAGAATTC CGGTATGAGC AATGAGGAAC AAAGAGTTTC CGATATTAAT GTAGCATCGA GCGGGATTGA TGAAATAGGG GATGAAATGG ATGTGGATGA AGCAGCGCTT GGTTCAAACG AGGAAATTAT GGAACGGGAA CAAGATGCCA ACTCGATGGA CGTGACTGAG CTAGAGAGAA CCGCTGTTAA CGACGGCGTC CTGACTGAAG AATTAAACTC TACCGATAAA TATATCGTTT ACGATGATCC AAAATTTTTT TGGACTGTGG AAGAAGACTT ACGTTTGCTG GAAGGTATCC AGACGAACGG TTTGGGAAAT TGGGTTGAGA TCGCAGAGGC AGTCGCAGGT CAAGGATCCA TTGGTAAGAC CCCTCGTCGC TGCATGGAGC GCTACTTTGA CGATTTTCTT GGACGCTACG GCCACATTCT CCCGTCACAC ACTTTACAAG CTGAAGGTGA GGATGAAGTG GAAGAATCCG ATGCTACAAA GTATAGTGTA GAAGAATTTG ATAAGGGAGA TACCGACGAC ACTCCGTCGC GTACTTCCAA ACGTCGGGCG GTGATGATGC GTAGCCCAAG CTCAATGTCA ACCATGGCGT TCACGAGCCG CAAGAAGTAC AAGGCCATTC CAACCGAGAC TCTCGAAGGA TATGGCGAAT TTTGGGCCAA TCCATATTTA CCGCCAATTG AGGGCGTGAG GACTGGTCAA GAAGTAGGAC GTGACCATGC TTACAAGGCA GAACAGCTTT TCGTAAAAAT GAGCATGGCG ATGGATAGCA TTGAACAAGT TAAAAATTTA CACAAAGAAT GGACTGAGAC TCGTCTTTTG AAACCCGGTG GTCCTACCGT CCTTCCTCCT CGACCGGACG ATGTTGTTGG AATGACAGGT GCTGAACTCT CTGGCTTCAT GCCTAGACGG GAAGATTTCG ATGTGGAATG GGAAAATGAT GCCGAGCAGG CCGTGGCAGA TATGGAATTT CTTCCTGGCG AGCCGATAGA GGACAAGCAA CTTAAACTAC AGGTACTGGC AATCTACAAC TCTAAGCTTG ATGAACGTGA GAAGCGCAAG AAATTCGTCC TCAGTAGAAA GCTATATGAT TATCGGAAAA CCCAAACAGA ACACGAGAAG CTCCCACAAG ACGAACGTGA CCTTGTGCAT CGAATGCGTC TGTTCGAGAG ATTTCATACG CCCGAGGAAC ACAAAGAATT TCTTGCGGAT CTGCTCAAGG CGAAGCGCCT TCGCAAGGAG ATTGCAAAAC TGCAAATGTA CCGAAGACTT GGCATCCGCA CATTGCTCGA AGCGGAAAAA TACGAATTAG ACAAAGAGCG CCGGCAGTTC CACAAGACAG CCCACACACA GAAGAACACC GATGTCAGCA CACCAGATGA GAATACTGCC GCAACCTCGG CGGAGGTAAG TGGCCGTTCA GGAATGTCTC AGTCCGTAGG AACTGTTTCA TCTTCATATT GGAAGCAATA CCGCACGGGT GATCGTCGTG AGAGGAAGAG TATCAATCGG 
GGCGTGCCCT GGGCAGACAG CCAAGAGACG AGCAATAATT TGAAAAATAA TTCGGCTGAT GGGTCTAGCA AAGTAGTAGA CACTAGAAGA GATGATGGGG ATATGGATGC TGTGCAGCCA GTAGAAGATA CAATTGCGAT ACAGGCCAAA CTCGAAGTTT CTTCGAGGGC AACCAAGGAA GACGACTTTG CACATTTGCC TGGTTACAAT CTGCTTTCCT CTCGTGAAGT GTTGTTGTGC CAACGCACAA GGCTAACGCC AGAACAATAT TTGGAGGTAA AGAACGTGCT GATTCAAGAG TCACTGCTTA AGGGGCTTCT GGATAGGGAG GGTCCCGGAT CTAGCAAAAG AGCGTTGGTA CGGATCGACG TAGAGCGACG GGGCGACGTA ATAGACTTTT TAGTTCGGGC CGGC
|
Protein sequence | GIFECDYCHA DISQLPRIRC AVCVDFDLCL DCFTSTDHAT AIAQLNSTDK YIVYDDPKFF WTVEEDLRLL EGIQTNGLGN WVEIAEAVAG QGSIGKTPRR CMERYFDDFL GRYGHILPSH TLQAEGEDEV EESDATKYSV EEFDKGDTDD TPSRTSKRRA VMMRSPSSMS TMATGAELSG FMPRREDFDV EWENDAEQAV ADMEFLPGEP IEDKQLKLQV LAIYNSKLDE REKRKKFVLS RKLYDYRKTQ TEHEKLPQDE RDLVHRMRLF ERFHTPEEHK EFLADLLKAK RLRKEIAKLQ MYRRLGIRTL LEAEKYELDK ERRQRDDGDM DAVQPVEDTI AIQAKLEVSS RATKEDDFAH LPGYNLLSSR EVLLCQRTRL TPEQYLEVKN VLIQESLLKG LLDREGPGSS KRALVRIDVE RRGDVIDFLV RAG
|
| |
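The gene sequence above is displayed in space-separated blocks of ten bases, so it needs to be joined before any per-base statistic is computed. The following is a minimal sketch of a GC fraction calculation on such block-formatted text; `gc_fraction` is a hypothetical helper, and the method the database used to derive its 48% GC content figure is not stated in the record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a short demonstration.
first_blocks = "GGTATTTTCG AGTGCGATTA"
print(round(gc_fraction(first_blocks), 2))  # 0.4
```

Passing the full gene sequence text to the same function gives a whole-gene value that can be compared against the GC content field.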