Gene Information |
Locus tag | PHATRDRAFT_43022 |
Symbol | |
ID | 7196231 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1817070 |
End bp | 1819640 |
Gene Length | 2571 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177379 |
Protein GI | 219111255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
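The Start bp, End bp, and Gene Length fields above agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 1817070, 1819640
protein_length_aa = 623

gene_length = span_length(start_bp, end_bp)

# Nucleotides needed to encode the protein plus one stop codon.
coding_nt = (protein_length_aa + 1) * 3

print(gene_length)  # 2571
print(coding_nt)    # 1872
```

The coding requirement (1872 nt) is smaller than the gene span (2571 bp), which fits the "Is gene spliced: Yes" field. The record itself does not say how the remaining bases divide between introns and untranslated sequence.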
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.599773 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAATTAGCCC AACGATGCCC AACTGTGCAT GGTGGACAGC AGTTCTGGCC TCTATAATAG TTGCCACTAT ATATGCCACT CTCTATGCGA CGGAATCTCG ACAAAGCCTG CGTTTCGGTT TGCACTTCGA CAATGGCGAT GAAGAGAATT GGGACCCCAG GAACAGTCCC GTTGTGTCCA TTGTTCCACA ATTTTTGCTG AATCATAGTG TGTATACTAC GTCCGATTTC CGCAACAAGC AGGGCATTGC TCCCCCCTTC TGGGGATGCA AAAATGAGAG CTGTAACGCT TCGGGGGTAT GGGGACCTTG TTTCTGGCCA CACAAAAAGA TTCGATGGGT AGGCGAAGTT GAACGAATTG GTCGCACAAA TAGACCACAG TACCAAAACG GCCCGATTGA CCCAACACTT ACGGACGATC TTTCTGGTCT TTGTCGACCA GGTTTTTTGA TTATCGGAGC CGGGAAATGT GGGACCAGGT ATGTGCTTCG AGAACAGCGA TCCACAGACA ATGCTTTTGC TTCGGCTGAA CCATGGATAA TCTTTTTTCG ATGGGTAGCT CGCTTTATCA CTATTTGACC GACCACCCTC GCGTTTTACC GGCAAAAGAA AAGCAGATCC ACTATTTTAA GGTGAGCCTC GGTGTTCGTG GCAATGTCCC GACCCGCAAG CCGATTTGAA TGGCGTTCTA ATTCCGTCAC TCTTTTTAAT ATGAAGTATT ACATGCAGTA TCCCATGCAA TGGTATCTTC GCCACTTTCC GACAGCCAGC AGCTTTCTAG CTGCTGGTGC TTTGATGTCT GGCGAAGCAA GCCCAGGTTA CTTACCATAT CCCGATGTTG CGTATCGCAT CAAGAAGCAC ATGCCTGGAC CACGTATCGT TGCTGTTGGG CGAAACCCAT TGGAGCGTGC GTACTCGTCG TATATGTACA ACTATGTCGC ACCTGTTATG AAGATGATGC GCAAAGGCAG GGTGCCAAAC ATTTCAAGAG GACTGACGGA TAAAGAGTAC GAAGACCATT TGTTTTCTTT TGAAGAAATG GCGCGAGCAG AGCTTTTCGT CTTGCGGGAC TGCTTAAGTT CAGACGGAAC CGGCGTCAGA AAGGCAAAGG ATCGGTACAG CTCTCTTAAC TGGGCTGCTG CAGAATACAG ACGTCGAGAG AAAGGAGGAC TTCCTCCTTT GATTGATCTG GAAAGCTTCT GTTACGGTGA TTGGGTGGAC AAGGTAGTCC CTCGCAAGCA ATGGAAGGAC TTGATAGAAA ACAACCCGGA CAAAGTGATT TACCGTAAAA ATGTGCACCT CACCCAGTCA TTTATTGGTC GAAGTCTGTA TGTTCTCCCA TTAGAGTGGT GGTATGCTCT GTACTCCGAG AAGGAAATAA TATTTATTTG TACCGAAGCG ATGAGTGACT TTTCGGGTAC GCCGATGAAT AAGCTCGCGG AATTTTTGGG GTTACCTCCG CACAACTTTT CAACCATTGT AAGCAAAGGA GCATACAATG TTGGAGGACA CCGAGGATAT GATAAGGAAA TTTCCTGGGA TGAGATCAAA GATGAGTTGA ATGTCACTGA GAGGCCAAAG TACGAGGCCA CTCTCTCGGA AGACTTTCTT CGCGAGGTCA AGGCTTTTAT CGAGCCATAC AATGAGCGTC TTTTTAAATT GATTGGCCAT CGCTGCGAAT GGTAGCATTT CATGTTGCCT AAAAAGTAGT ATTCTCCCGT TGTACACATA GTGCAGATGT AACGGAAACT TTGACGGTTT ACGCTGGTAC AGTCTAGTTA GCTTCCTTCG CAGCCTTCCT 
TGCTTTCCTC CGCTCATCCG ATAACGCCTT CTTTTCTTCT TTTGTTAATT TTTTCTCAAA AAGTCCCTGT ACTTCTACTT TCGGCAGCTC CTCTTCGTCG GCAATCTTAG GACGGGGTTT CGTGGGCGGG GGCGATGAGC CAATAGCGAA CGAATCGTCA ATCTGTACGG CCACTCCAAC AGCTCCACCC GACCATCCCA ACATCCTAGA ATCACAAAAC ATTCCTTCAG ACGTTACTCC TCCGACGGCG GTCTTTTTTA CTACCATCTC TTCGCCCTCA TCGGTAAGTA CTTTCGACCC GACTGGTGCA ATGGCGACCC TGAAGCAATT CAAGAGAGGT CAAATGAGTT TCTGCGTTTC TTTGGAAAAC GGTCCATGTG TCACCGATCC AAAGTTGAGC AACAATACCT GTTCCCTTCC CTGACATTTG CCGCCGATGT CACCACTGTA ATAGGGTTGC TTTCGTCGCC GAGATTTACT TGGCACGCTT TAAAGGATTT TCCCGATTTT CCGCCGCAGT CATTTATTTT AAGCACTAAG CCAACTTTAT ATTCTGTATG ATATACCATG ACAACAAGAT ATTTTATATT GGTGAAGGGT AAACGTGAAT GCAGCTCGAT AGACCAACGC AAGTCTGAGA ACAAGAATTG CTCCAACAAT CTCGGTACAA AAATGATTTT TCGTGTCGAG ACCTATTTTC CGTCTGGCTG GCGGTCGACA CGTCGAAAAC ACACAGGTTT GGAGAATTTG ATTGACTGTT GTGATGCATG A
|
Protein sequence | MPNCAWWTAV LASIIVATIY ATLYATESRQ SLRFGLHFDN GDEENWDPRN SPVVSIVPQF LLNHSVYTTS DFRNKQGIAP PFWGCKNESC NASGVWGPCF WPHKKIRWVG EVERIGRTNR PQYQNGPIDP TLTDDLSGLC RPGFLIIGAG KCGTSSLYHY LTDHPRVLPA KEKQIHYFKY YMQYPMQWYL RHFPTASSFL AAGALMSGEA SPGYLPYPDV AYRIKKHMPG PRIVAVGRNP LERAYSSYMY NYVAPVMKMM RKGRVPNISR GLTDKEYEDH LFSFEEMARA ELFVLRDCLS SDGTGVRKAK DRYSSLNWAA AEYRRREKGG LPPLIDLESF CYGDWVDKVV PRKQWKDLIE NNPDKVIYRK NVHLTQSFIG RSLYVLPLEW WYALYSEKEI IFICTEAMSD FSGTPMNKLA EFLGLPPHNF STIVSKGAYN VGGHRGYDKE ISWDEIKDEL NVTERPKYEA TLSEDFLREV KAFIEPYNER LFKLIGHRCE CSSSSAILGR GFVGGGDEPI ANESSICTAT PTAPPDHPNI LESQNIPSDV TPPTAVFFTT ISSPSSGKRE CSSIDQRKSE NKNCSNNLGT KMIFRVETYF PSGWRSTRRK HTGLENLIDC CDA
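As a quick consistency check between the two sequence fields, the start of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code (the Translation table field of this record is empty) and only covers the first 50 bases, because the gene is annotated as spliced and a straight translation would not be expected to hold across intron boundaries. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first ATG, stopping at a stop codon or an incomplete codon."""
    start = dna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five 10-base groups of the Gene sequence field, with spaces removed.
gene_prefix = "CAATTAGCCC AACGATGCCC AACTGTGCAT GGTGGACAGC AGTTCTGGCC".replace(" ", "")

print(translate(gene_prefix))  # MPNCAWWTAVLA
```

The result matches the first twelve residues of the Protein sequence field (MPNCAWWTAV LA), which suggests the gene sequence is displayed in coding orientation even though the Strand field is "-".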