Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_8670 |
Symbol | |
ID | 7196142 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1044868 |
End bp | 1047130 |
Gene Length | 2263 bp |
Protein Length | 726 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177209 |
Protein GI | 219110915 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCG CATCCGTCGA CGTGGCGAAC CCGCTCCTCC AGCAGGAGGA TTTGCCCAAA TTCGCCTCCA TTCAGCCCAC CGATCTGACA CCTGCCGTGG AAGATCTCTT GTCCAAGATG AATCAGGATT TTGATTCCTT CGAGACAAAA CTGACGCAAG CATCCGGTAG TATGGAATTT GAAGACGTCT TGCCGGAACT GGAACGCATG CAGTTCGGCC TCGGCTACGC CTGGGGCGTC GCCGGACACC TGAACGGTGT CAAGAATGGA GACGAACTCC GGCAAGCGTA CGAGGCCAAT CAGCCCAAAA TCGTGGAAGC CATGAGTAAA TTCCGTCAGA GCAAACCGGT GTACGACGCG CTCTCGGCCA TTGACGAAAA GATCAAAAGT ACGGATGACG CTTCCTTTGC ACTGTCCCAG AGACGTCGGG CTGTCGAGAG TTCGCTACGA TCCATGACAC TGGGTGGTGT CGGGTTGGAG GGTGACGACA AGAAACGTTT CAACGAAATC AAAATGAAAC TGGCGTCGCT GAGTACAACC TTTTCCAACA ACGTCTTGGA CGAAACCAAG GCGTTCAGCG TCACGATCGA AGACGGCAGT AAGCTGGAAG GCGTGCCCGA TTCCGCCAAG GCCATGTGGG CCCAAGCGCA CATCAATTCA CTGAAATCCA AAGATGGCAA GGAAGATGAG GAGGTCCCGG AAATGGACGC CAACAAAGGA CCTTGGCGCA TTACTCTCGA CATGCCGTCC TACATTGCCG TCATGAGTCA CCTTCCGGAC AGGGCGTTAC GCGAACAGGT CTACAAGGCT AGCATTCAGC GGGCGTCGGA GCAGAGTAAC GACAAGAATA ATGTACCACT CTTGTACGAG ATTCTCAAAC TGAAGCAAGA AACTGCCAAG CTTCTGGGCT TTGACAACTA CGCCCAACTC AGTTTGTCTT CCAAAATGGC ACCGTCGGTG GAAGCTGTCC GTGAGTTATC CGATCTGATT GCGGAAAAAG CCTTGCCAGC GGCCGAAAAG GAGCTTGCTG AAATCACGGC GTTGGCGCGG GAAAAGGGTG GCGAGGAATA CAGCACGGAG AACCTCGACA AGTTGATGCC GTGGGATTCC ACCTTTTGGA GCGAACGGTT GAAAGAATCG AAGTTCAATT TGACCGAAGA AGAAACTCGT CCGTTCTTTG CCTTGCCCTC TGTTCTGGAT GGTATGTTTC AGCTGGTTGA GCGTATTTTC AACATTGAAG TCAAGAAAGC GGACGGCGAC GCCGAAGTTT GGAACAAGGA CGTTTCTTTT TTCAAGGTGT ACGACGCCGA TTCCGGCAAG CACATTGCCA GTTTCTTTTT GGACCCATAC TCGCGTCCTG AAGACAAACG CGGCGGTGCC TGGATGGATG TGTGTGTCGG TAAGTCGGAA GCCGTACAAC GCGACGTGCC TGTAGCCTAC CTGACCTGCA ACGGGTCTCC ACCGGTCGGT AGCACGCCGT CTCTCATGAC ATTTCGTGAG GTCGAAACCT TGTTTCACGA ATTCGGACAC GGTCTGCAGC ACATGTAAGT GGACAATGTG TGTAAGGAAA TTTTTGGAAA AGAAGGTGTT CCATGTCTAA CGCGTTGCAC CTGCTTGCTA CGTCAGGTTA ACCACTGCCT CAGTGGGAGA TGTGGCTGGC ATCAACGGCG TCGAATGGGA CGCCGTCGAA TTGCCTTCAC AGTTCATGGA AAACTGGTGT TACGATAGAC CCACTATTTA CGGCTTTGCC AAGCACTGGA AGACGAATGA GCCGATGCCC GAGGAAATGT TCAATAAGCT TTGTGAACAA AAGACGTTCA ATGCTGGTAT GATGTCCTGT CGTCAGTTGC TGTTTGGTCA ATTGGACATG GAGTTGCACT CCAACTTCGA CCCGGAGGCT GGCGAAAGTG GAAAAGGCGA ATCCGTCTTT GACGTCCACC GCCGTATGGC GGCCAAGTAC ACACCGTACA GTGAACCGTT ACCCGAGGAT CGTTTCTTGT GCACCTTTCA GCATATCTTT GCCGGTGGTT ACAGCGCGGG ATACTATAGC TACAAGTGGG CCGAAGTCAT GTCGGCCGAC GCCTTCTCGG CGTTCGAAGA AGTTGGCCTT GACAACGAAG AAGAAGTTAA GAAAGTGGGA CGAAAGTTCC GGGATACAGT CTTGAGTTTG GGAGGAGGTG TCGATCCCAT GCAAGTGTTC AAACAATTCC GCGGTCGTGA ACCAACACCA GATGCTCTCC TACGCCACAA TGGGTTGGTT TAA
|
Protein sequence | MSTASVDVAN PLLQQEDLPK FASIQPTDLT PAVEDLLSKM NQDFDSFETK LTQASGSMEF EDVLPELERM QFGLGYAWGV AGHLNGVKNG DELRQAYEAN QPKIVEAMSK FRQSKPVYDA LSAIDEKIKS TDDASFALSQ RRRAVESSLR SMTLGGVGLE GDDKKRFNEI KMKLASLSTT FSNNVLDETK AFSVTIEDGS KLEGVPDSAK AMWAQAHINS LKSKDGKEDE EVPEMDANKG PWRITLDMPS YIAVMSHLPD RALREQVYKA SIQRASEQSN DKNNVPLLYE ILKLKQETAK LLGFDNYAQL SLSSKMAPSV EAVRELSDLI AEKALPAAEK ELAEITALAR EKGGEEYSTE NLDKLMPWDS TFWSERLKES KFNLTEEETR PFFALPSVLD GMFQLVERIF NIEVKKADGD AEVWNKDVSF FKVYDADSGK HIASFFLDPY SRPEDKRGGA WMDVCVGKSE AVQRDVPVAY LTCNGSPPVG STPSLMTFRE VETLFHEFGH GLQHMLTTAS VGDVAGINGV EWDAVELPSQ FMENWCYDRP TIYGFAKHWK TNEPMPEEMF NKLCEQKTFN AGMMSCRQLL FGQLDMELHS NFDPEAGESG KGESVFDVHR RMAAKYTPYS EPLPEDRFLC TFQHIFAGGY SAGYYSYKWA EVMSADAFSA FEEVGLDNEE EVKKVGRKFR DTVLSLGGGV DPMQVFKQFR GREPTPDALL RHNGLV
|
| |