Gene Information |
Locus tag | PHATRDRAFT_42576 |
Symbol | |
ID | 7195957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 481276 |
End bp | 482687 |
Gene Length | 1412 bp |
Protein Length | 461 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176591 |
Protein GI | 219109674 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.536498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCACAGAC TCGAAGGAGA GCGTCAATGG ACGAAATGCC CAACTTCACT CGTTTCACGC CGGATCAGCA ATGCTCCCCA CACATCGCAA GGAGTAACGG GCGCGAGCTC TCTGAGGATG CTGTTTTCCA ACGCATTCCT CGTACGCTTA CCCCTCCGCA AACACCTCGA TCGTTCGCCA TCGAAGACGA GTTTTCTTTG ACAACCAAGC TGCGTCCCTG GCGTAAACCG TTGGATAAGC CGAAGCGCCC ACTGAGCGCG TACAATCTAT TTTTCCAACT AGCGAGACAG CGACTTATTT CGGATACACC GAGCAACCTT CCCTTTACGG CAAAGGATGT TGAACTTATC AGTATGAAGC ACAAACAAAA AAAGGAAAAA CGTCGTCATC GTAAGACACA TGGTAAAATT AGCTTTGCCG ATCTTGCGCG GTCGATTGCC TCGCAATGGA AAGAACTGAG CGACGACGAC AAAGTGATCT TCGAAGAACG CGCAGAAATG GAAAAATGTC GTTACAAGCA AGAGTTAAGC GAGTGGAGCG CGAAGCAAGA GCCAAGCGCG GAACGAAAGG CGGCCATGTT GCGCAAGGTG TCTCTCAAAC AAGGCTCAAG CTTTTCAATG GCTACGACTG CGGAACAACT ATCGAGCACC AGCAGAGCTC CTGATCACGG AAACCCCATC CGCTCTACCA ATTCTTTTGG AATTTCCAGT CCACCACAAC GGCCTCCGTA CGATTTGGAC ATGTACACAG CAAGCCACGA GGCAGCGATA CAGGAAGAAG CATCGCTTAA CGCACTCATC GCACACCAAG GTCAATCGCT GGCTCGCTAT CAAGCCATGA TGGAGCAGCA AATGGACCAA CATGCTTCCA TGCACCCGAT GTCTTCTATG CGGGGACTTC CCTCTAGCAA TTACAACGAT CGTCTCCATC GCAATTCATA TAGACCAGCC CATCCTAATC CGATGAACGG CACATTCGGG AGCGGAGGCA AAACTAACCT TCACTATAAT GATGGGACGC CTGATTGCTA CGACATGGCT CAGCAATGCT ACAGTATGGC TCACCGACAG CAGCAGCAAA ACTGCAGGCA CTATTCTAAA CAACAACGAC ATGGACCGCC GCCCAGTGAT ATGCACTCCG GGATACCGAG TAACACAACC GAAGTCATGC TAGCGCGTGC GAGAACTACA ATGCAACCAC ACATGGCACC GTCGAGCTCT GACAGCTACT GGACCATGGA CCGGTCGACC GAACACCGTC GACTCATACC ACCAAACTCT GCAGGCTATC GGCATTCGCG CTCGGCGCAG GATGAGAGCA CGCTAATGTT GGACCCGCAT CAAGGTCTTT CGTCGCCGCT ATTGGGGACA GCGGACGAAC ACCTGGATCC GTTTGCGAAC GTGTACATCT GA
|
Protein sequence | MDEMPNFTRF TPDQQCSPHI ARSNGRELSE DAVFQRIPRT LTPPQTPRSF AIEDEFSLTT KLRPWRKPLD KPKRPLSAYN LFFQLARQRL ISDTPSNLPF TAKDVELISM KHKQKKEKRR HRKTHGKISF ADLARSIASQ WKELSDDDKV IFEERAEMEK CRYKQELSEW SAKQEPSAER KAAMLRKVSL KQGSSFSMAT TAEQLSSTSR APDHGNPIRS TNSFGISSPP QRPPYDLDMY TASHEAAIQE EASLNALIAH QGQSLARYQA MMEQQMDQHA SMHPMSSMRG LPSSNYNDRL HRNSYRPAHP NPMNGTFGSG GKTNLHYNDG TPDCYDMAQQ CYSMAHRQQQ QNCRHYSKQQ RHGPPPSDMH SGIPSNTTEV MLARARTTMQ PHMAPSSSDS YWTMDRSTEH RRLIPPNSAG YRHSRSAQDE STLMLDPHQG LSSPLLGTAD EHLDPFANVY I
|
| |
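The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the function names `interval_length` and `cds_length` are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends with a single stop codon. Under those assumptions, the reported gene length exceeds the coding length implied by the protein, and the difference matches the position of the first ATG in the gene sequence shown.

```python
# Sketch only: helper names are illustrative, not part of any database API.

def interval_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Values copied from the record above.
gene_len = interval_length(481276, 482687)   # Start bp, End bp
cds_len = cds_length(461)                    # Protein Length in aa
leading_bases = gene_len - cds_len           # bases not accounted for by the CDS

# First 40 bases of the gene sequence field, with spaces removed.
prefix = "TTTCACAGACTCGAAGGAGAGCGTCAATGGACGAAATGCC"
first_atg = prefix.find("ATG")               # 0-based offset of the first ATG

print(gene_len, cds_len, leading_bases, first_atg)
```

If the assumptions hold, the offset of the first ATG equals the number of leading bases, which suggests the listed gene sequence begins a short distance upstream of the start codon rather than at it.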