Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39361 |
Symbol | |
ID | 7195111 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 284536 |
End bp | 286253 |
Gene Length | 1718 bp |
Protein Length | 561 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183459 |
Protein GI | 219126427 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00105183 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGACT TGAACGCCGA TTTGCTGCTG AATGTTGTAC AGTATCTGTG CGCACGTGAC GGATGTCGTT TGGCTGCGAC GGCATCTCGT TACTTCTACC TAGTGCATCA CTACCGTCAG TTGCGGGGAG CGGAACTGGT TGCTGCTACG TCTTACATTC CACAGGTCAA ATCGCGTCAA CAAACACCGC GAGACGTTTA CGAACAAGCC TTATCGAAGC TCCAATCGAA ACCAAACCTT TGCTTAGCCT TCAACAAGCC GAGCGGTCCT TTGGCAGATC ACTTGCCACT GTTAGTGCCA GATGATTGCG TCATCTTGGG GGCAATTTCC AGTTCCATCC AGTCCTCGTT TACGGGACAG TTGGAATATA AATCAAATTC TTCTATCATG TTAGGTTCCA TGAAAGACGC ACAGATTCGC CCCTTTTGTC TTCAACAGCA CGAACAATCA GGCGACTACC AGGGGACACT AGACCAGTTG GATTCTGCAA ACTGTAGTTG GAAAGTATTT ATGGTGTATG CGTGCGGTGA CTTGGCGGCC GACGTAGAAC CGTTTGTCAC GCATCTGCAA AATCGCTATC CTAACAGTAC GATTGTGGGA GGCATTTGCA ATTCCGGCTA TATTTCTGTA CCAATTCAGC AAAAGACGAA GAGTGAATTG ACCTCGATGC CTTTCCACAG TCTTTTGCAT TTGAATCAAC GCCTTGGGGG TCGAACACCT GATGAGGGCA TTACAAAGGT CGAGCTTGTG CATCACGTAC ATGGTGTGAT GCAGACCAAA CGATTTCAGT TGAAGGTGAT GGAGGACGAA GGCGGTGTCT TTGGGGTAGC ACTGGCAGGA GATGTACCGG TCCGATCAAT GGTATCTAGA GGCGTCACGT CGTTGACATA TAGAGGTGCC CCGCTGCCGA CCACTCCATT TTATGCGGAC GTTGTTGAAT TTCATCGTCC AGGAGACGAG GAATATATGT TTCAGGGTGA AGATCCTCCG TCATATCACT TGATACGCCG CGTTCGGGAT ACAGACTCGG GAAAAACGTA CTCGGCGCAC GAAATGATGA TGAAATTTGG ATCTCCCAAT TTGATTGGGA TTCGCCGTCC GAATGAGGAT GGATTCGAGC TTCATATAGC AGCGGTAAGT CGATCGCTGA TATGATTCTC TTTTAGTCGG AAGTGATGGA CCTCAACTTT TATTTCATTG ATCGACAGAA TGACATCTCT CGAAGCTTAA ATGCGTTTCT GTTTATGGTT AGAGGCTCCC CCGATACAGA AAAAGCTCTA ACGGACGCCA ACATTGACTT CTTTGACATT GACGGGGCTG CCTGCATGCA AGATATGGAA GTTTCCATTC GCCATCTGAA AGAACAAACC CATGGCGAAC AATTGTTGGG TGCTGTCATG TTCAGCTGCA GTGCCCGGGG CCCGACGGCT GGCAATTTGC TGTCGGTGGA CATGGCAGAC GCCACCTCGT TTGCCAACGG CTTCCCAAAC GTCCCATGTT TGGGATTCTA TGCTGGCGGA GAAATTGGAC CGGTTGCTCG TGCAGGGCGT CAAGACGTTT TTCGTAGTGG AAATGCCACA TTGCAGGGCT TTACCGCTGT TTTTGCTCTA TTCATTGTAC CCGAAATCGA CCTCGGCACG ATGATTCTGG ATGACAGGCG CGAAAATGTT GAGGCGTTCT TACGAGGTCG CTTGGCCACG GGAATTCATG ACATTTAA
|
Protein sequence | MDDLNADLLL NVVQYLCARD GCRLAATASR YFYLVHHYRQ LRGAELVAAT SYIPQVKSRQ QTPRDVYEQA LSKLQSKPNL CLAFNKPSGP LADHLPLLVP DDCVILGAIS SSIQSSFTGQ LEYKSNSSIM LGSMKDAQIR PFCLQQHEQS GDYQGTLDQL DSANCSWKVF MVYACGDLAA DVEPFVTHLQ NRYPNSTIVG GICNSGYISV PIQQKTKSEL TSMPFHSLLH LNQRLGGRTP DEGITKVELV HHVHGVMQTK RFQLKVMEDE GGVFGVALAG DVPVRSMVSR GVTSLTYRGA PLPTTPFYAD VVEFHRPGDE EYMFQGEDPP SYHLIRRVRD TDSGKTYSAH EMMMKFGSPN LIGIRRPNED GFELHIAASE VMDLNFYFID RQNDISRSLN AFLFMVRGSP DTEKALTDAN IDFFDIDGAA CMQDMEVSIR HLKEQTHGEQ LLGAVMFSCS ARGPTAGNLL SVDMADATSF ANGFPNVPCL GFYAGGEIGP VARAGRQDVF RSGNATLQGF TAVFALFIVP EIDLGTMILD DRRENVEAFL RGRLATGIHD I
|
| |